Telecom Customer Churn Prediction

Churn rate, also called attrition rate, measures the number of individuals or units leaving a group over a specified period. The term is used in many contexts, including business, human resources, and IT. Most commonly, it refers to the proportion of contractual (or subscribed) customers who end their relationship with a company in a given timeframe, so it is primarily associated with subscription-based businesses. Predicting future churn helps the business understand its expected revenue. In addition, when churn prediction is applied to an individual customer, it lets us target that customer and try to prevent them from cancelling their subscription. And since research suggests that acquiring a new customer costs 5 to 6 times more than keeping an existing one, there is a strong revenue-based reason to do everything in our power to keep existing customers.
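As a small worked example of the definition above, the sketch below computes a churn rate for a single period. The function name `churn_rate` and the customer counts are hypothetical and are not taken from the dataset used in this notebook.

```python
def churn_rate(customers_at_start: int, customers_lost: int) -> float:
    """Share of the customers present at the start of a period who left during it."""
    if customers_at_start <= 0:
        raise ValueError("customers_at_start must be positive")
    return customers_lost / customers_at_start


# Hypothetical figures for one month: 2000 subscribers, 80 cancellations
rate = churn_rate(customers_at_start=2000, customers_lost=80)
print(f"Monthly churn rate: {rate:.1%}")
```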

import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib
from matplotlib import pyplot as plt
%matplotlib inline

print('Numpy Version', np.__version__)
print('Pandas Version', pd.__version__)
print('Seaborn Version', sns.__version__)
print('Matplotlib Version', matplotlib.__version__)

The DTH company has collected data on customer purchases across various account segments. The data includes the tenure of the selected plan, the city tier where the plan was selected, the payment method used, gender, marital status of the customer, revenue generated per month, the type of login device used for the account, and other factors.

churn_data = pd.read_excel("Customer Churn Data.xlsx")

churn_data.head()

1. The dataset initially had 11260 rows and 19 columns. We removed the AccountID column since it is not useful for prediction.
2. The data has 5 float variables, 1 integer variable and 12 object variables.
3. We renamed values in columns with naming inconsistencies, namely the Gender and account_segment variables.
4. The Gender column contained Male, Female, F and M values, so we converted F and M to Female and Male respectively.
5. For the account_segment variable, we renamed Regular + to Regular_Plus and Super + to Super_Plus.
6. There are around 259 duplicate records in the dataset, which we removed.
7. There are many special characters in every column, as well as around 3616 missing values. We replaced the special characters with null values. To impute the nulls, we first look at the distribution of each numerical variable and then decide on an appropriate imputation method.
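Step 7 above (turning special characters into null values) can be sketched as follows. This is a minimal illustration on a toy frame, not the notebook's actual cleaning code: the list of symbols in `placeholders` and the toy values are assumptions, and the real data may contain different stray characters.

```python
import numpy as np
import pandas as pd

# Toy frame standing in for the raw data; values and symbols are hypothetical
raw = pd.DataFrame({
    "Tenure": [4, "#", 12, "@"],
    "rev_per_month": ["9", "+", "7", "5"],
    "Gender": ["Male", "F", "&", "Female"],
})

# Assumed list of stray symbols that should be treated as missing
placeholders = ["#", "@", "+", "&", "$", "*"]

# Cells that exactly match a placeholder become NaN, so pandas counts them as missing
cleaned = raw.mask(raw.isin(placeholders))

# Columns that should be numeric are converted; anything unparsable also becomes NaN
for col in ["Tenure", "rev_per_month"]:
    cleaned[col] = pd.to_numeric(cleaned[col], errors="coerce")

print(cleaned.isnull().sum())
```

Once the placeholders are real nulls, `isnull().sum()` reports them alongside the blank cells, and the per-column imputation choice can be made from the distributions.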

churn_data.shape

churn_data.info()

churn_data = churn_data.drop(['AccountID'], axis=1)

dups = churn_data.duplicated()
print('Number of duplicate rows = %d' % (dups.sum()))
churn_data[dups]

# Removing duplicates
churn_data = churn_data.drop_duplicates()

# Check whether the data has any missing values
# (only blank cells are counted as missing)
churn_data.isnull().sum()

# Check whether any other undesirable values are present:
# unique values for categorical variables
for column in churn_data.columns:
    if churn_data[column].dtype == 'object':
        print(column.upper(), ': ', churn_data[column].nunique())
        print(churn_data[column].value_counts().sort_values())
        print('\n')

# Keep common naming for the values of the Gender column:
# replace F and M by Female and Male
churn_data['Gender'] = churn_data['Gender'].replace(['F'], 'Female')
churn_data['Gender'] = churn_data['Gender'].replace(['M'], 'Male')

churn_data['Gender'].value_counts()

# Replace blank values with a new value 'unknown'
churn_data['Gender'] = churn_data['Gender'].fillna('unknown')

# Check the value counts for the Gender column again after replacing blank values with 'unknown'
churn_data['Gender'].value_counts()

# Check the value counts of account_segment after imputing blank values
churn_data['account_segment'].value_counts()

# Descriptive stats of continuous columns
churn_data[['Tenure', 'City_Tier', 'CC_Contacted_LY', 'Service_Score', 'CC_Agent_Score',
            'rev_per_month', 'Complain_ly', 'rev_growth_yoy', 'coupon_used_for_payment',
            'Day_Since_CC_connect', 'cashback']].describe()

# Distribution plot for Tenure
sns.distplot(churn_data.Tenure, bins=20)

# Boxplot for Tenure
sns.boxplot(x=churn_data.Tenure)

# Impute missing Tenure values with the median
churn_data["Tenure"] = churn_data["Tenure"].replace(np.nan, churn_data["Tenure"].median())

churn_data["Tenure"].isnull().sum()

# Distribution plot for CC_Contacted_LY
sns.distplot(churn_data.CC_Contacted_LY, bins=20)

# Boxplot for CC_Contacted_LY
sns.boxplot(x=churn_data.CC_Contacted_LY)

# Impute missing CC_Contacted_LY values with the median
churn_data["CC_Contacted_LY"] = churn_data["CC_Contacted_LY"].replace(np.nan, churn_data["CC_Contacted_LY"].median())

churn_data["CC_Contacted_LY"].isnull().sum()

# Distribution plot for Service_Score
sns.distplot(churn_data.Service_Score, bins=20)

# Boxplot for Service_Score
sns.boxplot(x=churn_data.Service_Score)

# Fill missing values with the most common value in this column
churn_data['Service_Score'] = churn_data['Service_Score'].fillna(3)

# Distribution plot for City_Tier
sns.distplot(churn_data.City_Tier, bins=20)

# Boxplot for City_Tier
sns.boxplot(x=churn_data.City_Tier)

# Count plot for Payment column
sns.countplot(data=churn_data, x='Payment')

# Use underscores when renaming the payment values
churn_data['Payment'] = churn_data['Payment'].replace(['Debit Card'], 'Debit_Card')
churn_data['Payment'] = churn_data['Payment'].replace(['Cash on Delivery'], 'COD')
churn_data['Payment'] = churn_data['Payment'].replace(['E wallet'], 'E_wallet')
churn_data['Payment'] = churn_data['Payment'].replace(['Credit Card'], 'Credit_Card')

# Count plot for Account_user_count
sns.countplot(data=churn_data, x='Account_user_count')

# Distribution plot for Account_user_count
sns.distplot(churn_data.Account_user_count, bins=20)

# Boxplot for Account_user_count
sns.boxplot(x=churn_data.Account_user_count)

# Count plot for account_segment
sns.countplot(data=churn_data, x='account_segment')

# Distribution plot for CC_Agent_Score
sns.distplot(churn_data.CC_Agent_Score, bins=20)

# Boxplot for CC_Agent_Score
sns.boxplot(x=churn_data.CC_Agent_Score)

# Count plot for Marital_Status column
sns.countplot(data=churn_data, x='Marital_Status')

# Distribution plot for rev_per_month
sns.distplot(churn_data.rev_per_month, bins=20)

# Boxplot for rev_per_month
sns.boxplot(x=churn_data.rev_per_month)

# Distribution plot for rev_growth_yoy
sns.distplot(churn_data.rev_growth_yoy, bins=20)

# Boxplot for rev_growth_yoy
sns.boxplot(x=churn_data.rev_growth_yoy)

# Distribution plot for coupon_used_for_payment
sns.distplot(churn_data.coupon_used_for_payment, bins=20)

# Boxplot for coupon_used_for_payment
sns.boxplot(x=churn_data.coupon_used_for_payment)

# Distribution plot for Day_Since_CC_connect
sns.distplot(churn_data.Day_Since_CC_connect, bins=20)

# Boxplot for Day_Since_CC_connect
sns.boxplot(x=churn_data.Day_Since_CC_connect)

# Count plot for Login_device column
sns.countplot(data=churn_data, x='Login_device')

# Distribution plot for cashback
sns.distplot(churn_data.cashback, bins=20)

# Boxplot for cashback
sns.boxplot(x=churn_data.cashback)

churn_data['Churn'].value_counts()

1) 0 means the customer is retained; 1 means the customer has churned.
2) The DTH company retained 83% of its users. The data is therefore skewed: instances in the 'Retained' class far outnumber those in the 'Churned' class.
3) However, since the industry churn rate for DTH companies is 14 to 16%, this distribution does not need oversampling for now. If needed, we can revisit this after building models and analysing their performance metrics.
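To make the imbalance in point 2 concrete, the sketch below computes class shares, the retained-to-churned ratio, and the accuracy of a trivial model that always predicts "retained". The toy `Churn` series is built to match the 83% retention figure quoted above and is not the real column.

```python
import pandas as pd

# Hypothetical target column: 83 retained (0) and 17 churned (1) customers
churn = pd.Series([0] * 83 + [1] * 17, name="Churn")

# Share of each class, indexed by class label
shares = churn.value_counts(normalize=True).sort_index()

# How many retained customers there are per churned customer
imbalance_ratio = shares.loc[0] / shares.loc[1]

# A model that always predicts the majority class reaches this accuracy
baseline_accuracy = shares.max()

print(shares)
print(f"Retained per churned customer: {imbalance_ratio:.2f}")
print(f"Majority-class baseline accuracy: {baseline_accuracy:.2f}")
```

Because the majority-class baseline already reaches 0.83 accuracy on such a split, recall and F1 on the churn class are more informative than accuracy alone when deciding later whether oversampling is needed.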

churn_data["Churn"].value_counts(normalize=True)

(churn_data.groupby(by=['account_segment'])['Tenure'].sum()
    .reset_index()
    .sort_values(['Tenure'])
    .tail(6)
    .plot(x='account_segment', y='Tenure', kind='bar', figsize=(15, 5)))
plt.show()

Tenure: Tenure does not seem to have a significant effect on churn rate. The average Tenure is 11 years.
Payment method: Debit card is the most preferred payment method, with around 3800 users, and it also accounts for the largest number of churned users. UPI is the least preferred, with around 520 users, of whom about 200 have churned. That is a higher proportion than among Debit card users, so customers using UPI appear more likely to churn.
Login device: Mobile users form the largest group of DTH users, around 6000, of whom about 1000 have churned. The pattern is similar for Computer users, so the type of device does not seem to affect churn rate much.
Gender: Male users form the largest group, at 5000 or more, of whom around 1000 have churned, so most churned users are male. However, this variable does not have a significant impact on churn rate, since the number of male users continuing the service is also higher.
Complain_ly: Customers who have contacted customer care the most are more likely to churn.
CC_Agent_Score: Customers who gave the agents a low rating are most likely to churn, possibly because they were dissatisfied with the agent's service and therefore left the DTH service.
Marital_Status: Married users form the largest group, at 5000 or more, followed by Single users at around 2500. Single users account for the highest number of churned customers, around 1000, so Marital_Status appears to affect churn rate.
City_Tier: Tier 1 cities have the most users, around 6000, of whom around 1000 have churned. City tier does not seem to have a large effect on churn rate.
Account_user_count: The highest count is for accounts with 4 tagged users, at around 3500 or more, of which 550 customers have churned.
There is no significant intercorrelation between our features, so we do not have to worry about multicollinearity.
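The observations above compare raw counts of churned users across categories. A per-category churn rate can make such comparisons easier to read. The sketch below shows one way to compute it with a grouped aggregation. The toy frame and its numbers are hypothetical, and only the column names `Payment` and `Churn` follow the dataset.

```python
import pandas as pd

# Toy data: 6 Debit_Card users (1 churned) and 4 UPI users (2 churned)
toy = pd.DataFrame({
    "Payment": ["Debit_Card"] * 6 + ["UPI"] * 4,
    "Churn": [1, 0, 0, 0, 0, 0, 1, 1, 0, 0],
})

# Since Churn is coded 0/1, its mean within a group is that group's churn rate
summary = (
    toy.groupby("Payment")["Churn"]
       .agg(users="count", churned="sum", churn_rate="mean")
       .sort_values("churn_rate", ascending=False)
)
print(summary)
```

In this toy example the smaller group has fewer churned users in absolute terms but a higher churn rate, which is the kind of pattern a count plot alone can hide.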

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='account_segment', hue='Gender')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='account_segment', hue='Marital_Status')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='City_Tier', hue='Gender')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='account_segment', hue='Account_user_count')
plt.show()

plt.figure(figsize=(15, 5))
sns.pointplot(x="account_segment", y="Tenure", hue='Gender', data=churn_data)
plt.show()

plt.figure(figsize=(15, 5))
sns.pointplot(x="Payment", y="CC_Contacted_LY", data=churn_data)
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='Payment', hue='Login_device')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='Payment', hue='Churn')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='Login_device', hue='Churn')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='Gender', hue='Churn')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='Complain_ly', hue='Churn')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='CC_Agent_Score', hue='Churn')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='Marital_Status', hue='Churn')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='City_Tier', hue='Churn')
plt.show()

plt.figure(figsize=(8, 8))
sns.countplot(data=churn_data, x='Account_user_count', hue='Churn')
plt.show()

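plt.figure(figsize=(10, 8))
# Correlation is only defined for numeric columns, so select those explicitly
sns.heatmap(churn_data.select_dtypes('number').corr(), annot=True)
plt.show()

churn_data.select_dtypes('number').corr()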
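The correlation matrix above backs the earlier remark about multicollinearity. One way to scan such a matrix without reading the heatmap cell by cell is to list the feature pairs whose absolute correlation exceeds a cutoff. The sketch below does this on synthetic data. The 0.8 cutoff and the column names `a`, `b`, `c` are illustrative assumptions, not values taken from the notebook.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
a = rng.normal(size=200)
toy = pd.DataFrame({
    "a": a,
    "b": 0.95 * a + rng.normal(scale=0.1, size=200),  # built to be nearly collinear with "a"
    "c": rng.normal(size=200),                        # independent noise
})

corr = toy.corr().abs()

# Keep only the upper triangle so each pair appears once and the diagonal is dropped
upper = np.triu(np.ones(corr.shape, dtype=bool), k=1)
pairs = corr.where(upper).stack().dropna()

threshold = 0.8  # assumed cutoff for "high" correlation
high = pairs[pairs > threshold]
print(high)
```

An empty `high` series on the real features would be consistent with the claim that multicollinearity is not a concern here.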

# Checking outliers:
# construct box plots for continuous variables
plt.figure(figsize=(30, 30))
churn_data.boxplot(vert=0)
plt.show()

churn_data.head()

sns.pairplot(churn_data,hue="Churn")

Before applying machine learning models to the dataset, we split it into train and test sets in a 70:30 ratio. We scale the data using z-scores, since the ranges of some numerical attributes are quite large and some machine learning algorithms, such as KNN, are biased towards variables with high magnitude. Initially we chose not to oversample the target variable, since the proportion of churn in this dataset is 0.16 and the standard industry churn rate for DTH companies is 14 to 16%. Therefore, we first build models with the data as it is.
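The split-and-scale step described above can be sketched with plain NumPy and pandas. Note that this is a variation rather than the notebook's exact procedure: the notebook applies `zscore` to the full feature matrix before splitting, whereas the sketch below computes the mean and standard deviation on the training rows only and reuses them for the test rows. The toy frame is synthetic, and only the column names `Tenure`, `cashback` and `Churn` follow the dataset.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(1)

# Synthetic stand-in for the prepared data
toy = pd.DataFrame({
    "Tenure": rng.integers(0, 40, size=100),
    "cashback": rng.normal(200, 50, size=100),
    "Churn": rng.integers(0, 2, size=100),
})
X = toy.drop(columns="Churn")
Y = toy["Churn"]

# 70:30 split using shuffled row positions
idx = rng.permutation(len(toy))
cut = round(0.7 * len(toy))
train_idx, test_idx = idx[:cut], idx[cut:]
X_train, X_test = X.iloc[train_idx], X.iloc[test_idx]
Y_train, Y_test = Y.iloc[train_idx], Y.iloc[test_idx]

# Z-score parameters come from the training rows only (ddof=0 matches scipy's zscore default)
mean = X_train.mean()
std = X_train.std(ddof=0)
X_train_z = (X_train - mean) / std
X_test_z = (X_test - mean) / std

print(X_train_z.describe().loc[["mean", "std"]])
```

Fitting the scaling parameters on the training rows keeps information about the test rows out of the features the model is trained on. Whether that matters for the results here is not something this sketch establishes.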

from scipy.stats import zscore

# Applying z-score scaling to all feature columns
scale_cols = ['Tenure', 'City_Tier', 'CC_Contacted_LY', 'Payment', 'Gender', 'Service_Score',
              'Account_user_count', 'account_segment', 'CC_Agent_Score', 'Marital_Status',
              'rev_per_month', 'Complain_ly', 'rev_growth_yoy', 'coupon_used_for_payment',
              'Day_Since_CC_connect', 'cashback', 'Login_device']
X[scale_cols] = X[scale_cols].apply(zscore)

X.head()

from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn import metrics, model_selection
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score, roc_curve, classification_report, confusion_matrix

X_train, X_test,Y_train,Y_test = train_test_split(X,Y,test_size=.30,random_state=1)

X_train.shape

X_test.shape

# Build LDA model
clf = LinearDiscriminantAnalysis()
modelld = clf.fit(X_train, Y_train)

# Training data class prediction with a cut-off value of 0.5
pred_class_train = modelld.predict(X_train)
# Test data class prediction with a cut-off value of 0.5
pred_class_test = modelld.predict(X_test)

model_score = clf.score(X_train, Y_train)
print(model_score)

print('Classification Report of the training data:\n\n', metrics.classification_report(Y_train, pred_class_train), '\n')
print('Classification Report of the test data:\n\n', metrics.classification_report(Y_test, pred_class_test), '\n')

lda_metrics = classification_report(Y_train, pred_class_train, output_dict=True)
dfld = pd.DataFrame(lda_metrics).transpose()
# Row 0 = class 0 (retained), row 1 = class 1 (churned); columns = precision, recall, f1-score
lda_train_precision_con = round(dfld.iloc[0, 0], 2)
lda_train_recall_con = round(dfld.iloc[0, 1], 2)
lda_train_f1_con = round(dfld.iloc[0, 2], 2)
lda_train_acc = round(dfld.loc["accuracy"].iloc[0], 2)
print('lda_train_precision Retain', lda_train_precision_con)
print('lda_train_recall Retain', lda_train_recall_con)
print('lda_train_f1 Retain', lda_train_f1_con)
print('lda_train_accuracy ', lda_train_acc)
lda_train_precision_lab = round(dfld.iloc[1, 0], 2)
lda_train_recall_lab = round(dfld.iloc[1, 1], 2)
lda_train_f1_lab = round(dfld.iloc[1, 2], 2)
print('lda_train_precision Churn', lda_train_precision_lab)
print('lda_train_recall Churn', lda_train_recall_lab)
print('lda_train_f1 Churn', lda_train_f1_lab)

model_score = clf.score(X_test, Y_test)
print(model_score)

lda_metrics = classification_report(Y_test, pred_class_test, output_dict=True)
dfld = pd.DataFrame(lda_metrics).transpose()
# Row 0 = class 0 (retained), row 1 = class 1 (churned); columns = precision, recall, f1-score
lda_test_precision_con = round(dfld.iloc[0, 0], 2)
lda_test_recall_con = round(dfld.iloc[0, 1], 2)
lda_test_f1_con = round(dfld.iloc[0, 2], 2)
lda_test_acc = round(dfld.loc["accuracy"].iloc[0], 2)
print('lda_test_precision Retain', lda_test_precision_con)
print('lda_test_recall Retain', lda_test_recall_con)
print('lda_test_f1 Retain', lda_test_f1_con)
print('lda_test_accuracy ', lda_test_acc)
lda_test_precision_lab = round(dfld.iloc[1, 0], 2)
lda_test_recall_lab = round(dfld.iloc[1, 1], 2)
lda_test_f1_lab = round(dfld.iloc[1, 2], 2)
print('lda_test_precision Churn', lda_test_precision_lab)
print('lda_test_recall Churn', lda_test_recall_lab)
print('lda_test_f1 Churn', lda_test_f1_lab)

# Class probabilities from the LDA model (needed for the ROC curves below)
pred_prob_train = modelld.predict_proba(X_train)
pred_prob_test = modelld.predict_proba(X_test)
# AUC and ROC for the training data
ldtr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % ldtr_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
ldrtst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % ldrtst_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.linear_model import LogisticRegression

# The data has already been split, so apply the logistic regression model to it.
# Note: newer scikit-learn releases expect penalty=None instead of the string 'none'.
modelg = LogisticRegression(solver='newton-cg', max_iter=10000, penalty='none', verbose=True, n_jobs=2)
modelg.fit(X_train, Y_train)

# Coefficients of the variables
modelg.coef_

churn_data.columns

modelg.intercept_

ytrain_predict = modelg.predict(X_train)
ytest_predict = modelg.predict(X_test)

print('Classification Report of the training data:\n', metrics.classification_report(Y_train, ytrain_predict), '\n')
print('Classification Report of the test data:\n', metrics.classification_report(Y_test, ytest_predict), '\n')

log_metrics = classification_report(Y_train, ytrain_predict, output_dict=True)
dflog = pd.DataFrame(log_metrics).transpose()
# Row 0 = class 0 (retained), row 1 = class 1 (churned); columns = precision, recall, f1-score
log_train_precision_con = round(dflog.iloc[0, 0], 2)
log_train_recall_con = round(dflog.iloc[0, 1], 2)
log_train_f1_con = round(dflog.iloc[0, 2], 2)
log_train_acc = round(dflog.loc["accuracy"].iloc[0], 2)
print('log_train_precision Retain', log_train_precision_con)
print('log_train_recall Retain', log_train_recall_con)
print('log_train_f1 Retain', log_train_f1_con)
print('log_train_accuracy ', log_train_acc)
log_metrics1 = classification_report(Y_train, ytrain_predict, output_dict=True)
dflog1 = pd.DataFrame(log_metrics1).transpose()
log_train_precision_lab = round(dflog1.iloc[1, 0], 2)
log_train_recall_lab = round(dflog1.iloc[1, 1], 2)
log_train_f1_lab = round(dflog1.iloc[1, 2], 2)
print('log_train_precision Churn', log_train_precision_lab)
print('log_train_recall Churn', log_train_recall_lab)
print('log_train_f1 Churn', log_train_f1_lab)

log_metrics2 = classification_report(Y_test, ytest_predict, output_dict=True)
dflog2 = pd.DataFrame(log_metrics2).transpose()
log_test_precision_con = round(dflog2.iloc[0, 0], 2)
log_test_recall_con = round(dflog2.iloc[0, 1], 2)
log_test_f1_con = round(dflog2.iloc[0, 2], 2)
log_test_acc = round(dflog2.loc["accuracy"].iloc[0], 2)
print('log_test_precision Retain', log_test_precision_con)
print('log_test_recall Retain', log_test_recall_con)
print('log_test_f1 Retain', log_test_f1_con)
print('log_test_accuracy ', log_test_acc)
log_metrics3 = classification_report(Y_test, ytest_predict, output_dict=True)
dflog3 = pd.DataFrame(log_metrics3).transpose()
log_test_precision_lab = round(dflog3.iloc[1, 0], 2)
log_test_recall_lab = round(dflog3.iloc[1, 1], 2)
log_test_f1_lab = round(dflog3.iloc[1, 2], 2)
print('log_test_precision Churn', log_test_precision_lab)
print('log_test_recall Churn', log_test_recall_lab)
print('log_test_f1 Churn', log_test_f1_lab)

# Training data probability prediction
pred_prob_train = modelg.predict_proba(X_train)
# Test data probability prediction
pred_prob_test = modelg.predict_proba(X_test)

# AUC and ROC for the training data
lgtr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % lgtr_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
lgtst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % lgtst_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

importances = pd.DataFrame(data={
    'Attribute': X_train.columns,
    'Importance': modelg.coef_[0]
})
importances = importances.sort_values(by='Importance', ascending=False)

plt.bar(x=importances['Attribute'], height=importances['Importance'], color='#087E8B')
plt.title('Feature importances obtained from coefficients', size=20)
plt.xticks(rotation='vertical')
plt.show()

grid={'penalty':['l2','none'], 'solver':['sag','lbfgs'], 'tol':[0.0001,0.00001,0.001]}

modelg1 = LogisticRegression(max_iter=10000,n_jobs=2)

grid_search = GridSearchCV(estimator = modelg1, param_grid = grid, cv = 3,n_jobs=-1,scoring='f1')

grid_search.fit(X_train, Y_train)

print(grid_search.best_params_, '\n')
print(grid_search.best_estimator_)
best_model = grid_search.best_estimator_

ytrain_predict = best_model.predict(X_train)
ytest_predict = best_model.predict(X_test)

# Split X and Y into training and test sets in a 70:30 ratio
X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size=0.30, random_state=1)

from sklearn.neighbors import KNeighborsClassifier
KNN_model = KNeighborsClassifier()
KNN_model.fit(X_train, Y_train)

## Performance metrics on the train data set
y_train_predict = KNN_model.predict(X_train)
model_score = KNN_model.score(X_train, Y_train)
print(model_score)
print(metrics.confusion_matrix(Y_train, y_train_predict))
print(metrics.classification_report(Y_train, y_train_predict))
## Performance metrics on the test data set
y_test_predict = KNN_model.predict(X_test)
model_score = KNN_model.score(X_test, Y_test)
print(model_score)
print(metrics.confusion_matrix(Y_test, y_test_predict))
print(metrics.classification_report(Y_test, y_test_predict))

# Training data probability prediction
pred_prob_train = KNN_model.predict_proba(X_train)
# Test data probability prediction
pred_prob_test = KNN_model.predict_proba(X_test)

# AUC and ROC for the training data
# (stored under KNN-specific names so the LDA AUC values are not overwritten)
knntr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % knntr_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
knntst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % knntst_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

from sklearn.naive_bayes import GaussianNB
from sklearn import metrics

NB_model = GaussianNB()
NB_model.fit(X_train, Y_train)

## Performance metrics on the train data set
y_train_predict = NB_model.predict(X_train)
model_score = NB_model.score(X_train, Y_train)                   ## Accuracy
print(model_score)
print(metrics.confusion_matrix(Y_train, y_train_predict))        ## confusion_matrix
print(metrics.classification_report(Y_train, y_train_predict))   ## classification_report
## Performance metrics on the test data set
y_test_predict = NB_model.predict(X_test)
model_score = NB_model.score(X_test, Y_test)                     ## Accuracy
print(model_score)
print(metrics.confusion_matrix(Y_test, y_test_predict))          ## confusion_matrix
print(metrics.classification_report(Y_test, y_test_predict))     ## classification_report

nb_metrics = classification_report(Y_train, y_train_predict, output_dict=True)
dfnb = pd.DataFrame(nb_metrics).transpose()
# Row 0 = class 0 (retained), row 1 = class 1 (churned); columns = precision, recall, f1-score
nb_train_precision_con = round(dfnb.iloc[0, 0], 2)
nb_train_recall_con = round(dfnb.iloc[0, 1], 2)
nb_train_f1_con = round(dfnb.iloc[0, 2], 2)
nb_train_acc = round(dfnb.loc["accuracy"].iloc[0], 2)
print('nb_train_precision retain', nb_train_precision_con)
print('nb_train_recall retain', nb_train_recall_con)
print('nb_train_f1 retain', nb_train_f1_con)
print('nb_train_accuracy ', nb_train_acc)
nb_metrics1 = classification_report(Y_train, y_train_predict, output_dict=True)
dfnb1 = pd.DataFrame(nb_metrics1).transpose()
nb_train_precision_lab = round(dfnb1.iloc[1, 0], 2)
nb_train_recall_lab = round(dfnb1.iloc[1, 1], 2)
nb_train_f1_lab = round(dfnb1.iloc[1, 2], 2)
print('nb_train_precision churn', nb_train_precision_lab)
print('nb_train_recall churn', nb_train_recall_lab)
print('nb_train_f1 churn', nb_train_f1_lab)

nb_metrics = classification_report(Y_test, y_test_predict, output_dict=True)
dfnb = pd.DataFrame(nb_metrics).transpose()
nb_test_precision_con = round(dfnb.iloc[0, 0], 2)
nb_test_recall_con = round(dfnb.iloc[0, 1], 2)
nb_test_f1_con = round(dfnb.iloc[0, 2], 2)
nb_test_acc = round(dfnb.loc["accuracy"].iloc[0], 2)
print('nb_test_precision retain', nb_test_precision_con)
print('nb_test_recall retain', nb_test_recall_con)
print('nb_test_f1 retain', nb_test_f1_con)
print('nb_test_accuracy ', nb_test_acc)
nb_metrics1 = classification_report(Y_test, y_test_predict, output_dict=True)
dfnb1 = pd.DataFrame(nb_metrics1).transpose()
nb_test_precision_lab = round(dfnb1.iloc[1, 0], 2)
nb_test_recall_lab = round(dfnb1.iloc[1, 1], 2)
nb_test_f1_lab = round(dfnb1.iloc[1, 2], 2)
print('nb_test_precision churn', nb_test_precision_lab)
print('nb_test_recall churn', nb_test_recall_lab)
print('nb_test_f1 churn', nb_test_f1_lab)

# Training data probability prediction
pred_prob_train = NB_model.predict_proba(X_train)
# Test data probability prediction
pred_prob_test = NB_model.predict_proba(X_test)

# AUC and ROC for the training data
gbtr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % gbtr_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
gbtst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % gbtst_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier
cart = DecisionTreeClassifier()
# Note: scikit-learn 1.2 renamed the base_estimator argument to estimator
Bagging_model = BaggingClassifier(base_estimator=cart, n_estimators=100, random_state=1)
Bagging_model.fit(X_train, Y_train)

## Performance metrics on the train data set
y_train_predict = Bagging_model.predict(X_train)
model_score = Bagging_model.score(X_train, Y_train)
print(model_score)
print(metrics.confusion_matrix(Y_train, y_train_predict))
print(metrics.classification_report(Y_train, y_train_predict))
## Performance metrics on the test data set
y_test_predict = Bagging_model.predict(X_test)
model_score = Bagging_model.score(X_test, Y_test)
print(model_score)
print(metrics.confusion_matrix(Y_test, y_test_predict))
print(metrics.classification_report(Y_test, y_test_predict))

# Training data probability prediction
pred_prob_train = Bagging_model.predict_proba(X_train)
# Test data probability prediction
pred_prob_test = Bagging_model.predict_proba(X_test)

bag_metrics = classification_report(Y_train, y_train_predict, output_dict=True)
dfbag = pd.DataFrame(bag_metrics).transpose()
# Row 0 = class 0 (retained), row 1 = class 1 (churned); columns = precision, recall, f1-score
bag_train_precision_con = round(dfbag.iloc[0, 0], 2)
bag_train_recall_con = round(dfbag.iloc[0, 1], 2)
bag_train_f1_con = round(dfbag.iloc[0, 2], 2)
bag_train_acc = round(dfbag.loc["accuracy"].iloc[0], 2)
print('bag_train_precision retain', bag_train_precision_con)
print('bag_train_recall retain', bag_train_recall_con)
print('bag_train_f1 retain', bag_train_f1_con)
print('bag_train_accuracy ', bag_train_acc)
bag_metrics1 = classification_report(Y_train, y_train_predict, output_dict=True)
dfbag1 = pd.DataFrame(bag_metrics1).transpose()
bag_train_precision_lab = round(dfbag1.iloc[1, 0], 2)
bag_train_recall_lab = round(dfbag1.iloc[1, 1], 2)
bag_train_f1_lab = round(dfbag1.iloc[1, 2], 2)
print('bag_train_precision churn', bag_train_precision_lab)
print('bag_train_recall churn', bag_train_recall_lab)
print('bag_train_f1 churn', bag_train_f1_lab)

bag_metrics = classification_report(Y_test, y_test_predict, output_dict=True)
dfbag = pd.DataFrame(bag_metrics).transpose()
# Test metrics get their own names so the training values above are not overwritten
bag_test_precision_con = round(dfbag.iloc[0, 0], 2)
bag_test_recall_con = round(dfbag.iloc[0, 1], 2)
bag_test_f1_con = round(dfbag.iloc[0, 2], 2)
bag_test_acc = round(dfbag.loc["accuracy"].iloc[0], 2)
print('bag_test_precision retain', bag_test_precision_con)
print('bag_test_recall retain', bag_test_recall_con)
print('bag_test_f1 retain', bag_test_f1_con)
print('bag_test_accuracy ', bag_test_acc)
bag_metrics1 = classification_report(Y_test, y_test_predict, output_dict=True)
dfbag1 = pd.DataFrame(bag_metrics1).transpose()
bag_test_precision_lab = round(dfbag1.iloc[1, 0], 2)
bag_test_recall_lab = round(dfbag1.iloc[1, 1], 2)
bag_test_f1_lab = round(dfbag1.iloc[1, 2], 2)
print('bag_test_precision churn', bag_test_precision_lab)
print('bag_test_recall churn', bag_test_recall_lab)
print('bag_test_f1 churn', bag_test_f1_lab)

from sklearn.model_selection import GridSearchCV
param_grid = {
    'n_estimators': [101, 301, 500],
    'random_state': [1, 0, 100]
}
bcl = BaggingClassifier(base_estimator=cart)
grid_search = GridSearchCV(estimator=bcl, param_grid=param_grid, cv=3)

grid_search.fit(X_train, Y_train)

grid_search.best_params_

best_grid = grid_search.best_estimator_

best_grid

ytrain_predict_best = best_grid.predict(X_train)
ytest_predict_best = best_grid.predict(X_test)

## Performance metrics on the train data set
model_score = best_grid.score(X_train, Y_train)
print(model_score)
print(metrics.confusion_matrix(Y_train, ytrain_predict_best))
print(metrics.classification_report(Y_train, ytrain_predict_best))
## Performance metrics on the test data set
model_score = best_grid.score(X_test, Y_test)
print(model_score)
print(metrics.confusion_matrix(Y_test, ytest_predict_best))
print(metrics.classification_report(Y_test, ytest_predict_best))

# AUC and ROC for the training data
# (pred_prob_train and pred_prob_test still hold the probabilities from Bagging_model computed earlier)
bagtr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % bagtr_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
bagtst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % bagtst_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

from sklearn.ensemble import RandomForestClassifier
RF_model = RandomForestClassifier(n_estimators=100, random_state=1)
RF_model.fit(X_train, Y_train)

## Performance metrics on the train data set
y_train_predict = RF_model.predict(X_train)
model_score = RF_model.score(X_train, Y_train)
print(model_score)
print(metrics.confusion_matrix(Y_train, y_train_predict))
print(metrics.classification_report(Y_train, y_train_predict))
## Performance metrics on the test data set
y_test_predict = RF_model.predict(X_test)
model_score = RF_model.score(X_test, Y_test)
print(model_score)
print(metrics.confusion_matrix(Y_test, y_test_predict))
print(metrics.classification_report(Y_test, y_test_predict))

# Training data probability prediction
pred_prob_train = RF_model.predict_proba(X_train)
# Test data probability prediction
pred_prob_test = RF_model.predict_proba(X_test)

# AUC and ROC for the training data
rdtr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % rdtr_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
rdtst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % rdtst_auc)
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the ROC curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

from sklearn.ensemble import AdaBoostClassifier
ADB_model = AdaBoostClassifier(n_estimators=100, random_state=1)
ADB_model.fit(X_train, Y_train)

## Performance Matrix on train data set
y_train_predict = ADB_model.predict(X_train)
model_score = ADB_model.score(X_train, Y_train)
print(model_score)
print(metrics.confusion_matrix(Y_train, y_train_predict))
print(metrics.classification_report(Y_train, y_train_predict))
## Performance Matrix on test data set
y_test_predict = ADB_model.predict(X_test)
model_score = ADB_model.score(X_test, Y_test)
print(model_score)
print(metrics.confusion_matrix(Y_test, y_test_predict))
print(metrics.classification_report(Y_test, y_test_predict))

# Training Data Probability Prediction
pred_prob_train = ADB_model.predict_proba(X_train)
# Test Data Probability Prediction
pred_prob_test = ADB_model.predict_proba(X_test)

# Hyperparameter grid for AdaBoost, searched with 3-fold cross-validation
param_grid = {
    'learning_rate': [1, 0.1, 0.01],
    'n_estimators': [50, 101, 301]
}
adbos = AdaBoostClassifier()
grid_search = GridSearchCV(estimator=adbos, param_grid=param_grid, cv=3)

grid_search.fit(X_train, Y_train)

grid_search.best_params_

best_grid = grid_search.best_estimator_

best_grid

ytrain_predict_best = best_grid.predict(X_train)
ytest_predict_best = best_grid.predict(X_test)

# Training Data Probability Prediction
pred_prob_train = best_grid.predict_proba(X_train)
# Test Data Probability Prediction
pred_prob_test = best_grid.predict_proba(X_test)

# AUC and ROC for the training data
# calculate AUC
adtr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % adtr_auc)
# calculate roc curve
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the roc curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
# calculate AUC
adtst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % adtst_auc)
# calculate roc curve
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the roc curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

# Per-class metrics on the training data for the tuned AdaBoost model
adbos_metrics = classification_report(Y_train, ytrain_predict_best, output_dict=True)
dfad = pd.DataFrame(adbos_metrics).transpose()
# Row 0 is the first class in the report; columns are precision, recall, f1-score
ad_train_precision_con = round(dfad.iloc[0, 0], 2)
ad_train_recall_con = round(dfad.iloc[0, 1], 2)
ad_train_f1_con = round(dfad.iloc[0, 2], 2)
ad_train_acc = round(dfad.loc["accuracy"].iloc[0], 2)
print('ad_train_precision churn', ad_train_precision_con)
print('ad_train_recall churn', ad_train_recall_con)
print('ad_train_f1 churn', ad_train_f1_con)
print('ad_train_accuracy ', ad_train_acc)
# Row 1 is the second class; the same report is reused instead of recomputed
ad_train_precision_lab = round(dfad.iloc[1, 0], 2)
ad_train_recall_lab = round(dfad.iloc[1, 1], 2)
ad_train_f1_lab = round(dfad.iloc[1, 2], 2)
print('ad_train_precision retain', ad_train_precision_lab)
print('ad_train_recall retain', ad_train_recall_lab)
print('ad_train_f1 retain', ad_train_f1_lab)

# Per-class metrics on the test data for the tuned AdaBoost model
# (stored under ad_test_* so the training values above are not overwritten)
adbos_metrics = classification_report(Y_test, ytest_predict_best, output_dict=True)
dfad = pd.DataFrame(adbos_metrics).transpose()
ad_test_precision_con = round(dfad.iloc[0, 0], 2)
ad_test_recall_con = round(dfad.iloc[0, 1], 2)
ad_test_f1_con = round(dfad.iloc[0, 2], 2)
ad_test_acc = round(dfad.loc["accuracy"].iloc[0], 2)
print('ad_test_precision churn', ad_test_precision_con)
print('ad_test_recall churn', ad_test_recall_con)
print('ad_test_f1 churn', ad_test_f1_con)
print('ad_test_accuracy ', ad_test_acc)
ad_test_precision_lab = round(dfad.iloc[1, 0], 2)
ad_test_recall_lab = round(dfad.iloc[1, 1], 2)
ad_test_f1_lab = round(dfad.iloc[1, 2], 2)
print('ad_test_precision retain', ad_test_precision_lab)
print('ad_test_recall retain', ad_test_recall_lab)
print('ad_test_f1 retain', ad_test_f1_lab)
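The metric-extraction cells above read rows of the report by position, which makes it easy to mislabel a class. As a minimal sketch, the same numbers can be looked up by class label instead. The helper name `class_metrics` is hypothetical, the toy labels are made up, and the assumption that churn is encoded as 1 should be checked against the target column.

```python
from sklearn.metrics import classification_report

def class_metrics(y_true, y_pred, label):
    """Precision, recall and F1 for one class label, plus overall accuracy."""
    report = classification_report(
        y_true, y_pred, output_dict=True, zero_division=0
    )
    row = report[str(label)]  # report keys are the class labels as strings
    return {
        "precision": round(row["precision"], 2),
        "recall": round(row["recall"], 2),
        "f1": round(row["f1-score"], 2),
        "accuracy": round(report["accuracy"], 2),
    }

# Toy labels for illustration only (assumption: 1 = churn, 0 = retain)
toy_true = [0, 0, 0, 1, 1, 1]
toy_pred = [0, 0, 1, 1, 1, 0]
churn_metrics = class_metrics(toy_true, toy_pred, label=1)
print(churn_metrics)
```

Called as `class_metrics(Y_test, ytest_predict_best, label=1)`, it would return the test-set figures for the churn class without relying on row order.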

from sklearn.ensemble import GradientBoostingClassifier
gbcl = GradientBoostingClassifier(random_state=1)
gbcl = gbcl.fit(X_train, Y_train)

## Performance Matrix on train data set
y_train_predict = gbcl.predict(X_train)
model_score = gbcl.score(X_train, Y_train)
print(model_score)
print(metrics.confusion_matrix(Y_train, y_train_predict))
print(metrics.classification_report(Y_train, y_train_predict))
## Performance Matrix on test data set
y_test_predict = gbcl.predict(X_test)
model_score = gbcl.score(X_test, Y_test)
print(model_score)
print(metrics.confusion_matrix(Y_test, y_test_predict))
print(metrics.classification_report(Y_test, y_test_predict))

# Training Data Probability Prediction
pred_prob_train = gbcl.predict_proba(X_train)
# Test Data Probability Prediction
pred_prob_test = gbcl.predict_proba(X_test)

# AUC and ROC for the training data
# calculate AUC
gdtr_auc = metrics.roc_auc_score(Y_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % gdtr_auc)
# calculate roc curve
fpr, tpr, thresholds = metrics.roc_curve(Y_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the roc curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
# calculate AUC
gdtst_auc = metrics.roc_auc_score(Y_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % gdtst_auc)
# calculate roc curve
fpr, tpr, thresholds = metrics.roc_curve(Y_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the roc curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

# Per-class metrics on the training data for the Gradient Boosting model.
# y_train_predict holds the gbcl predictions from the cell above; at this point
# ytrain_predict_best still holds the tuned AdaBoost predictions, so it is not used here.
grbos_metrics = classification_report(Y_train, y_train_predict, output_dict=True)
dfgr = pd.DataFrame(grbos_metrics).transpose()
gr_train_precision_con = round(dfgr.iloc[0, 0], 2)
gr_train_recall_con = round(dfgr.iloc[0, 1], 2)
gr_train_f1_con = round(dfgr.iloc[0, 2], 2)
gr_train_acc = round(dfgr.loc["accuracy"].iloc[0], 2)
print('gr_train_precision churn', gr_train_precision_con)
print('gr_train_recall churn', gr_train_recall_con)
print('gr_train_f1 churn', gr_train_f1_con)
print('gr_train_accuracy ', gr_train_acc)
gr_train_precision_lab = round(dfgr.iloc[1, 0], 2)
gr_train_recall_lab = round(dfgr.iloc[1, 1], 2)
gr_train_f1_lab = round(dfgr.iloc[1, 2], 2)
print('gr_train_precision retain', gr_train_precision_lab)
print('gr_train_recall retain', gr_train_recall_lab)
print('gr_train_f1 retain', gr_train_f1_lab)

# Per-class metrics on the test data for the Gradient Boosting model
# (gbcl predictions in y_test_predict, stored under gr_test_* names)
grbos_metrics = classification_report(Y_test, y_test_predict, output_dict=True)
dfgr = pd.DataFrame(grbos_metrics).transpose()
gr_test_precision_con = round(dfgr.iloc[0, 0], 2)
gr_test_recall_con = round(dfgr.iloc[0, 1], 2)
gr_test_f1_con = round(dfgr.iloc[0, 2], 2)
gr_test_acc = round(dfgr.loc["accuracy"].iloc[0], 2)
print('gr_test_precision churn', gr_test_precision_con)
print('gr_test_recall churn', gr_test_recall_con)
print('gr_test_f1 churn', gr_test_f1_con)
print('gr_test_accuracy ', gr_test_acc)
gr_test_precision_lab = round(dfgr.iloc[1, 0], 2)
gr_test_recall_lab = round(dfgr.iloc[1, 1], 2)
gr_test_f1_lab = round(dfgr.iloc[1, 2], 2)
print('gr_test_precision retain', gr_test_precision_lab)
print('gr_test_recall retain', gr_test_recall_lab)
print('gr_test_f1 retain', gr_test_f1_lab)

# Hyperparameter grid for Gradient Boosting, searched with 3-fold cross-validation
param_grid = {
    # 'learning_rate': [1, 0.1],
    'n_estimators': [50, 101],
    'tol': [0.01, 0.0001],
    'max_depth': [3, 7, 20],
    'min_samples_leaf': [1, 5, 20],
    'min_samples_split': [2, 3, 60],
}
gbos = GradientBoostingClassifier()
grid_search = GridSearchCV(estimator=gbos, param_grid=param_grid, cv=3)

grid_search.fit(X_train, Y_train)

grid_search.best_params_

best_grid = grid_search.best_estimator_

best_grid

ytrain_predict_best = best_grid.predict(X_train)
ytest_predict_best = best_grid.predict(X_test)

from numpy import where
import matplotlib.pyplot as plt
from collections import Counter
from sklearn.datasets import make_classification
from imblearn.over_sampling import SMOTE

# Oversample the minority class with SMOTE
oversample = SMOTE()
X, Y = oversample.fit_resample(X, Y)

counter = Counter(Y)
counter

X.head()

Y.head()

Y.value_counts()

# Split X and y into training and test set in 70:30 ratio
X1_train, X1_test, Y1_train, Y1_test = train_test_split(X, Y, test_size=0.30, random_state=1)
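One caveat worth checking: here the data is oversampled before it is split, so synthetic minority rows derived from a customer can land in the test set while the original row sits in the training set, which tends to inflate test scores. A minimal sketch of the alternative order (split first, then oversample only the training part) follows. It uses plain random oversampling from scikit-learn as a stand-in for SMOTE so that it runs without extra packages, and the toy data and the `*_demo` names are made up for illustration.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.utils import resample

rng = np.random.RandomState(1)
X_demo = rng.normal(size=(200, 3))
y_demo = np.array([1] * 40 + [0] * 160)  # imbalanced toy target, 1 = churn

# Split first, so the test set keeps the original class balance.
X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.30, random_state=1, stratify=y_demo
)

# Oversample the minority class in the training split only.
majority = X_tr[y_tr == 0]
minority = X_tr[y_tr == 1]
minority_up = resample(
    minority, replace=True, n_samples=len(majority), random_state=1
)

X_tr_bal = np.vstack([majority, minority_up])
y_tr_bal = np.array([0] * len(majority) + [1] * len(minority_up))

print("train class counts:", np.bincount(y_tr_bal))
print("test class counts: ", np.bincount(y_te))
```

With imbalanced-learn, the same order would mean calling `SMOTE().fit_resample` on the training split only and leaving the test split untouched.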

from sklearn.neighbors import KNeighborsClassifier
KNN_model = KNeighborsClassifier()
KNN_model.fit(X1_train, Y1_train)

## Performance Matrix on train data set
y1_train_predict = KNN_model.predict(X1_train)
model_score = KNN_model.score(X1_train, Y1_train)
print(model_score)
print(metrics.confusion_matrix(Y1_train, y1_train_predict))
print(metrics.classification_report(Y1_train, y1_train_predict))
## Performance Matrix on test data set
y1_test_predict = KNN_model.predict(X1_test)
model_score = KNN_model.score(X1_test, Y1_test)
print(model_score)
print(metrics.confusion_matrix(Y1_test, y1_test_predict))
print(metrics.classification_report(Y1_test, y1_test_predict))

# Training Data Probability Prediction
pred_prob_train = KNN_model.predict_proba(X1_train)
# Test Data Probability Prediction
pred_prob_test = KNN_model.predict_proba(X1_test)

# AUC and ROC for the training data
# calculate AUC
ldtr_auc = metrics.roc_auc_score(Y1_train, pred_prob_train[:, 1])
print('AUC for the Training Data: %.3f' % ldtr_auc)
# calculate roc curve
fpr, tpr, thresholds = metrics.roc_curve(Y1_train, pred_prob_train[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the roc curve for the model
plt.plot(fpr, tpr, marker='.', label='Training Data')
# AUC and ROC for the test data
# calculate AUC
ldrtst_auc = metrics.roc_auc_score(Y1_test, pred_prob_test[:, 1])
print('AUC for the Test Data: %.3f' % ldrtst_auc)
# calculate roc curve
fpr, tpr, thresholds = metrics.roc_curve(Y1_test, pred_prob_test[:, 1])
plt.plot([0, 1], [0, 1], linestyle='--')
# plot the roc curve for the model
plt.plot(fpr, tpr, marker='.', label='Test Data')
# show the plot
plt.legend(loc='best')
plt.show()

We will focus on reducing the possibility of false negatives, that is, customers who churn but are predicted to stay. The primary evaluation criterion is therefore Recall, followed by Accuracy.

A single metric is not the only way to compare the predictive performance of classification models. The ROC curve (Receiver Operating Characteristic curve) is a graph showing the performance of a classifier at different classification thresholds. It plots the true positive rate (another name for recall) against the false positive rate.
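To make the threshold idea concrete, the sketch below computes one ROC point by hand: it turns predicted probabilities into labels at a given threshold and counts true and false positives. The helper `roc_point` and the toy labels and scores are made up for illustration (assuming 1 = churn); `metrics.roc_curve` performs the same calculation across all thresholds.

```python
import numpy as np

def roc_point(y_true, scores, threshold):
    """Return (false positive rate, true positive rate) at one threshold."""
    y_true = np.asarray(y_true)
    y_pred = (np.asarray(scores) >= threshold).astype(int)
    tp = np.sum((y_pred == 1) & (y_true == 1))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    tn = np.sum((y_pred == 0) & (y_true == 0))
    tpr = tp / (tp + fn)  # recall: share of actual churners that are caught
    fpr = fp / (fp + tn)  # share of retained customers wrongly flagged
    return float(fpr), float(tpr)

# Toy labels and predicted churn probabilities, for illustration only
toy_labels = [0, 0, 1, 1, 0, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7]

for t in (0.3, 0.5, 0.75):
    fpr_t, tpr_t = roc_point(toy_labels, toy_scores, t)
    print(f"threshold={t:.2f}  FPR={fpr_t:.2f}  TPR (recall)={tpr_t:.2f}")
```

Lowering the threshold raises recall at the cost of more false positives, which is the trade-off the ROC curve traces out.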

After oversampling the data, the scores of all the models increased considerably. KNN, Random Forest and Bagging have the best overall scores of all the models. However, KNN has overfit on Recall on the Train data as well as the Test data, and Bagging has overfit on the Train data on Recall as well as Accuracy. Random Forest shows better scores on Recall as well as Accuracy for both the Train and Test sets, so we choose Random Forest as the best model, as it has the best overall scores.
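One simple way to make the overfitting comparison explicit is to report recall on both splits together with the gap between them. The sketch below does this on synthetic data with an unpruned decision tree, which memorizes its training set. The helper `recall_gap` and the `*_syn` names are hypothetical, and the numbers it prints say nothing about the models in this notebook.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

def recall_gap(model, X_train, y_train, X_test, y_test):
    """Recall on train and test, and the difference between the two."""
    train_recall = recall_score(y_train, model.predict(X_train))
    test_recall = recall_score(y_test, model.predict(X_test))
    return train_recall, test_recall, train_recall - test_recall

# Synthetic, imbalanced data purely for illustration
X_syn, y_syn = make_classification(
    n_samples=600, n_features=8, weights=[0.8, 0.2],
    flip_y=0.05, random_state=1
)
X_a, X_b, y_a, y_b = train_test_split(
    X_syn, y_syn, test_size=0.30, random_state=1, stratify=y_syn
)

# An unpruned tree fits the training split perfectly
tree = DecisionTreeClassifier(random_state=1).fit(X_a, y_a)
train_recall, test_recall, gap = recall_gap(tree, X_a, y_a, X_b, y_b)
print(f"train recall={train_recall:.2f}  test recall={test_recall:.2f}  gap={gap:.2f}")
```

A large positive gap suggests the model has memorized the training data; applying the same helper to each fitted model gives a like-for-like comparison.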

Attributes that are negative predictors of churn: Tenure, Account_segment, Days_since_cc_connect, Cashback, rev_growth_yoy, Login_device, Payment method and service score. These are the attributes that keep customers from churning.

Tenure: On its own, Tenure does not seem to have a significant effect on the churn rate; the average Tenure is 11 years. Still, a customer who has stayed with the DTH service for longer than 11 years is less likely to churn than a customer with a shorter tenure.

Days_since_cc_connect: This is the number of days since the customer last contacted customer care; the average is 4 days. It is also a negative predictor of churn, because a customer who has not needed to call customer care recently is probably satisfied with the service and is less likely to leave the DTH provider.

Cashback: Customers receiving more cashback (the average is 196) are less likely to churn.

Account_segment: Customers in the higher account segments, 'Regular plus' and 'Super plus', spend more and are more involved with the services provided by the DTH company, and hence are less likely to churn.

Attributes that are positive predictors of churn: Complaint_ly, CC_Agent_score, Rev_per_month, Marital_status, city_tier, account user count, coupon used for payment and cc contacted ly. These are the variables that give rise to customer churn.

Complain_ly: Customers who have contacted customer care the highest number of times are more likely to churn.

CC_Agent_score: Customers who have given low ratings to the agents are the most likely to churn, since they may be dissatisfied with the agent's service and leave the DTH service as a result.

Marital_status: Married users are the largest group, at around 5000 and above, followed by Single users at around 2500. The number of users who have churned is highest among Single users, at around 1000. So we can say that Marital_status affects the churn rate.

City_Tier: Tier 1 cities have the most users, around 6000, of whom around 1000 have churned. City tier does not seem to have a great effect on the churn rate.

Account_user_count: Accounts with 4 tagged users are the most numerous, at around 3500 and above, of which 550 customers have churned.
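The write-up does not say how the sign of each predictor was determined. One common way, sketched here as an assumption rather than as the method used above, is to fit a logistic regression on standardized features and read the sign of each coefficient: negative coefficients lower the odds of churn and positive ones raise them. The data below is synthetic and only borrows two column names for readability.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

rng = np.random.RandomState(1)
n = 500
demo = pd.DataFrame({
    "Tenure": rng.uniform(0, 30, n),
    "Complain_ly": rng.randint(0, 2, n),
})

# Synthetic target: longer tenure lowers churn odds, a complaint raises them.
logit = -0.15 * demo["Tenure"] + 1.5 * demo["Complain_ly"] + 1.0
prob = 1 / (1 + np.exp(-logit))
churn = (rng.uniform(size=n) < prob).astype(int)

# Standardize so coefficient magnitudes are comparable across features.
X_scaled = StandardScaler().fit_transform(demo)
clf = LogisticRegression().fit(X_scaled, churn)

coefs = pd.Series(clf.coef_[0], index=demo.columns).sort_values()
print(coefs)
```

Tree-based models such as Random Forest expose `feature_importances_`, which rank features by importance but do not indicate direction, so a signed measure like this complements them.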
