Icon

01_​Training_​a_​Churn_​Predictor_​LogReg

Four basic steps in Data Preparation before Training a Churn Predictor
Read Data Contract Data Calls Data Training SMOTE Logistic Regression Learner Save all Models Churn Prediction - Training - Logistic Regression This workflow is an example of how to train a basic machine learning model for a churn prediction task, using a Logistic Regression algorithm. Notice the four basic data prep steps: missing value imputation, normalization, type conversion, and SMOTE. Also notice the usage of (Apply) nodes to prevent data leakage. Testing all transformations (Apply) Logistic regression Predictor Scorer Training Data Preparation Partitioning 80%-20% Data Explorer for no of missing values Number to String (Churn) Data Preparation Category To Number (State) Missing Value (Median, Median, Unknown) Normalizer (z-score) Testing 80%vs. 20%churn -> StringCalls datacontract dataperformancescoringpredict churnremove rowswith misisng valuesz-scoreoversamplingminority classStatechurn -> StringPartitioning Joiner Number To String Excel Reader CSV Reader Scorer LogisticRegression Learner Logistic RegressionPredictor Missing Value Missing Value(Apply) Normalizer Normalizer (Apply) Model Writing SMOTE Category To Number Category ToNumber (Apply) Number To String Data Explorer Read Data Contract Data Calls Data Training SMOTE Logistic Regression Learner Save all Models Churn Prediction - Training - Logistic Regression This workflow is an example of how to train a basic machine learning model for a churn prediction task, using a Logistic Regression algorithm. Notice the four basic data prep steps: missing value imputation, normalization, type conversion, and SMOTE. Also notice the usage of (Apply) nodes to prevent data leakage. Testing all transformations (Apply) Logistic regression Predictor Scorer Training Data Preparation Partitioning 80%-20% Data Explorer for no of missing values Number to String (Churn) Data Preparation Category To Number (State) Missing Value (Median, Median, Unknown) Normalizer (z-score) Testing 80%vs. 20%churn -> StringCalls datacontract dataperformancescoringpredict churnremove rowswith misisng valuesz-scoreoversamplingminority classStatechurn -> StringPartitioning Joiner Number To String Excel Reader CSV Reader Scorer LogisticRegression Learner Logistic RegressionPredictor Missing Value Missing Value(Apply) Normalizer Normalizer (Apply) Model Writing SMOTE Category To Number Category ToNumber (Apply) Number To String Data Explorer

Nodes

Extensions

Links