Icon

8.1 Practical Machine Learning with R Churn Analysis (unbalanced)

This workflow repeats the Churn Analysis from the textbook Practical Machine Learning with R (https:\\ai.lange-analytics.com). We are using unbalanced data for a churn analysis. "Unbalanced" means that one class (customers who did not churned) contains significantly more observations than the other class (customers who churned). Check the Value Count Node to see how imbalanced the data are. Consequently, the model focuses too much on the majority class. Check the Sensitivity and Specificity in the Scorer to see the problem.

URL: Practical Machine Learning with R (Chapter 8) https://ai.lange-analytics.com
URL: Open the related R analysis in RStudio https://ai.lange-analytics.com/exc/?file=09-LogRegrExerc100.Rmd
URL: Another workflow in this space uses SMOTE to balance the data https://hub.knime.com/-/spaces/-/~N65TXOu1X9_n0It6/current-state/
URL: Contact the author https://ai.lange-analytics.com/EmailForw.html

Nodes

Extensions

Links