Icon

cross validation demo

Advanced Data Mining - Solution

Solution to the exercise 10 for KNIME User Training
- Training a Random Forest model to predict a nominal target column
- Evaluating the performance of a classification model
- Optimizing parameters of the Random Forest model
- Performing the classification multiple times in a cross validation loop

Part I: Random Forest Model - Read CurrentDetailData.table data - Partition the data 70/30 using stratified sampling on the "Target" column - Train and apply a Random Forest model to predict the "Target" column - Use a tree depth of 5 and 160 models Part II: Cross Validation - Create a 10-fold cross validation for your model - Take a look at the error rates produced by the different iterations. Does the model seem stable? X-Partitioner X-Aggregator Partitioning Random ForestLearner Random ForestPredictor Random ForestLearner Random ForestPredictor Scorer Table Reader Scorer Part I: Random Forest Model - Read CurrentDetailData.table data - Partition the data 70/30 using stratified sampling on the "Target" column - Train and apply a Random Forest model to predict the "Target" column - Use a tree depth of 5 and 160 models Part II: Cross Validation - Create a 10-fold cross validation for your model - Take a look at the error rates produced by the different iterations. Does the model seem stable? X-Partitioner X-Aggregator Partitioning Random ForestLearner Random ForestPredictor Random ForestLearner Random ForestPredictor Scorer Table Reader Scorer

Nodes

Extensions

Links