Icon

02_​Regression_​Tree_​solution

Regression Tree - Solution

Introduction to Machine Learning Algorithms course - Session 2
Solution to exercise 2
- Partition data into training and test set
- Train a regression tree model
- Apply the trained model to the test set
- Handle missing values
- Evaluate the model performance with the Numeric Scorer node



Regresion: price prediction Exercise: Regression TreeIn this exercise we will predict the price of an house in Ames (Iowa, USA) given a number of features: size, neighborhood, heating type, ..1) Add Partitioning node to File Reader output - Top port should have 70 % of the rows - Draw randomly such rows2) Add Simple Regression Tree Learner to top output port of Partitioning node - Select price column to be learned - Execute the node and open its decision tree view. Which column is used in the beginning of the tree?3) Add Simple Regression Tree Predictor - Predict test set (remaining 30% rows) by simply connecting the remaining unconnected output ports4) Remove Rows with missing values5) Add Numeric Scorer to Regression Predictor Output - Reference Column: the column you learned - Predicted Column: the new column created by the predictor node housing dataset Partitioning Numeric Scorer Simple RegressionTree Learner Simple RegressionTree Predictor Missing Value CSV Reader Regresion: price prediction Exercise: Regression TreeIn this exercise we will predict the price of an house in Ames (Iowa, USA) given a number of features: size, neighborhood, heating type, ..1) Add Partitioning node to File Reader output - Top port should have 70 % of the rows - Draw randomly such rows2) Add Simple Regression Tree Learner to top output port of Partitioning node - Select price column to be learned - Execute the node and open its decision tree view. Which column is used in the beginning of the tree?3) Add Simple Regression Tree Predictor - Predict test set (remaining 30% rows) by simply connecting the remaining unconnected output ports4) Remove Rows with missing values5) Add Numeric Scorer to Regression Predictor Output - Reference Column: the column you learned - Predicted Column: the new column created by the predictor node housing dataset Partitioning Numeric Scorer Simple RegressionTree Learner Simple RegressionTree Predictor Missing Value CSV Reader

Nodes

Extensions

Links