Icon

RegressionFinal

Linear Regression Learner

Has an R^2 of 0.603

Removed features such as total bedrooms/rooms, population. Left out only the relevant stuff and that resulted in an increased accuracy

Random Forest Learner (Regression)

Has an R^2 of 0.689

Used features such as the ocean proximity and all things related to the location of the house, except latitude and longitude since they aren't of much help. Features such as bedrooms, rooms and were not considered since they decreased the accuracy

Gradient Boosted Trees Predictor (Regression)

Has an R^2 of 0.705

Used every feature we had and was used as the best machine learning regressor based on its accuracy. There are several parameters that were chosen to be optimized, such as the number of models, from 100 to 400. Increasing the number of models, implies decreasing the learning rate a little, thus the learning rate became 0.05. The depth of the tree was also increased to 6. This optimization improved accuracy, from 0.705 to 0.735.

The model trained directly with the whole dataset has an even better accuracy of 0.842.

CSV Reader
Used in order to split Ocean Proximity into numeric values
One to Many
Math Formula
Math Formula
Formulas used for determining population, rooms, bedrooms per household
Math Formula
Scatter Plot
Gradient Boosted Trees Learner (Regression)
Gradient Boosted Trees Predictor (Regression)
Numeric Scorer
Gradient Boosted Trees Predictor (Regression)
Column Filter
Numeric Scorer
Numeric Scorer
Scatter Plot
Random Forest Predictor (Regression)
Random Forest Learner (Regression)
X-Aggregator
Color Manager
X-Partitioner
X-Partitioner
Missing Value
Regression Predictor
X-Partitioner
X-Aggregator
Linear Regression Learner
Scatter Plot
Gradient Boosted Trees Learner (Regression)
Numeric Scorer
X-Aggregator

Nodes

Extensions

Links