Icon

EAISI - Ames housing example flow

Descriptive analysis

In order to understand your data, you first make some basic descriptives and plots. You can repeat this step if you have made new variables.

Feature engineering

If you have analyzed your data, you can construct new features. These are a few examples. After feature engineering, it is wise to again check some basic statistics (such as the correlation matrix).

Data cleaning

If you have performed your descriptive analysis, you want to remove redundant rows based on your scope (if necessary) and have a look at your missing values.

Modelling

If you have your dataset ready, you can start analyzing. This is the basic flow to test a linear regression model, but you can easily extend this to (1) different models and (2) classification problems. It is logical to go back in your flow based on the test results you have. You can add / remove variables and run your flow again.

Data input

Loading in your data from your csv files.

housing
CSV Reader
Correlation matrix
Linear Correlation
Missing valueanalysis
Missing Value
Save your model
Model Writer
isNew
Math Formula
Bar chart
Bar Chart
reModeled
Math Formula
totalSqFeet
Math Formula
houseAge
Math Formula
Histogram
Histogram
Descriptives
Statistics
Apply your train / test split
Table Partitioner
Filter based on scope
Row Filter
Model testing
Regression Predictor
Select relevant variables
Column Filter
Evaluate the performance
Numeric Scorer
Correlation matrix
Linear Correlation
Scatter plot
Scatter Plot
Model building
Linear Regression Learner

Nodes

Extensions

Links