
2_linear regression_solution

train

Model requirements

Data Collection

Data Cleaning

Data Labeling

Feature Engineering

Model Training

Model Evaluation

Model Deployment

Model Monitoring

Model requirements

Medical Cost Personal Datasets

Problem Identification

Objectives & Resources

Data Collection

Load Data

Raw Data Extraction (CSV Reader)

Understand Data (Statistics)

Data Cleaning

Keep only the relevant columns (Column Filter)

Handle missing values (Missing Value)
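
For readers who want to follow the cleaning step outside KNIME, here is a minimal Python sketch of what the Column Filter and Missing Value nodes do. The column names follow the public Medical Cost Personal dataset; the row values, the `KEEP` selection, and the choice of mean imputation are assumptions made for illustration, not settings taken from the workflow.

```python
from statistics import mean

# Made-up sample rows; only the column names come from the dataset.
rows = [
    {"age": 19, "sex": "female", "bmi": 27.9, "smoker": "yes",
     "region": "southwest", "charges": 16884.92},
    {"age": 18, "sex": "male", "bmi": None, "smoker": "no",
     "region": "southeast", "charges": 1725.55},
    {"age": 28, "sex": "male", "bmi": 33.0, "smoker": "no",
     "region": "southeast", "charges": 4449.46},
]

# Assumed column selection (hypothetical, not read from the workflow).
KEEP = ["age", "bmi", "smoker", "charges"]


def filter_columns(rows, keep):
    """Column Filter analogue: keep only the listed columns, in order."""
    return [{k: r[k] for k in keep} for r in rows]


def impute_mean(rows, column):
    """Missing Value analogue: replace None with the mean of observed values."""
    observed = [r[column] for r in rows if r[column] is not None]
    fill = mean(observed)
    return [{**r, column: fill if r[column] is None else r[column]}
            for r in rows]


cleaned = impute_mean(filter_columns(rows, KEEP), "bmi")
```

The Missing Value node offers other strategies as well (for example a fixed value or the median); the mean is used here only because it is the simplest to check by hand.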

Data Labeling

Calculate Domains (Domain Calculator)

Feature Engineering

One-Hot Encoding (One to Many)

Split data into train/test (80/20)

Add Baseline (Constant Value Column Appender)

pred_baseline - No

Score Baseline (Scorer)
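
The encoding and partitioning steps above can be sketched in plain Python as follows. This is an illustration under assumptions: the rows are made up, the helper names `one_hot` and `partition` are hypothetical, and the shuffle seed is arbitrary. It is not an export of the KNIME nodes.

```python
import random


def one_hot(rows, column):
    """One to Many analogue: one 0/1 column per category of `column`."""
    categories = sorted({r[column] for r in rows})
    encoded = []
    for r in rows:
        new = {k: v for k, v in r.items() if k != column}
        for c in categories:
            new[f"{column}_{c}"] = 1 if r[column] == c else 0
        encoded.append(new)
    return encoded


def partition(rows, train_fraction=0.8, seed=42):
    """Table Partitioner analogue: shuffled 80/20 split (seed is arbitrary)."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Ten made-up rows, so the split yields 8 training and 2 test rows.
rows = [
    {"age": 20 + i, "smoker": "yes" if i % 3 == 0 else "no",
     "charges": 1000.0 * (i + 1)}
    for i in range(10)
]

encoded = one_hot(rows, "smoker")
train, test = partition(encoded)
```

Fixing the seed keeps the split reproducible, which matters when the baseline and the model are scored on the same test rows.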

Model Training

Decision Tree Learner

Model Evaluation

Decision Tree Learner

Model Deployment

Decision Tree Learner

Feature Engineering

Add Baseline (GroupBy, Constant Value Column Appender - join_key = 1)

Joiner

Column Renamer

Numeric Scorer
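
The baseline chain above (GroupBy, Constant Value Column Appender with join_key = 1, Joiner, Column Renamer, Numeric Scorer) can be read as "predict the mean charge for every row, then score it". Below is a rough Python sketch of that idea. It assumes the mean is computed on the training partition and scored on the test partition; the data values and helper names are hypothetical.

```python
from math import sqrt
from statistics import mean

# Made-up partitions; only the target column is needed for the baseline.
train = [{"charges": 1000.0}, {"charges": 3000.0}, {"charges": 5000.0}]
test = [{"charges": 2000.0}, {"charges": 4000.0}]

# GroupBy with no group column: a one-row table holding the mean charge.
mean_table = [{"Mean(charges)": mean(r["charges"] for r in train)}]


def append_constant(rows, name, value):
    """Constant Value Column Appender analogue."""
    return [{**r, name: value} for r in rows]


# Give both tables the same constant key so every row can be matched.
mean_keyed = append_constant(mean_table, "join_key", 1)
test_keyed = append_constant(test, "join_key", 1)

# Joiner: inner join on join_key attaches the mean to each test row.
joined = [
    {**t, **m}
    for t in test_keyed
    for m in mean_keyed
    if t["join_key"] == m["join_key"]
]

# Column Renamer: expose the mean as the baseline prediction.
scored = [
    {"charges": r["charges"], "pred_baseline": r["Mean(charges)"]}
    for r in joined
]


def numeric_scores(rows, actual, predicted):
    """Numeric Scorer analogue: mean absolute error and RMSE."""
    errors = [r[actual] - r[predicted] for r in rows]
    return {
        "mae": mean(abs(e) for e in errors),
        "rmse": sqrt(mean(e * e for e in errors)),
    }


baseline_scores = numeric_scores(scored, "charges", "pred_baseline")
```

The constant join key is only a device for broadcasting a single value onto every row; any trained model should beat these baseline scores to be worth deploying.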

Statistics
Column Filter
CSV Reader
Domain Calculator
One to Many
train - 80%, test - 20%
Table Partitioner
Mean Charge
GroupBy
Column Filter
Constant Value Column Appender
Joiner
Linear Regression Learner
Constant Value Column Appender
Regression Predictor
Column Renamer
Numeric Scorer
Scatter Plot
Numeric Scorer
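
As a companion to the Linear Regression Learner, Regression Predictor, and Numeric Scorer nodes listed above, the following sketch fits an ordinary least squares model with NumPy. All data here is synthetic: the charges are generated from a known linear formula (an assumption chosen so the fitted coefficients can be checked), and the function names are hypothetical rather than KNIME APIs.

```python
import numpy as np


def add_intercept(X):
    """Prepend a column of ones so the model learns an intercept."""
    return np.column_stack([np.ones(len(X)), X])


def fit_linear_regression(X, y):
    """Learner analogue: least-squares coefficients, intercept first."""
    coef, *_ = np.linalg.lstsq(add_intercept(X), y, rcond=None)
    return coef


def predict(coef, X):
    """Predictor analogue: apply the fitted coefficients to new rows."""
    return add_intercept(X) @ coef


def numeric_scores(y_true, y_pred):
    """Scorer analogue: R^2, mean absolute error, and RMSE."""
    residuals = y_true - y_pred
    ss_res = float(np.sum(residuals ** 2))
    ss_tot = float(np.sum((y_true - y_true.mean()) ** 2))
    return {
        "r2": 1.0 - ss_res / ss_tot,
        "mae": float(np.mean(np.abs(residuals))),
        "rmse": float(np.sqrt(np.mean(residuals ** 2))),
    }


def synthetic_charges(X):
    """Made-up ground truth: 500 + 250*age + 300*bmi + 20000*smoker_yes."""
    return 500.0 + X @ np.array([250.0, 300.0, 20000.0])


# Feature columns: age, bmi, smoker_yes (already one-hot encoded).
X_train = np.array([
    [19, 27.9, 1], [18, 33.8, 0], [28, 33.0, 0],
    [33, 22.7, 0], [32, 28.9, 0], [46, 33.4, 1],
], dtype=float)
X_test = np.array([[25, 26.2, 0], [60, 25.8, 1]], dtype=float)

y_train = synthetic_charges(X_train)
y_test = synthetic_charges(X_test)

coef = fit_linear_regression(X_train, y_train)
scores = numeric_scores(y_test, predict(coef, X_test))
```

Because the synthetic target is exactly linear, the fit recovers the generating coefficients and the test error is essentially zero; on the real dataset the scores would be compared against the baseline instead.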
