Icon

3_​logistic regression_​empty

train

Model requirements

Data Collection

Data Cleaning

Data Labeling

Feature Engineering

Model Training

Model Evaluation

Model Deployment

Model Monitoring

Model requirements

Titanic - Machine Learning from Disaster | Kaggle

Problem Identification

Objectives & Resources

Data Collection

Load Data

Raw Data Extraction (CSV Reader)

Understand Data (Statistics)

Data Cleaning

Filter Columns that are important (Column Filter)

Add missing values (Missing Value)

Data Labeling

Prepare target Variable (Rule Engine)

$Survived$ = 1 => "Yes"

$Survived$ =0 => "No"

Calculate Domains (Domain Calculator)

Feature Engineering

One Hot Enconding (One to Many)

Cluster Data into train/test) (80/20)

Add Baseline (Constant Value Column Appender)

pred_baseline - No

Score Baseline (Scorer)

Model Training

Decistion Tree Learner

Model Evaluation

Decistion Tree Learner

Model Deployment

Decistion Tree Learner

Scorer
Train Split
CSV Reader
Table Partitioner
Constant Value Column Appender

Nodes

Extensions

Links