Icon

3_​logistic regression_​solution

train

Model requirements

Data Collection

Data Cleaning

Data Labeling

Feature Engineering

Model Training

Model Evaluation

Model Deployment

Model Monitoring

Model requirements

Titanic - Machine Learning from Disaster | Kaggle

Problem Identification

Objectives & Resources

Data Collection

Load Data

Raw Data Extraction (CSV Reader)

Understand Data (Statistics)

Data Cleaning

Filter Columns that are important (Column Filter)

Add missing values (Missing Value)

Data Labeling

Prepare target Variable (Rule Engine)

$Survived$ = 1 => "Yes"

$Survived$ =0 => "No"

Calculate Domains (Domain Calculator)

Feature Engineering

One Hot Enconding (One to Many)

Cluster Data into train/test) (80/20)

Add Baseline (Constant Value Column Appender)

pred_baseline - No

Score Baseline (Scorer)

Model Training

Decistion Tree Learner

Model Evaluation

Decistion Tree Learner

Model Deployment

Decistion Tree Learner

Train Split
CSV Reader
Column Filter
Missing Value
Rule Engine
Domain Calculator
Logistic Regression Predictor
One to Many
Scorer
Table Partitioner
Constant Value Column Appender
Column Filter
Logistic Regression Learner
Scorer

Nodes

Extensions

Links