Icon

DSC723 - Flight Delays Group Project

Data Understanding & EDA

Target Construction & Leakage Removal

Data Cleaning

Encoding

Data Splitting Method 1 — Holdout (70% Train / 30% Test)

Feature Selection Method A — Variance Threshold

Model 1 — Logistic Regression

Model 2 — Decision Tree

Model 3 — Random Forest

Model 4 — Naive Bayes

Splitting Method 2 — 10-Fold Stratified Cross-Validation for Model 2 — Decision Tree

Splitting Method 2 — 10-Fold Stratified Cross-Validation for Model 5 — XGBoost

Splitting Method 2 — 10-Fold Stratified Cross-Validation for Model 2 — Decision Tree

Model 5 — XGBoost

Data Splitting Method 1 — Holdout (70% Train / 30% Test)

Model 2 — Decision Tree

Data Cleaning

Encoding

Splitting Method 2 — 10-Fold Stratified Cross-Validation for Model 2 — Decision Tree

Data Splitting Method 1 — Holdout (70% Train / 30% Test)

Model 2 — Decision Tree

Feature Selection Method A — Variance Threshold

Data Splitting Method 1 — Holdout (70% Train / 30% Test)

Splitting Method 2 — 10-Fold Stratified Cross-Validation for Model 2 — Decision Tree

Model 2 — Decision Tree

Target Construction

Feature Selection Method B — Chi-Square + Pearson Redundancy

Feature Selection Method B — Chi-Square + Pearson Redundancy

Naive Bayes Learner
Naive Bayes Predictor
X-Aggregator
Scorer
Scorer
X-Partitioner
XGBoost Predictor
Decision Tree Learner
Decision Tree Predictor
XGBoost Tree Ensemble Learner
Scorer
Scorer
X-Partitioner
X-Aggregator
Decision Tree Predictor
XGBoost Predictor
XGBoost Tree Ensemble Learner
CSV Reader
Column Filter
Linear Correlation
Column Renamer
Low Variance Filter
Joiner
Table Partitioner
Correlation Filter
Decision Tree Predictor
Decision Tree Predictor
X-Aggregator
X-Partitioner
Decision Tree Learner
Normalizer
Scorer
Normalizer
Table Partitioner
Normalizer (Apply)
Normalizer (Apply)
Scorer
Decision Tree Learner
Logistic Regression Predictor
Logistic Regression Learner
Statistics
Random Forest Predictor
Decision Tree Learner
Random Forest Learner
Scorer
Decision Tree Predictor
Linear Correlation
Scorer
Low Variance Filter
Scorer
Correlation Filter
Scorer
Decision Tree Learner
Decision Tree Predictor
Decision Tree Predictor
X-Partitioner
Group by departure hour
GroupBy
Bar Chart
Bar Chart
Scorer
Table Partitioner
Bar Chart
X-Aggregator
Histogram
Normalizer (Apply)
Rule Engine
Decision Tree Learner
Column Filter
Row Filter
Missing Value
Normalizer
Row Filter
Column Filter
Rule Engine
Rule Engine
Numeric Outliers
Group by carrier
GroupBy
Duplicate Row Filter
Joiner
Group by Dest
GroupBy
Joiner
Group by Origin
GroupBy
Joiner
Scorer
Decision Tree Learner
X-Partitioner
X-Aggregator
Normalizer (Apply)
Scorer
Table Partitioner
Normalizer
Decision Tree Learner
Decision Tree Predictor
Missing Value
Group by Origin
GroupBy
Joiner
Duplicate Row Filter
Numeric Outliers
Column Renamer
Column Filter
Group by Dest
GroupBy
Joiner
Group by carrier
GroupBy
Bar Chart
Group by month
GroupBy
Data Explorer
Group by carrier
GroupBy

Nodes

Extensions

Links