Icon

Class Project 2 - Heart Failure Work flow - GBoost and LogReg

Data Explorer: Contains all columns for statistical view
CSV Reader
Checks for missing values and excludes them.
Missing Value
Scatter Plots: Shows correlations and patterns
Scatter Plots
Gradient Boosted Trees Learner
One hot-encoding
One to Many
Gradient Boosted Trees Predictor
GBoost Evaluation Metrics
Scorer
Logistic Regression Evaluation Metrics
Scorer
Filter out inconclusive columns and their values
Column Filter
Logistic Regression Learner
Split data into train (80%) and test (20%)
Table Partitioner
Logistic Regression Predictor
Box Plots: Shows whether numeric variables differ between Heart Failure vs non Heart Failure groups
Box Plots
Bar charts: Shows class imbalance and category distribution
Bar Charts
Correlation: Shows which numeric variables are correlated, and which is important for modeling.
Linear Correlation
Histograms: Shows the distribution, skewness, outliers of attributes in the dataset.
Histograms
Number to String (PMML)

Nodes

Extensions

Links