Icon

BME499_​PROJECT_​KNIMEFINAL_​FXCHEN002_​CHENFUXUAN.knar

Future Label Engineering (2h, 4h, 6, 8h) + window splits

Hyperparameter tuning for XGboost model

Data Partitioning at Patient Level (70-15-15)

Initial cleaning and imputation for static features

To sort and imputation of Vital Signs in 'Per-patient loop' structure

Imputation of Lab variables in 'Per-patient loop' structure

ADDED XXX_MISSING FOR 26 LAB VARIABLES EACH

Feature Engineering

Created 3 hours lag (History) for all vital and lab markers

Lag1 = t - 1

Lag2 = t - 2

Lag3 = t - 3

Created 3 delta (Sudden changes) for all vital and lab markers

Delta1 = t - (t-1)

Delta2 = t - (t-2)

Delta3 = t - (t-3)

Created Rolling averages (Trends) for all vital and lab markers with mean values and resolve missing values for incomplete windows.

Window type: backward

Window length: 3

Time Encoding Features

Sanity Check Temporal Features (Lag and Delta correctness for one patient)

Groupby patient with ICULOS max.

Prefer a patient with ICULOS >50 to check temporal features

Lag1 = t - 1
Lag2 = t - 2
Lag3 = t - 3
Delta1 = t - (t-1)
Delta2 = t - (t-2)
Delta3 = t - (t-3)

Mean/MA = [ t + (t-1), + (t-2) / 3 ]
Since Physionet 2019 dataset, Sepsislabel as 1 indicated that sepsis will occur within the next 6 hours.
Created 2h, 4h, 6h, and 8h windows to challenge if the model is able to predict sepsis earlier than 6 hrs before clinical recognition.
SepsisLabel_2h, Sepsis will occur within the next 8 hours
SepsisLabel_4h, Sepsis will occur within the next 10 hours
SepsisLabel_6h ,Sepsis will occur within the next 12 hours
SepsisLabel_8h, Sepsis will occur within the next 14 hours
Remove patient data that has less than 9 rows of data.

Reason: The node crashes if the table has fewer rows than the window size. Since sepsislabel_8h is window length = 9

It need at least 9 rows of per patient data for the node to function without errors.

328 patients has less than 9 rows of data will be excluded.

total patient strength will be 40008 instead of 40336

Sanity Check Temporal Features (Future label)

Training set (70%)
Validation set (15%)
Test set (15%)

Sepsislabel window

Sepsislabel_2h window

Sepsislabel_8h window

Sepsislabel_6h window

Sepsislabel_4h window

Class imbalance of Sepsislabel

Class imbalance of Sepsislabel_2h

Class imbalance of Sepsislabel_4h

Class imbalance of Sepsislabel_6h

Class imbalance of Sepsislabel_8h

Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Nested Hyperparameter tuning for Logistic Regression model

Evaluation for Logistic Regression Model (Test Set)

Evaluation for XGBoost Model (Test Set)

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for Logistic Regression Model (Test Set)

Hyperparameter tuning for Logistic Regression model

Evaluation for XGBoost Model (Test Set)

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for Logistic Regression Model (Test Set)

Hyperparameter tuning for Logistic Regression model

Evaluation for XGBoost Model (Test Set)

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for Logistic Regression Model (Test Set)

Hyperparameter tuning for Logistic Regression model

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for Logistic Regression Model (Test Set)

Hyperparameter tuning for Logistic Regression model

Evaluation for XGBoost Model (Test Set)

Evaluation for XGBoost Model (Test Set)

MICU

SICU

Sepsislabel_8h window

Class imbalance of Sepsislabel_8h

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Hyperparameter tuning for XGboost model

Data Partitioning at Patient Level (70-15-15)

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Hyperparameter tuning for XGboost model

Class imbalance of Sepsislabel_6h

Sepsislabel_4h window

Class imbalance of Sepsislabel_4h

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Sepsislabel_2h window

Class imbalance of Sepsislabel_2h

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Evaluation for XGBoost Model (Test Set)

Training set (70%)
Validation set (15%)
Test set (15%)

Class imbalance of Sepsislabel

Sepsislabel_8h window

Class imbalance of Sepsislabel_8h

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Hyperparameter tuning for XGboost model

Data Partitioning at Patient Level (70-15-15)

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Hyperparameter tuning for XGboost model

Class imbalance of Sepsislabel_6h

Sepsislabel_4h window

Class imbalance of Sepsislabel_4h

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Sepsislabel_2h window

Class imbalance of Sepsislabel_2h

Hyperparameter tuning for XGboost model

Training set (70%)
Validation set (15%)
Test set (15%)
Remove patient_ID to prevent the models to memorize it during model training which cause data leakage

Evaluation for XGBoost Model (Test Set)

Evaluation for XGBoost Model (Test Set)

Training set (70%)
Validation set (15%)
Test set (15%)

Class imbalance of Sepsislabel

Column Renamer
Missing Value
Concatenate
ROC Curve (legacy)
Bar Chart
GroupBy
ROC Curve (legacy)
Table Column to Variable
Scorer (JavaScript)
log
Parameter Optimization Loop Start
Logistic Regression Predictor
Parameter Optimization Loop End
Heatmap (JavaScript) (legacy)
Reference Row Filter
Table Column to Variable
Scorer (JavaScript)
Normalizer
Missing Value
Number to String
Column Resorter
Logistic Regression Predictor
Rule Engine
Rule Engine
ROC Curve (legacy)
Excel Reader
XGBoost Predictor
Sorter
Sorter
Scatter Plot
Row Filter
Line Plot
Bar Chart
CSV Reader
Row Filter
Scatter Plot
GroupBy
Heatmap
Sorter
Unpivot
Row Filter
Sorter
Heatmap
Reference Row Filter
Scorer (JavaScript)
Missing Value
Joiner
ROC Curve (legacy)
Box Plot
XGBoost Predictor
Box Plot
ROC Curve (legacy)
Unpivot
XGBoost Tree Ensemble Learner
Sorter
String to Number
Reference Row Filter
XGBoost Predictor
Row Filter
GroupBy
String to Number
Universal patient list testing dataset (15%)
CSV Reader
String to Number
Missing Value (Apply)
Joiner
Number to String
Number to String
Missing Value (Apply)
Number to String
Reference Row Filter
XGboost
Parameter Optimization Loop Start
Missing Value (Apply)
Column Filter
Number to String
Universal patient list validation dataset (15%)
CSV Reader
Column Filter
Row Filter
Row Filter
Row Filter
Universal patient list training dataset (70%)
CSV Reader
Bar Chart
Reference Row Filter
Table Row to Variable
Column Splitter
Column Filter
Number to String
Column Filter
Column Splitter
Champion
XGBoost Tree Ensemble Learner
Column Filter
Column Splitter
FiO2
Scatter Plot
Temp
Scatter Plot
Column Splitter
Stratified sampling to ensure the ratio of the class imbalance is preserved in both partition70% Training set (first)30% validation and test set (second)
Table Partitioner
Normalizer (Apply)
Convert max*sepsislabel column to string type (To proceed stratified sampling)
Number to String
Normalizer (Apply)
XGBoost Tree Ensemble Learner
Groupby patient_ID
GroupBy
ROC Curve (legacy)
Stratified sampling15% validation set (first)15% test set (second)
Table Partitioner
Missing Value (Apply)
Shuffle
XGBoost Predictor
ROC Curve (legacy)
Universal patient list training dataset (70%)
CSV Writer
Missing Value
Rule Engine
XGboost
Parameter Optimization Loop Start
Universal patient list testing dataset (15%)
CSV Writer
Bar Chart
Universal patient list validation dataset (15%)
CSV Writer
Table Column to Variable
Reference Row Filter
GroupBy
log
Parameter Optimization Loop Start
Reference Row Filter
Logistic Regression Predictor
ROC Curve (legacy)
Universal patient list training dataset (70%)
CSV Reader
Logistic Regression Learner
Reference Row Filter
Scorer (JavaScript)
Universal patient list testing dataset (15%)
CSV Reader
Table Row to Variable
Universal patient list validation dataset (15%)
CSV Reader
Universal patient list testing dataset (15%)
CSV Reader
GroupBy
Rule Engine
Champion
Logistic Regression Learner
Rule Engine
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
Number to String
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
Bar Chart
Table Column to Variable
Table Column to Variable
Reference Row Filter
Reference Row Filter
XGBoost Predictor
Missing Value (Apply)
Parameter Optimization Loop End
Rule Engine
XGBoost Predictor
GroupBy
Normalizer
GroupBy
Number to String
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
Universal patient list validation dataset (15%)
CSV Reader
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
Universal patient list training dataset (70%)
CSV Reader
Rule Engine
Row Filter
Column Splitter
Reference Row Filter
GroupBy
ROC Curve (legacy)
Column Splitter
Column Splitter
Row Filter
Sorter
Sorter
Number to String
Bar Chart
Table Row to Variable
Column Splitter
Champion
XGBoost Tree Ensemble Learner
Column Splitter
Logistic Regression Predictor
Column Splitter
Scorer (JavaScript)
ROC Curve (legacy)
Number to String
Scorer (JavaScript)
Equal Size Sampling
Number to String
Number to String
Missing Value (Apply)
Derive MAP with formulaAppend col: MAP_CALC
Math Formula
Champion
XGBoost Tree Ensemble Learner
Rule Engine
Row Filter
Column Filter
CombineMAP_CLAC with MAPAppend Col: MAP_Clean
Rule Engine
String to Number
ROC Curve (legacy)
remove MAP and MAP_CALC
Column Filter
String to Number
GroupBy
Normalizer (Apply)
Universal patient list testing dataset (15%)
CSV Reader
String to Number
Universal patient list validation dataset (15%)
CSV Reader
Rule Engine
ROC Curve (legacy)
Rule Engine
Normalizer (Apply)
Rename MAP_CALC to MAP
Column Renamer
Rule Engine
XGBoost Tree Ensemble Learner
Rule Engine
XGBoost Predictor
Rule Engine
Rule Engine
Rule Engine
Rule Engine
ROC Curve (legacy)
Rule Engine
Scorer (JavaScript)
Rule Engine
XGBoost Predictor
Rule Engine
Rule Engine
Rule Engine
Rule Engine
Rule Engine
Rule Engine
Rule Engine
Bar Chart
Table Column to Variable
Reference Row Filter
XGBoost Predictor
Parameter Optimization Loop End
Column Renamer
Constant Value Column Appender
Number to String
Scorer (JavaScript)
CSV Reader
Table Column to Variable
Column Splitter
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
XGBoost Predictor
Row Filter
XGBoost Predictor
Table Row to Variable
Parameter Optimization Loop End
Missing Value (Apply)
ROC Curve (legacy)
Sorter
ICU monitors collect continuously, Missing value for SBP, DBP, HR, O2Sat,Temp, Resp, EtCO2,MAP represent recording gaps, not true absence. Replaced with previous value (Forward fill)
Missing Value
Group Loop Start
Sorter
Impute SBP/DBP with backward fill
Missing Value
After forward and backward fill, DBP still has 54 missing value
Statistics View
Number to String
Added SBP_MISSING
Rule Engine
Added DBP_MISSING
Rule Engine
ROC Curve (legacy)
Missing Value (Apply)
Number to String
ROC Curve (legacy)
Row Filter
Column Splitter
GroupBy
Scorer (JavaScript)
XGboost
Parameter Optimization Loop Start
XGBoost Predictor
Row Filter
ROC Curve (legacy)
Reference Row Filter
XGBoost Predictor
Stratified sampling to ensure the ratio of the class imbalance is preserved in both partition70% Training set (first)30% validation and test set (second)
Table Partitioner
Missing Value (Apply)
Champion
XGBoost Tree Ensemble Learner
Bar Chart
Rule Engine
Convert max*sepsislabel column to string type (To proceed stratified sampling)
Number to String
Column Splitter
Sorter
Missing Value
GroupBy
Row Filter
Column Splitter
Missing Value (Apply)
Champion
XGBoost Tree Ensemble Learner
Column Filter
XGBoost Tree Ensemble Learner
Universal patient list validation dataset (15%)
CSV Reader
Reference Row Filter
Reference Row Filter
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
Sorter
Column Splitter
Sorter
Table Row to Variable
Column Filter
Parameter Optimization Loop End
Table Column to Variable
GroupBy
Number to String
Row Filter
CSV Reader
Column Splitter
Number to String
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
Champion
XGBoost Tree Ensemble Learner
Universal patient list training dataset (70%)
CSV Reader
Rule Engine
Column Filter
Universal patient list testing dataset (15%)
CSV Reader
ROC Curve (legacy)
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
GroupBy
Rule Engine
Universal patient list testing dataset (15%)
CSV Reader
CSV Reader
Universal patient list validation dataset (15%)
CSV Reader
Reference Row Filter
Column Splitter
Reference Row Filter
Row Filter
Reference Row Filter
Table Row to Variable
Column Splitter
Column Filter
Universal patient list validation dataset (15%)
CSV Reader
Universal patient list testing dataset (15%)
CSV Reader
XGboost
Parameter Optimization Loop Start
Rule Engine
Column Splitter
Row Filter
Column Splitter
Row Filter
Universal patient list training dataset (70%)
CSV Reader
Universal patient list training dataset (70%)
CSV Reader
Missing Value (Apply)
XGboost
Parameter Optimization Loop Start
Number to String
High Class imbalance for Sepsislabel Sepsis: 2686 (6.71%) Non-Sepsis: 37322 (93.29%)
Pie Chart
Reference Row Filter
Universal patient list validation dataset (15%)
CSV Writer
Universal patient list training dataset (70%)
CSV Writer
XGBoost Predictor
Universal patient list testing dataset (15%)
CSV Writer
Reference Row Filter
Missing Value
GroupBy
XGboost
Parameter Optimization Loop Start
Shuffle
ROC Curve (legacy)
Parameter Optimization Loop End
Rule Engine
Missing Value (Apply)
Rule Engine
Rule Engine
CSV Reader
XGBoost Predictor
XGBoost Predictor
Table Row to Variable
Missing Value (Apply)
ROC Curve (legacy)
Groupby patient_ID
GroupBy
Sorter
XGBoost Tree Ensemble Learner
Rule Engine
Stratified sampling15% validation set (first)15% test set (second)
Table Partitioner
Row Filter
Sorter
ROC Curve (legacy)
Convert max*sepsislabel column to string type (To proceed stratified sampling)
Number to String
Stratified sampling to ensure the ratio of the class imbalance is preserved in both partition70% Training set (first)30% validation and test set (second)
Table Partitioner
Reference Row Filter
Bar Chart
Scorer (JavaScript)
Missing Value (Apply)
Champion
XGBoost Tree Ensemble Learner
Column Splitter
Row Filter
Reference Row Filter
Table Column to Variable
Sorter
Column Filter
Parameter Optimization Loop End
Column Filter
Missing Value
Column Filter
Missing Value (Apply)
Row Filter
XGboost
Parameter Optimization Loop Start
XGboost
Parameter Optimization Loop Start
Row Filter
Rule Engine
Universal patient list training dataset (70%)
CSV Reader
Column Filter
Column Splitter
Row Filter
Table Row to Variable
Logistic Regression Predictor
Scorer (JavaScript)
Unpivot
Column Filter
Sorter
GroupBy
Bar Chart
Row Filter
Column Filter
Bar Chart
GroupBy
Row Filter
Unpivot
Sorter
Joiner
ROC Curve (legacy)
Column Renamer
Rule Engine
Rule Engine
Rule Engine
Rule Engine
Column Renamer
Rule Engine
Column Renamer
Column Renamer
Column Renamer
XGBoost Predictor
XGBoost Tree Ensemble Learner
Normalizer (Apply)
CSV Reader
Universal patient list validation dataset (15%)
CSV Reader
XGBoost Predictor
Missing Value (Apply)
ROC Curve (legacy)
Number to String
String to Number
ROC Curve (legacy)
String to Number
Row Filter
ROC Curve (legacy)
String to Number
XGBoost Predictor
Table Column to Variable
ROC Curve (legacy)
ROC Curve (legacy)
Table Row to Variable
Universal patient list training dataset (70%)
CSV Reader
Reference Row Filter
Reference Row Filter
Number to String
XGboost
Parameter Optimization Loop Start
Reference Row Filter
Missing Value
Missing Value (Apply)
Row Filter
Column Splitter
Table Column to Variable
Sorter
Universal patient list validation dataset (15%)
CSV Reader
Sorter
Universal patient list testing dataset (15%)
CSV Reader
Missing Value
Bar Chart
Missing Value (Apply)
Bar Chart
Number to String
ROC Curve (legacy)
Number to String
log
Parameter Optimization Loop Start
Number to String
Scorer (JavaScript)
Champion
XGBoost Tree Ensemble Learner
Logistic Regression Predictor
Missing Value (Apply)
Column Splitter
XGBoost Predictor
Logistic Regression Learner
Scorer (JavaScript)
Table Row to Variable
Universal patient list testing dataset (15%)
CSV Reader
Bar Chart
Table Row to Variable
XGboost
Parameter Optimization Loop Start
Rule Engine
ROC Curve (legacy)
Champion
XGBoost Tree Ensemble Learner
Universal patient list training dataset (70%)
CSV Reader
Column Splitter
ROC Curve (legacy)
Column Splitter
Row Filter
Table Column to Variable
Champion
Logistic Regression Learner
Column Splitter
Parameter Optimization Loop End
Universal patient list validation dataset (15%)
CSV Reader
Equal Size Sampling
Reference Row Filter
Table Column to Variable
Missing Value (Apply)
Parameter Optimization Loop End
Number to String
Number to String
Table Column to Variable
Row Sampler
Reference Row Filter
Universal patient list training dataset (70%)
CSV Reader
Reference Row Filter
XGBoost Tree Ensemble Learner
Parameter Optimization Loop End
Missing Value (Apply)
Scorer (JavaScript)
Number to String
Number to String
Column Splitter
XGBoost Predictor
Normalizer (Apply)
Bar Chart
Row Sampler
Logistic Regression Predictor
Number to String
Normalizer
XGBoost Tree Ensemble Learner
Reference Row Filter
CSV Reader
Champion
Logistic Regression Learner
Normalizer (Apply)
Table Row to Variable
XGBoost Tree Ensemble Learner
Row Sampler
XGBoost Predictor
Universal patient list validation dataset (15%)
CSV Reader
Missing Value (Apply)
Number to String
ROC Curve (legacy)
Row Sampler
Row Filter
ROC Curve (legacy)
Universal patient list training dataset (70%)
CSV Reader
XGBoost Predictor
Champion
XGBoost Tree Ensemble Learner
Table Row to Variable
ROC Curve (legacy)
Reference Row Filter
XGboost
Parameter Optimization Loop Start
Missing Value
Row Filter
Table Column to Variable
Sorter
Sorter
Bar Chart
Sorter
Bar Chart
log
Parameter Optimization Loop Start
Row Filter
Bar Chart
Sorter
Row Filter
ROC Curve (legacy)
Bar Chart
Scorer (JavaScript)
Logistic Regression Predictor
Column Splitter
ICULOS_quad
Scatter Plot
Logistic Regression Learner
Table Row to Variable
Universal patient list testing dataset (15%)
CSV Reader
Column Splitter
Table Row to Variable
Logistic Regression Predictor
Shapley Values Loop Start
Champion
XGBoost Tree Ensemble Learner
Shapley Values Loop End
Column Splitter
Champion
Logistic Regression Learner
Parameter Optimization Loop End
Scatter Plot
Equal Size Sampling
Shapley Values Loop End
Missing Value (Apply)
Unpivot
Number to String
CSV Writer
Parameter Optimization Loop End
Shapley Values Loop Start
Table Column to Variable
Row Filter
Number to String
CSV Reader
Reference Row Filter
CSV Reader
CSV Reader
CSV Reader
Normalizer (Apply)
Normalizer (Apply)
ROC Curve (legacy)
XGBoost Tree Ensemble Learner
XGboost
Parameter Optimization Loop Start
Missing Value (Apply)
XGBoost Predictor
Column Splitter
Column Splitter
Column Splitter
Table Row to Variable
Champion
XGBoost Tree Ensemble Learner
Parameter Optimization Loop End
Equal Size Sampling
Missing Value (Apply)
Row Filter
XGBoost Predictor
Reference Row Filter
Number to String
XGBoost Predictor
Normalizer
GroupBy
ROC Curve (legacy)
Line Plot
Reference Row Filter
Number to String
Line Plot
Create future labelSepsislabel_2h
Moving Aggregator
Universal patient list validation dataset (15%)
CSV Reader
Create future labelSepsislabel_6h
Moving Aggregator
Create future labelSepsislabel_8h
Moving Aggregator
Universal patient list training dataset (70%)
CSV Reader
Create future labelSepsislabel_4h
Moving Aggregator
Column Name Replacer
Missing Value (Apply)
Universal patient list testing dataset (15%)
CSV Reader
Number to String
Column Name Replacer
Column Name Replacer
Logistic Regression Learner
Column Name Replacer
Table Row to Variable
Reference Row Filter
Champion
Logistic Regression Learner
Row Filter
Value Counter
Patient_ID 188 will be choosenICULOS max = 84
Row Filter
Universal patient list testing dataset (15%)
CSV Reader
Universal patient list training dataset (70%)
CSV Reader
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
CSV Reader
CSV Reader
Universal patient list testing dataset (15%)
CSV Reader
Row Filter
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
Number to String
GroupBy
Column Filter
Sorter
Data Explorer
Column Resorter
Rule Engine
Line Plot
Universal patient list training dataset (70%)
CSV Reader
ROC Curve (legacy)
XGboost
Parameter Optimization Loop Start
Column Splitter
Reference Row Filter
Missing Value (Apply)
Column Splitter
Reference Row Filter
Universal patient list validation dataset (15%)
CSV Reader
Column Splitter
Reference Row Filter
Column Splitter
Reference Row Filter
Table Row to Variable
High Class imbalance for Sepsislabel Sepsis: 2686 (6.71%) Non-Sepsis: 37322 (93.29%)
Pie Chart
Row Filter
GroupBy
Bar Chart
Universal patient list testing dataset (15%)
CSV Writer
Column Splitter
Column Filter
log
Parameter Optimization Loop Start
Champion
XGBoost Tree Ensemble Learner
Missing Value
Column Splitter
Reference Row Filter
Logistic Regression Predictor
Number to String
Number to String
Universal patient list training dataset (70%)
CSV Writer
XGBoost Predictor
Missing Value
Universal patient list validation dataset (15%)
CSV Writer
Rule Engine
Groupby patient_ID
GroupBy
ROC Curve (legacy)
Logistic Regression Learner
Stratified sampling15% validation set (first)15% test set (second)
Table Partitioner
Missing Value (Apply)
Shuffle
XGBoost Predictor
Parameter Optimization Loop End
Rule Engine
XGboost
Parameter Optimization Loop Start
Normalizer
CSV Reader
XGBoost Tree Ensemble Learner
Column Name Replacer
Column Renamer
Using Python script to create delta features
Python Script
Constant Value Column Appender
Constant Value Column Appender
Constant Value Column Appender
ICULOS representation (Log)
Math Formula
Loop End
Moving Aggregator
Concatenate
ICULOS representation (Quadratic)
Math Formula
Sorter
Patient_ID 188 will be choosenICULOS max = 84
Row Filter
Row Filter
GroupBy
Column Renamer
Column Renamer
Constant Value Column Appender
Column Renamer
Column Filter
Constant Value Column Appender
Column Resorter
Column Renamer
Constant Value Column Appender
Sorter
Column Renamer
Group Loop Start
Column Renamer
Line Plot
Data Explorer
Constant Value Column Appender
Loop End
Constant Value Column Appender
Sorter
Sorter
Group Loop Start
Constant Value Column Appender
Column Renamer
Column Renamer
Row Filter
Column Filter
Row Filter
Row Filter
Rule Engine
Rule Engine
Rule Engine
Rule Engine
Rule Engine
Row Filter
Rule Engine
Row Filter
Rule Engine
Rule Engine
Row Filter
Row Filter
impute 25 lab variables with forward fill
Missing Value
Row Filter
Row Filter
Sorter
Rule Engine
Sorter
Sorter
Sorter
High Class imbalance for Sepsislabel Sepsis: 2686 (6.71%) Non-Sepsis: 37322 (93.29%)
Pie Chart
Lag Column(Multi Column)
Row Filter
Column Name Replacer
Sorter
Lag Column(Multi Column)
Lag Column(Multi Column)
Sorter
Sorter
Column Name Replacer
Sorter
Sorter
Scorer (JavaScript)
Number to String
Rule Engine
Table Column to Variable
Missing Value
Parameter Optimization Loop End
Reference Row Filter
Rule Engine
Number to String
Rule Engine
Heatmap
Row Splitter
Heatmap
Row Filter
GroupBy
Unpivot
Column Filter
Joiner
Math Formula
Loop End
Bar Chart
Rule Engine
Loop End
Group Loop Start
Row Filter
GroupBy
Sorter
CSV Reader
XGBoost Tree Ensemble Learner
XGBoost Tree Ensemble Learner
Universal patient list validation dataset (15%)
CSV Reader
Reference Row Filter
High Class imbalance for Sepsislabel Sepsis: 2506 (6.29%) Non-Sepsis: 37322 (93.71%)
Pie Chart
CSV Reader
Remove column0 and Hour
Column Filter
Rename unit1 and unit2 to MICU and SICU
Column Renamer
check missing value for static categorical features
Statistics View
Replaced missing value for SICU, MICU and HospAdmTime with fix value 0
Missing Value
Statistics View
Column Splitter
Column Splitter
Table Row to Variable
Sorter
Champion
XGBoost Tree Ensemble Learner
XGBoost Tree Ensemble Learner
XGBoost Predictor
Number to String
Missing Value (Apply)
Column Splitter
Sorted Patient_ID and ICULOS in ascending
Sorter
XGboost
Parameter Optimization Loop Start
Bar Chart
Universal patient list training dataset (70%)
CSV Reader
Rule Engine
Row Filter
ROC Curve (legacy)
Scorer (JavaScript)
Column Splitter
XGBoost Predictor
Champion
XGBoost Tree Ensemble Learner
Table Row to Variable
Number to String
Reference Row Filter
Missing Value (Apply)
Reference Row Filter
Missing Value
Column Splitter
Table Column to Variable
Universal patient list validation dataset (15%)
CSV Reader
ROC Curve (legacy)
Universal patient list testing dataset (15%)
CSV Reader
Sorter
Reference Row Filter
Parameter Optimization Loop End
CSV Reader
Number to String
Missing Value
Missing Value
Rule Engine
Parameter Optimization Loop End
Table Column to Variable
Equal Size Sampling
Rule Engine
Number to String
Table Column to Variable
Rule Engine
Parameter Optimization Loop End
Reference Row Filter
Reference Row Filter
Scorer (JavaScript)
Parameter Optimization Loop End
Scorer (JavaScript)
Scorer (JavaScript)
XGBoost Predictor
Number to String
Sorter
Normalizer (Apply)
Logistic Regression Predictor
ROC Curve (legacy)
Reference Row Filter
Universal patient list testing dataset (15%)
CSV Reader
Parameter Optimization Loop End
XGBoost Tree Ensemble Learner
Reference Row Filter
Normalizer
Number to String

Nodes

Extensions

Links