Icon

final team project1

Age could be translated into values from 1 to 10. Weight is almost always missing. Payer code and medical specialty is usually missing.
CSV Reader
Statistics
Logistic Regression Predictor
Too many missing weight values for it to be a useful feature and removing ids that could lead to bias. Payer code and medical specialty are missing over 50% of the time so those were also removed.
Column Filter
CSV Reader
Changes ? values in race to Unknown to make results more readable.
Rule Engine
Logistic Regression Predictor
Changes ? values in race to Unknown to make results more readable.
Rule Engine
ROC Curve
Normalizes large columns like num lab procedures and num medications to prevent bias in logistic regression as large values can lead to bias.
Normalizer
partitions data into training and test
Table Partitioner
Scorer
Normalizes large columns like num lab procedures and num medications to prevent bias in logistic regression as large values can lead to bias.
Normalizer
Too many missing weight values for it to be a useful feature and removing ids that could lead to bias. Payer code and medical specialty are missing over 50% of the time so those were also removed.
Column Filter
ROC Curve
Creates new readmitted in 30 column to which sorts into yes if less than 30 days before remittance and to no if more than 30 days or no readmittance
Rule Engine
Logistic Regression Learner
Changes diag 1 back to String
Number to String
Removes .0 values that were a holdover from changing a float to a string, but keeps decimals.
String Manipulation

Nodes

Extensions

Links