Icon

final team project1

Age could be translated into values from 1 to 10. Weight is almost always missing. Payer code and medical specialty is usually missing.
CSV Reader
Statistics
Logistic Regression Predictor
Too many missing weight values for it to be a useful feature and removing ids that could lead to bias. Payer code and medical specialty are missing over 50% of the time so those were also removed.
Column Filter
CSV Reader
Changes ? values in race to Unknown to make results more readable.
Rule Engine
CSV Writer
Changes ? values in race to Unknown to make results more readable.
Rule Engine
Column Filter
Decision Tree Predictor
Scorer
ROC Curve
partitions data into training and test
Table Partitioner
Normalizes large columns like num lab procedures and num medications to prevent bias in logistic regression as large values can lead to bias.
Normalizer
Logistic Regression Learner
Logistic Regression Predictor
Number to String
Random Forest Learner
Random Forest Predictor
Domain Calculator
Random Forest Predictor
Normalizes large columns like num lab procedures and num medications to prevent bias in logistic regression as large values can lead to bias.
Normalizer
Scorer
ROC Curve
Too many missing weight values for it to be a useful feature and removing ids that could lead to bias. Payer code and medical specialty are missing over 50% of the time so those were also removed.
Column Filter
Scorer
Decision Tree Learner
Changes diag 1 back to String
Number to String
Removes .0 values that were a holdover from changing a float to a string, but keeps decimals.
String Manipulation
ROC Curve

Nodes

Extensions

Links