Icon

Project_​v.01

creating DV

Builds the Decision Tree model
Decision Tree Learner
Measures the overall classification performance
ROC Curve
evaluates how well the model performed
Scorer
Balance classes in training folds
Equal Size Sampling
Uses the trained Decision Tree model to make predictions
Decision Tree Predictor
Combines the results from all folds into one final output
X-Aggregator
Measures the overall classification performance
ROC Curve
Gradient Boosted Trees Predictor
Splits the data into training and testing folds for cross-validation
X-Partitioner
Gradient Boosted Trees Learner
Balance classes in training folds
Equal Size Sampling
Combines the results from all folds into one final output
X-Aggregator
evaluates how well the model performed
Scorer
Accident
SAS7BDAT Reader
vehicle
SAS7BDAT Reader
Joiner
person
SAS7BDAT Reader
Joiner
distract
SAS7BDAT Reader
Joiner
Binned roadway surface condition into dry, wet, snow/ice/slush, other surface, and non-trafficway/driveway categories; not reported and unknown left missing.
Rule Engine
Created binary rollover variable: coded no rollover versus any rollover involvement.
Rule Engine
Created binned trafficway-description variable: grouped original VTRAFWAY codes into non-trafficway/driveway, two-way not divided, two-way divided, one-way trafficway, and entrance/exit ramp categories; not reported and unknown values were left missi
Rule Engine
Binned number of roadway lanes into 1–2 lanes and 3+ lanes; unknown or invalid lane values left missing.
Rule Engine
Binned pre-event vehicle movement into broader movement categories; no-driver and unknown values were left missing.
Rule Engine
Binned roadway alignment into straight versus curve categories; non-trafficway/driveway was retained and not reported/unknown values were left missing.
Rule Engine
Created binary vehicle-removal variable indicating whether the vehicle was towed or not towed.
Rule Engine
Created binary driver-distraction variable: coded 0 as not distracted and all specific distraction-related codes as distracted; not reported and unknown values left missing.
Rule Engine
Measures the overall classification performance
ROC Curve
Combines the results from all folds into one final output
X-Aggregator
Binned roadway grade into level and non-level grade categories; not reported and unknown values were left missing.
Rule Engine
Binned traffic control devices into no controls, traffic signals, regulatory signs, and other controls; not reported and unknown values were left missing.
Rule Engine
Created binary fire-occurrence variable: coded 0 as “No Fire” and 1 as “Fire Occurred.”
Rule Engine
Created binary interstate-highway variable: coded whether the crash occurred on an interstate highway
Rule Engine
Binned traffic control functioning into no controls, functioning properly, and not functioning properly; not reported and unknown values were left missing.
Rule Engine
Binned vehicle damage severity into no damage, minor, functional, and disabling damage; not reported and unknown values were left missing.
Rule Engine
INJ_SEV1 to 4 only
Row Filter
Handled missing predictor values before modeling. Numeric variables were replaced using the median,
Missing Value
RProp MLP Learner
INJ_SEVHigh vs Low
Numeric Binner
Color Manager
Final variables in the model
Column Filter
Logistic Regression Learner
Rule Engine
MultiLayerPerceptron Predictor
Logistic Regression Predictor
Splits the data into training and testing folds for cross-validation
X-Partitioner
Column Filter
Rule Engine
Rule Engine
Missing Value
valid male/female onlyRemoved not reported/unknown sex codes before recoding.
Rule Engine
Random Forest Learner
Created REST_USE_binnedCollapsed valid restraint-use codes into restrained vs. not restrained; other/not reported/unknown codes were left as missing.Created binary restraint-use variable. Valid restraint-use codes were grouped as “Restraint Used,”
Rule Engine
Math Formula
Created vehicle-count category from VE_TOTAL: coded crashes as single-vehicle or multiple-vehicle to capture crash complexity without using injury-outcome information.
Rule Engine
Numeric Binner
Excel Writer
Created binary alcohol-involvement variable: coded 0 as “No Alcohol” and 1 as “Alcohol Involved”; not reported and unknown values were left missing/unmatched.
Rule Engine
Cleaned numeric number-injured variable by retaining valid values 0–97 and leaving special/non-informative codes missing before modeling.
Rule Engine
Value Counter
Created binary police-reported drug involvement variable: coded 0 as “No Drug Involvement” and 1 as “Drug Involvement”; not reported and unknown values were left missing/unmatched.
Rule Engine
Created binary air bag deployment variableGrouped all air bag deployment codes as “Air Bag Deployed” and code 20 as “Air Bag Not Deployed”; non-informative codes such as not reported, not applicable, or unknown were left missing/unmatched.
Rule Engine
Compares the ROC performance of all models in one view
ROC Curve
Created binary ejection variable: coded 0 as “Not Ejected” and codes 1–3 as “Ejected”; not reported, not applicable, and unknown codes were left missing/unmatched.
Rule Engine
Column Appender
Created season variable from crash month: grouped months into Winter, Spring, Summer, and Fall to capture seasonal crash patterns.
Rule Engine
Created light-condition variable: grouped crash lighting into Daylight, Dark, and Dawn/Dusk; other, not reported, and unknown light-condition codes were left missing/unmatched.
Rule Engine
Binned initial impact location into front, side, rear, top, undercarriage, and non-collision/no-impact categories; unknown and not reported values were left missing.
Rule Engine
Created time-of-day variable from crash hour: grouped valid 0–23 hour values into late night, morning, afternoon, and evening/night; unknown hour values were left missing/unmatched.
Rule Engine
Created binned crash-type variable from ACC_TYPE: grouped detailed crash-type codes into broader crash-mechanism categories including single-driver, same-direction, opposite-direction, intersecting-paths, and backing/other crashes.
Rule Engine
Created binned first-harmful-event variable: grouped detailed harmful-event codes into non-collision, collision with motor vehicle, collision with non-fixed object, and collision with fixed object categories; unknown or unmatched codes were left miss
Rule Engine
evaluates how well the model performed
Scorer
Grouped critical precrash event into broader crash-mechanism categories to capture the event that immediately preceded the crash.
Rule Engine
Created weather-condition variable: grouped clear/cloudy conditions separately from adverse weather conditions; other, not reported, and unknown weather codes were left missing/unmatched.
Rule Engine
Splits the data into training and testing folds for cross-validation
X-Partitioner
Grouped original collision-orientation codes into interpretable categories; not reported and unknown codes were left missing/unmatched.
Rule Engine
Balance classes in training folds
Equal Size Sampling
Created binary hit-and-run variable: coded 0 as “No Hit-and-Run” and 1 as “Hit-and-Run”; any unmatched values were left missing.
Rule Engine
Measures the overall classification performance
ROC Curve
Naive Bayes Learner
Created binned specific junction-location variable: grouped RELJCT2 into non-junction, intersection-related, driveway access, ramp/acceleration-lane related, railway crossing, crossover, and other junction/interchange locations; not reported and unkn
Rule Engine
Combines the results from all folds into one final output
X-Aggregator
Created binary work-zone variable: coded 0 as “No Work Zone” and codes 1–4 as “Work Zone” to capture whether the crash was work-zone related.
Rule Engine
Naive Bayes Predictor
Random Forest Predictor
Splits the data into training and testing folds for cross-validation
X-Partitioner
Random Forest Learner
Combines the results from all folds into one final output
X-Aggregator
evaluates how well the model performed
Scorer
Balance classes in training folds
Equal Size Sampling
Measures the overall classification performance
ROC Curve
One to Many
Normalizer
evaluates how well the model performed
Scorer
Measures the overall classification performance
ROC Curve
Balance classes in training folds
Equal Size Sampling
Splits the data into training and testing folds for cross-validation
X-Partitioner
evaluates how well the model performed
Scorer
SEAT_POSDriver/11 only
Row Filter
Balance classes in training folds
Equal Size Sampling
Combines the results from all folds into one final output
X-Aggregator
Numeric Binner
Value Counter
BODY_TYPPassanger Car ONLY
Row Filter
Splits the data into training and testing folds for cross-validation
X-Partitioner
Column Filter
Missing Value
Rule Engine
Rule Engine
Rule Engine

Nodes

Extensions

Links