Icon

1. Data Preparation

KBL: Data Preparation for Classification
Workflow: Data Preparation This workflow prepares the data for the next workflow (My First Data Model)and uses some of the most common data preparations:- subsetting (Row Sampling and Partitioning nodes)- Strategies to deal with missing values (Missing Value node)- Shuffling (Shuffle node)- Concatenation of data sets (Concatenate node)- Normalization (Normalizer and Normalizer (Apply) nodes) 20% subsetrandomly drawn with seed50% training setdrawn with linear samplingno seedtraining set + test settraining settest setadult.data setwith knime:// protocolbuild normalization lawon training dataapply normalization transformationbuilt on training set to test setmissing values in:age -> mean valueincome -> remove row Row Sampling Partitioning Shuffle Concatenate CSV Writer CSV Writer File Reader Normalizer Normalizer (Apply) Missing Value Workflow: Data Preparation This workflow prepares the data for the next workflow (My First Data Model)and uses some of the most common data preparations:- subsetting (Row Sampling and Partitioning nodes)- Strategies to deal with missing values (Missing Value node)- Shuffling (Shuffle node)- Concatenation of data sets (Concatenate node)- Normalization (Normalizer and Normalizer (Apply) nodes) 20% subsetrandomly drawn with seed50% training setdrawn with linear samplingno seedtraining set + test settraining settest setadult.data setwith knime:// protocolbuild normalization lawon training dataapply normalization transformationbuilt on training set to test setmissing values in:age -> mean valueincome -> remove rowRow Sampling Partitioning Shuffle Concatenate CSV Writer CSV Writer File Reader Normalizer Normalizer (Apply) Missing Value

Nodes

Extensions

Links