
s_601_spark_label_encoder

s_601 - Sparkling predictions and encoded labels - "the poor man's ML Ops" (on a Big Data System)

Use Big Data technologies like Spark to get a robust and scalable data preparation. Use the latest AutoML technology like H2O.ai AutoML to create a robust model and deploy it in a Big Data environment (like Cloudera).

s_601 - prepare label encoding with spark
prepare the data in a big data environment:
- label encode string variables
- transform numbers into Double format (Spark ML likes that)
- remove highly correlated data
- remove NaN variables
- remove continuous variables
- optional: normalize the data
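To illustrate the label-encoding step: the workflow emits a Spark SQL CASE WHEN command (see spark_label_encode_sql.csv below). A hedged Python sketch of how such an expression could be generated — this is not the workflow's actual code, and the column name "workclass" and its values are invented for illustration:

```python
# Hedged sketch (not the workflow's actual generator): build a Spark SQL
# CASE WHEN expression that label encodes one string column.
# "workclass" and its values are invented example data.

def case_when_encoder(column: str, values: list[str]) -> str:
    """Map each distinct string value of `column` to an integer code."""
    whens = " ".join(
        f"WHEN {column} = '{v}' THEN {i}" for i, v in enumerate(sorted(values))
    )
    # unseen values fall through to -1
    return f"CASE {whens} ELSE -1 END AS {column}_encoded"

sql = case_when_encoder("workclass", ["Private", "Self-emp", "State-gov"])
```

One expression like this per string column, concatenated into a SELECT, reproduces the encoding on any new data — which is why the workflow stores the SQL itself as an artifact.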





the data used is a cleaned and updated version of the Census Income dataset
https://archive.ics.uci.edu/ml/datasets/census+income

=> Adapt the handling of numeric variables to your needs!

additional things you could do to your data:
- reduce dimensions with PCA
- balance the targets to 50/50
- normalize or log() transform your data
- replace missing values with a more sophisticated approach
- remove correlated variables (or adapt the Filter Correlation Matrix step if you want to keep them)

please remember: we want to do it all with Spark and on a Big Data cluster. And make sure you reproduce all the transformations (with the exception of balancing/SMOTE) on your real-life and test data.

list of output data and tables from s_601:
- nvl_numeric.txt = CAST SQL command for all selected numeric variables
- nvl_numeric_sql.table = CAST SQL command as a KNIME table
- spark_label_encode_sql.csv = SQL command with CASE WHEN to label encode string variables
- spark_label_encode_sql.table = SQL command with CASE WHEN to label encode string variables (as a KNIME table)
- spark_label_encoder_regex_filter.table = regex to filter the string variables you want to handle with the above rules
- spark_missing.pmml = general rules for dealing with missing values
- spark_normalizer.pmml = normalization rules

full workflow: https://hub.knime.com/mlauber71/spaces/Public/latest/kn_example_bigdata_h2o_automl_spark_46

selected annotations from the workflow:
- this is where the magic happens of label encoding (Spark SQL Query):
  SELECT 1 AS dummy_const_var $${Sspark_lable_encoder}$$ , $${Snvl_numeric}$$ FROM #table#
- nvl_numeric = how to handle the numeric variables with CAST commands. !!! Careful: in this example negative values are handled as 0 (zero)
- create a local big data context. If you encounter any problems, close KNIME, delete all data from the folder /big_data/ and start over
- ../data/census_income_train.parquet = the training data, loaded into the table data_train
- set a partitioning command on the tab "Additional Options": PARTITIONED BY (education STRING)
- drop education for the partition: ^(?!education$).*
- normalizer by decimal scaling; adapt to your needs and exclude the Target: ^(?!Target$).*
- customer_number simulates the existence of a customer number that would be needed to export the relevant data lines; exclude it with ^(?!customer_number$).*
- REFRESH TABLE #table# => make sure the Spark environment 'knows' about the table
- alternative: SQL Executor with DROP TABLE IF EXISTS default.data_all; (=> user action)
- reference tables: ../data/d_reference_spark_405_all.table, ../data/d_reference_spark_405_numeric.table, d_reference_spark_exclude_500.table, d_reference_spark_include_500.table
- models: ../model/spark_missing.pmml, ../model/spark_normalizer.pmml

nodes used: Spark Column Filter, Persist Spark DataFrame/RDD, Spark SQL Query, Spark Label Encoder, Table Writer, Spark Row Sampling, Spark to Table, Table Row to Variable, Spark Statistics, Spark Missing Value, Column Filter, Reference Column Filter, Hive to Spark, Column Rename, Destroy Spark Context, Spark Normalizer, DB Table Remover, Filter Correlation Matrix, Java Snippet (simple), Create Local Big Data Context, Parquet Reader, DB Table Creator, DB Loader, DB Table Selector, Table Reader, Merge Variables, PMML Writer, DB Column Filter
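The nvl_numeric CAST commands can be sketched the same way. A hedged Python sketch, not the workflow's actual generator: NVL and GREATEST are built-in Spark SQL functions, GREATEST(..., 0) mimics the "negative values are handled as 0" behaviour noted above, and the column names are invented for illustration:

```python
# Hedged sketch: one CAST expression per numeric column —
# NULL -> 0 (NVL), negatives -> 0 (GREATEST), cast to DOUBLE.
# Column names are invented example data.

def nvl_cast(columns: list[str]) -> str:
    """Build the comma-separated CAST fragment for a Spark SQL SELECT."""
    return ", ".join(
        f"CAST(GREATEST(NVL({c}, 0), 0) AS DOUBLE) AS {c}" for c in columns
    )

sql_fragment = nvl_cast(["age", "hours_per_week"])
```

A fragment like this is what gets substituted for the $${Snvl_numeric}$$ flow variable in the Spark SQL Query above.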

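The column filters above rely on negative-lookahead patterns such as ^(?!education$).* to keep every column except the named one. A quick demonstration of how such a pattern behaves (column names invented for illustration):

```python
import re

# Negative lookahead: the pattern matches any column name EXCEPT
# exactly "education". Column names are invented example data.
pattern = re.compile(r"^(?!education$).*")

cols = ["age", "education", "workclass"]
kept = [c for c in cols if pattern.fullmatch(c)]  # "education" is dropped
```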