
s_605_spark_prepare_data

s_605 - use the stored rules and lists to actually prepare the data

s_605 - apply the label encoding and the other transformations stored as SQL code, and select the final columns via the stored RegEx string
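To make the step concrete, here is a minimal PySpark sketch of the same idea outside KNIME: apply a stored SQL transformation string to the data, then keep only the columns matched by a stored RegEx. The plain-text file paths and the exact SQL content are assumptions for illustration; the workflow itself keeps these artefacts in KNIME .table files.

```python
# Minimal PySpark sketch (assumption: the stored SQL and RegEx strings have
# been exported to plain text files; the workflow keeps them in .table files).
import re
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("s_605_prepare_data").getOrCreate()

# Register the raw training data so the stored SQL can reference it as 'train'
train = spark.read.parquet("../data/census_income_train.parquet")
train.createOrReplaceTempView("train")

# Apply the stored label-encoding / CAST SQL in one pass
with open("../model/spark_label_encode_sql.txt") as f:  # hypothetical export
    encode_sql = f.read()
encoded = spark.sql(encode_sql)

# Keep only the columns matched by the stored RegEx include string
with open("../model/regex_include_string.txt") as f:    # hypothetical export
    include = re.compile(f.read().strip())
prepared = encoded.select(*[c for c in encoded.columns if include.match(c)])
```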

Get the results back and export them to .parquet files so you can use them in a powerful R or Python environment (or leave them on the big data system). Of course you could also do the model building in Spark with a genuine Spark ML model or H2O.ai Sparkling Water. All that matters is that the result is a MOJO file that KNIME can read and apply via Sparkling Water.
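A sketch of the export step, assuming `prepared_train` and `prepared_test` are the prepared Spark DataFrames from the step above: write them to .parquet so pandas (with pyarrow) or R's arrow package can pick them up locally.

```python
# Write the prepared data to .parquet for downstream R/Python work
# (or skip this step and keep the tables on the big data system).
prepared_train.write.mode("overwrite").parquet("../data/data_70_file.parquet")
prepared_test.write.mode("overwrite").parquet("../data/data_30_file.parquet")

# Read a result back locally, e.g. with pandas (pyarrow installed)
import pandas as pd
local_train = pd.read_parquet("../data/data_70_file.parquet")
```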

Recovered from the workflow canvas (annotations, in workflow order):

- => create a local big data context. If you encounter any problems, close KNIME, delete all data from the folder /big_data/ and start over.
- Of course you could also develop a model directly with H2O and Sparkling Water on your Big Data cluster, but the free version of H2O.ai AutoML is not there yet. See "Combine Big Data, Spark and H2O.ai Sparkling Water": https://hub.knime.com/mlauber71/space/kn_example_h2o_sparkling_water
- keep Target: make sure the Target column is kept through the column filters.
- Column Rename: "-" to "_" (applied to both train and test).
- SQL - CAST the numeric variables (../model/nvl_numeric_sql.table, nvl_numeric).
- SQL - encode the labels with SQL (../model/spark_label_encode_sql.table, spark_label_encoder).
- RegEx of remaining columns (../model/d_regex_spark_include_500.table, regex_include_string): regex filter columns with the Spark Column Filter.
- normalize: apply the stored normalizer (../model/spark_normalizer.pmml) to train and test with the Spark Transformations Applier.
- customer_numbers: simulates the existence of a customer number that would be needed to export the relevant data lines.
- Housekeeping: once you are finished with Spark, drop the temporary Hive tables (DROP TABLE IF EXISTS default.train; DROP TABLE IF EXISTS default.test;) and destroy the Spark context.

Data files:

- ../data/census_income_train.parquet — the training data
- ../data/census_income_train.parquet — the test data (path as annotated)
- ../data/data_70_file.parquet, ../data/data_30_file.parquet, ../data/data_normalized_30_file.parquet — the exported results

Nodes used: local big data context create, Spark SQL Query, Spark to Table, Column Filter, Reference Column Filter, Destroy Spark Context, DB SQL Executor, Hive to Spark, Column Rename, Spark Transformations Applier, Merge Variables, Spark Column Filter, Table Reader, Table Row to Variable, Parquet Reader, DB Table Creator, DB Loader, DB Table Selector, PMML Reader, Java Snippet (simple), Parquet Writer.
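The housekeeping annotations translate to two DROP statements and a context shutdown. A sketch, assuming the same `spark` session as in the sketches above (in the workflow this is done by the DB SQL Executor and Destroy Spark Context nodes):

```python
# Housekeeping: remove the temporary Hive tables once Spark is finished,
# then tear down the context (Destroy Spark Context in the workflow).
spark.sql("DROP TABLE IF EXISTS default.train")
spark.sql("DROP TABLE IF EXISTS default.test")
spark.stop()
```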
