04_Parameter_Optimization_in_Spark

Mix and match Spark nodes with other KNIME nodes

This workflow mixes standard KNIME nodes with the Spark nodes to find the optimal parameters for a k-means clustering using the hillclimbing approach. Other optimization strategies are available - check the Parameter Optimization Loop Start Node description for more.

The workflow makes use of the Create Local Big Data Environment node to create a Spark context. You can swap this node out for a Create Spark Context (Livy) node to connect to a remote cluster.

Nodes

File Reader (Complex Format)2 ×
Cluster Assigner1 ×
Create Local Big Data Environment1 ×
Entropy Scorer1 ×
Parameter Optimization Loop End1 ×
Show all 12 nodes

Extensions

FeatureKNIME Base nodes
FeatureKNIME Extension for Apache Spark
FeatureKNIME Extension for Apache Spark (legacy)
FeatureKNIME Optimization extension

04_​Parameter_​Optimization_​in_​Spark

Nodes

Extensions

Links

Download

04_Parameter_Optimization_in_Spark