06_Modularized_Spark_Scripting

Spark Java snippet nodes

This workflow demonstrates the usage of the different Spark Java Snippet nodes to read a text file from HDFS, parse it, filter it and write the result back to HDFS.

You might also want to have a look at the provided snippet templates that each of the node provides. In order to do so simply open the configuration dialog of a Spark Java Snippet node and go to the Templates tab.

Note that this workflow requires that access to a Hadoop cluster running Apache Spark 1.2.1 or newer

Nodes

Spark RDD Java Snippet3 ×
Component Input1 ×
Component Output1 ×
Create Spark Context (Jobserver)1 ×
Delete Files1 ×
Show all 13 nodes

Extensions

FeatureKNIME Base nodes
FeatureKNIME Basic File System Connectors
FeatureKNIME Big Data Connectors
FeatureKNIME Extension for Apache Spark
FeatureKNIME Extension for Apache Spark (legacy)

06_​Modularized_​Spark_​Scripting

Nodes

Extensions

Links

Download

06_Modularized_Spark_Scripting