Icon

02_​Spark_​Executor

This directory contains 17 workflows.

Icon01_​Spark_​MLlib_​Decision_​Tree 

This workflow demonstrates the usage of the Spark MLlib Decision Tree Learner and Spark Predictor. It also demonstrates the conversion of categorical […]

Icon02_​Mass_​Learning_​Event_​Prediction_​MLlib_​to_​PMML 

This workflow demonstrates the usage of the Spark MLlib to PMML node. Together with the Compiled Model Predictor and the JSON Input/Output node it can be […]

Icon03_​PMML_​to_​Spark_​Comprehensive_​Mode_​Learning_​Mass_​Prediction 

This workflow demonstrates the usage of the Spark Compiled Model Predictor node which converts a given PMML model into machine code and uses the compiled […]

Icon04_​Parameter_​Optimization_​in_​Spark 

This workflow mixes standard KNIME nodes with the Spark nodes to find the optimal parameters for a k-means clustering using the hillclimbing approach. Other […]

Icon05_​Hive_​to_​Spark_​to_​Hive 

This workflow demonstrates the usage of the Hive to Spark and Spark to Hive nodes that allow you to transfer data between Apache Spark and Apache […]

Icon06_​Modularized_​Spark_​Scripting 

This workflow demonstrates the usage of the different Spark Java Snippet nodes to read a text file from HDFS, parse it, filter it and write the result back […]

Icon07_​SparkSQL_​meets_​HiveQL 

This workflow builds a line plot of the age distribution for men and women in Maine (US) over the last 5 years. In particular, women's data is processed […]

Icon08_​Learning_​Asociation_​Rule_​for_​Next_​Restaurant_​Prediction 

In this workflow we demonstrate how to use the KNIME Spark nodes for giving locality recommendations. For this we are using the Yelp reviews as provided by […]

Icon09_​Big_​Data_​Irish_​Meter_​on_​Spark_​only 

Local big data Irish meter This workflow uses a portion of the Irish Energy Meter dataset, and presents a simple analysis based on the whitepaper "Big […]