Icon

03_​Spark_​for_​Performance

Performance and Scalability Testing: Example KNIME with Spark

This workflows show how to learn a random forest using the KNIME Apache Spark nodes.

We here are measuring the speed of the workflow with the last metanode. In addition it collects the max used memory and the start parameters of this instantiation of the KNIME Analytics Platform.

Preprocessing Data Conversion and Transfer Performance and Scalability Testing: Example KNIME with SparkThis workflows show how to learn a random forest using the KNIME Apache Spark nodes.We here are measuring the speed of the workflow with the last metanode. In addition it collects the max used memory and the startparameters of this instantiation of the KNIME Analytics Platform. Metanode Input Port: Connected to first node to be measured.Timer Info Node Input Port:Connected to last node to be measured. Receivedata from caller workfowRead Input DataIncrease iterationsfor mutliple evaluationrounds Table to Spark Spark Partitioning Spark Scorer CSV Writer ContainerOutput (Table) ContainerInput (Table) File Reader Capture Accuracy Benchmark End(Memory Monitoring) Benchmark Start(Memory Monitoring) Combine Spark RandomForest Learner Spark Predictor(Classification) Create Local BigData Environment Preprocessing Data Conversion and Transfer Performance and Scalability Testing: Example KNIME with SparkThis workflows show how to learn a random forest using the KNIME Apache Spark nodes.We here are measuring the speed of the workflow with the last metanode. In addition it collects the max used memory and the startparameters of this instantiation of the KNIME Analytics Platform. Metanode Input Port: Connected to first node to be measured.Timer Info Node Input Port:Connected to last node to be measured. Receivedata from caller workfowRead Input DataIncrease iterationsfor mutliple evaluationroundsTable to Spark Spark Partitioning Spark Scorer CSV Writer ContainerOutput (Table) ContainerInput (Table) File Reader Capture Accuracy Benchmark End(Memory Monitoring) Benchmark Start(Memory Monitoring) Combine Spark RandomForest Learner Spark Predictor(Classification) Create Local BigData Environment

Nodes

Extensions

Links