Spark Numeric Scorer

This node computes several statistics comparing a numeric column's reference values (r_i) with the predicted values (p_i). It computes R² = 1 - SS_res/SS_tot = 1 - Σ(p_i-r_i)²/Σ(r_i-1/n*Σr_i)² (which can be negative), mean absolute error (1/n*Σ|p_i-r_i|), mean squared error (1/n*Σ(p_i-r_i)²), root mean squared error (sqrt(1/n*Σ(p_i-r_i)²)), and mean signed difference (1/n*Σ(p_i-r_i)). The computed values can be inspected in the node's view and/or processed further using the output table.
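The formulas above can be illustrated with a minimal plain-Python sketch. It is not the node's Spark implementation; the function name numeric_scores and the dictionary keys are hypothetical, chosen only for this example.

```python
import math

def numeric_scores(reference, predicted):
    """Return the five scores for paired reference (r_i) and predicted (p_i) values.

    Hypothetical helper for illustration; not part of KNIME or Spark.
    """
    n = len(reference)
    if n == 0 or n != len(predicted):
        raise ValueError("need two non-empty sequences of equal length")
    errors = [p - r for r, p in zip(reference, predicted)]  # p_i - r_i
    mean_r = sum(reference) / n                             # 1/n * sum of r_i
    ss_res = sum(e * e for e in errors)                     # sum of (p_i - r_i)^2
    ss_tot = sum((r - mean_r) ** 2 for r in reference)      # sum of (r_i - mean)^2
    mse = ss_res / n
    return {
        # In this sketch, ss_tot == 0 (all r_i equal) raises ZeroDivisionError.
        "R^2": 1 - ss_res / ss_tot,
        "mean absolute error": sum(abs(e) for e in errors) / n,
        "mean squared error": mse,
        "root mean squared error": math.sqrt(mse),
        "mean signed difference": sum(errors) / n,
    }

scores = numeric_scores([1.0, 2.0, 3.0, 4.0], [1.5, 2.0, 2.5, 4.0])
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

The mean signed difference keeps the sign of each error, so over- and under-predictions can cancel out, while the mean absolute error cannot.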

Options

Reference column: Column with the correct, observed, training data values. Rows with missing values in the selected column are ignored.
Predicted column: Column with the modeled, predicted data values. Computation fails if the selected column contains missing values.
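The two missing-value rules can be sketched as a small filter in plain Python. This is an illustration under assumptions, not the node's code: the name pair_rows is hypothetical, None stands in for a missing value, and the predicted column is checked first, so a row missing both values also fails here.

```python
def pair_rows(rows):
    """Filter (reference, predicted) pairs following the two rules above.

    Hypothetical helper for illustration; None marks a missing value.
    """
    kept = []
    for reference, predicted in rows:
        if predicted is None:
            # Missing predicted value: the whole computation fails.
            raise ValueError("predicted column contains a missing value")
        if reference is None:
            # Missing reference value: the row is ignored.
            continue
        kept.append((reference, predicted))
    return kept

usable = pair_rows([(1.0, 1.1), (None, 2.0), (3.0, 2.9)])
print(usable)
```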
Change column name: Change the default output column name.
Output column name: The name of the column in the output.
Output scores as flow variables: The scores can be exported as flow variables.
Prefix of flow variables: Defines a prefix for the flow variable identifiers so that name conflicts are avoided.
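Why a prefix helps can be shown with plain dictionaries standing in for flow variables. This is only a sketch: the function export_scores, the variable name "R^2" and the prefix "spark " are hypothetical and do not reflect the identifiers KNIME actually uses.

```python
def export_scores(existing_variables, scores, prefix=""):
    """Merge scores into a copy of the existing variables, adding a prefix to each name.

    Hypothetical helper for illustration; not a KNIME API.
    """
    merged = dict(existing_variables)
    for name, value in scores.items():
        merged[prefix + name] = value
    return merged

existing = {"R^2": 0.5}   # e.g. exported earlier by another scorer
new_scores = {"R^2": 0.9}

without_prefix = export_scores(existing, new_scores)
with_prefix = export_scores(existing, new_scores, prefix="spark ")
print(without_prefix)
print(with_prefix)
```

Without a prefix the earlier value is overwritten; with a prefix both values are kept under distinct names.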

Input Ports

Arbitrary input Spark DataFrame/RDD with at least two numeric columns to compare.

Output Ports

The computed statistical measures:

R² - coefficient of determination, 1-SS_res/SS_tot
Mean squared error - 1/n*Σ((p_i-r_i)²)
Mean absolute error - 1/n*Σ|p_i-r_i|
Root mean squared error - sqrt(1/n*Σ((p_i-r_i)²))
Mean signed difference - 1/n*Σ(p_i - r_i)
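R² becomes negative when the predictions fit the reference values worse than always predicting their mean. A small plain-Python sketch with made-up numbers shows the three typical cases; the helper r_squared is hypothetical and simply applies 1-SS_res/SS_tot.

```python
def r_squared(reference, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot (illustrative helper)."""
    mean_r = sum(reference) / len(reference)
    ss_res = sum((p - r) ** 2 for r, p in zip(reference, predicted))
    ss_tot = sum((r - mean_r) ** 2 for r in reference)
    return 1 - ss_res / ss_tot

reference = [1.0, 2.0, 3.0]
perfect = r_squared(reference, [1.0, 2.0, 3.0])    # predictions match exactly
baseline = r_squared(reference, [2.0, 2.0, 2.0])   # always predict the mean
reversed_fit = r_squared(reference, [3.0, 2.0, 1.0])  # worse than the mean
print(perfect, baseline, reversed_fit)  # 1.0 0.0 -3.0
```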

Views

Statistics: A table with the statistical measures

Installation

To use this node in KNIME, install the extension KNIME Extension for Apache Spark (legacy) from the v5.6 update site, following the NodePit Product and Node Installation Guide.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 5.6.0.v202507151409

On NodePit since: 2025-08-15

Last update: 2025-08-15

KNIME versions: Since v3.6
