Linear Correlation

This Node Is Deprecated — This version of the node has been replaced with a new and improved version. The old version is kept for backwards-compatibility, but for all new workflows we suggest to use the version linked below.

Calculates for each pair of selected columns a correlation coefficient, i.e. a measure of the correlation of the two variables.

Which correlation measure is applied depends on the types of the underlying variables:
numeric <-> numeric: Pearson's product-moment coefficient. Missing values in such columns are ignored (the corresponding records are skipped). The value of this measure ranges from -1 (strong negative correlation) to 1 (strong positive correlation). A value of 0 represents no correlation.
nominal <-> nominal: Pearson's chi square test on the contingency table. This value is then normalized to a range [0,1] using Cramer's V, whereby 0 represents no correlation and 1 a strong correlation. Missing values in nominal columns are treated such as they were a self-contained possible value. If one of the two columns contains more possible values than specified in the dialog (default 50), the correlation will not be computed.
Correlation measures for other pairs of columns are not available, they are represented by missing values in the output table and crosses in the accompanying view.

Options

Column Filter: Select columns for which correlation values shall be computed.
Possible Values Count: Select an upper bound for the number of possible values for each of the nominal columns. If more values are encountered in a nominal column, the column will be ignored (no correlation values will be computed).

Input Ports

: Numeric input data to evaluate

Output Ports

: Correlation variables in a square matrix
: A model containing the correlation measures. This model is appropriate to be read by the Correlation Filter node.

Popular Predecessors

No recommendations found

Popular Successors

No recommendations found

Views

Correlation Matrix: Squared table view showing the pair-wise correlation values of all columns. The color range varies from dark red (strong negative correlation), over white (no correlation) to dark blue (strong positive correlation). If a correlation value for a pair of column is not available, the corresponding cell contains a missing value (shown as cross in the color view).

Workflows

No workflows found

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Installation

To use this node in KNIME, install the extension KNIME Base nodes from the below update site following our NodePit Product and Node Installation Guide:

v5.6

A zipped version of the software site can be downloaded here.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 5.6.0.v202507241546

On NodePit since: 2025-08-15

Last update: 2025-08-18

Tags: Deprecated

KNIME versions: Since v4.2

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!