
Deprecated. **KNIME Base Nodes** version **4.3.2.v202103021015** by **KNIME AG, Zurich, Switzerland**

Calculates a correlation coefficient for each pair of selected columns, i.e. a measure of the correlation between the two variables.

Which correlation measure is applied depends on the types of the
underlying variables:

`numeric <-> numeric`

:
Pearson's product-moment coefficient.
Missing values in a column are ignored in such a way that for the
computation of the correlation between two columns only complete
records are taken into account. For instance, if there are three
columns A, B and C and a row contains a missing value in column A
but not in B and C, then the row will be ignored for computing the
correlation between (A, B) and (A, C). It will not be ignored for
the correlation between (B, C). This corresponds to the function
*cor(<data.frame>, use="pairwise.complete.obs")*
in the R statistics package.

The value of this measure ranges from -1 (strong negative
correlation) to 1 (strong positive correlation). A value of 0
represents no linear correlation (the columns might still be
highly dependent on each other, though).
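The pairwise handling of missing values described above can be sketched in a few lines of plain Python. This is an illustrative sketch only, not the node's implementation: the function name `pearson_pairwise`, the use of `None` for missing cells, and returning `None` for constant or too-short columns are all assumptions made for the example.

```python
import math

def pearson_pairwise(xs, ys):
    """Pearson's r using only rows where both values are present.

    Missing cells are represented by None (an assumption for this sketch).
    Returns None when r is undefined (fewer than two complete rows, or a
    constant column); how the node reports that case is not stated here.
    """
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    n = len(pairs)
    if n < 2:
        return None
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    if sxx == 0 or syy == 0:
        return None
    return sxy / math.sqrt(sxx * syy)

# Column A has a missing value in the first row; B and C are complete.
A = [None, 2.0, 3.0, 4.0]
B = [1.0, 2.0, 3.0, 4.0]
C = [4.0, 3.0, 2.0, 1.0]

r_ab = pearson_pairwise(A, B)  # first row is skipped for (A, B)
r_bc = pearson_pairwise(B, C)  # all four rows are used for (B, C)

# A column that depends on another non-linearly can still have r = 0.
xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
r_square = pearson_pairwise(xs, [x * x for x in xs])
```

The last call illustrates the caveat above: the second column is fully determined by the first, yet the linear correlation is zero.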

`nominal <-> nominal`

:
Pearson's chi-squared test on the contingency table.
The test statistic is then normalized to the range [0,1] using
Cramér's V, where 0 represents no correlation and 1
a strong correlation. Missing values in nominal columns are
treated as if they were a separate possible value of their own.
If one of the two columns contains more possible values than
specified in the dialog (default 50), the correlation will not
be computed.
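As a rough illustration of the nominal case, the sketch below computes the chi-squared statistic of the contingency table and normalizes it with Cramér's V, V = sqrt(chi² / (n · (min(r, c) − 1))), where r and c are the numbers of distinct values in the two columns. The function name `cramers_v`, the `max_values` parameter, the use of `None` for missing cells, and counting the missing category towards the value limit are assumptions of this example, not documented behavior of the node.

```python
import math
from collections import Counter

def cramers_v(xs, ys, max_values=50):
    """Cramér's V for two nominal columns of equal length.

    None is treated as an ordinary category of its own. Returns None when
    a column has more than max_values distinct values or when V is
    undefined (empty input, or a column with a single category).
    """
    if len(set(xs)) > max_values or len(set(ys)) > max_values:
        return None
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals))
    if n == 0 or k < 2:
        return None
    observed = Counter(zip(xs, ys))
    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            expected = row_total * col_total / n
            chi2 += (observed.get((a, b), 0) - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (k - 1)))

v_perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
v_none = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
v_missing = cramers_v(["a", None, "a", None], ["x", "y", "x", "y"])
v_skipped = cramers_v(["a", "b", "c"], ["x", "y", "z"], max_values=2)
```

In this sketch `v_missing` shows the missing value acting as a regular category, and `v_skipped` shows a pair being skipped because a column exceeds the value limit.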

Correlation measures for other pairs of columns (for example
numeric with nominal) are not available; they are represented by
missing values in the output table and by crosses in the accompanying view.

- Include
- This list contains the names of those columns in the input table to be included in the output table.
- Exclude
- This list contains the names of those columns in the input table to be excluded from the output table.
- Filter
- Use one of these fields to filter either the Include or Exclude list for certain column names or name substrings.
- Buttons
- Use these buttons to move columns between the Include and Exclude list. Single-arrow buttons will move all selected columns. Double-arrow buttons will move all columns (filtering is taken into account).
- Enforce Inclusion
- Select this option to enforce the current inclusion list to stay the same even if the input table specification changes. If some of the included columns are not available anymore, a warning is displayed. (New columns will automatically be added to the exclusion list.)
- Enforce Exclusion
- Select this option to enforce the current exclusion list to stay the same even if the input table specification changes. If some of the excluded columns are not available anymore, a warning is displayed. (New columns will automatically be added to the inclusion list.)
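The difference between the two enforce options shows up when the input table specification changes. The sketch below is one way to model that behavior; the function `resolve_included` and its `mode` argument are hypothetical names for this example and not part of any KNIME API.

```python
def resolve_included(spec_columns, stored, mode):
    """Return (included, unavailable) for a new input specification.

    mode == "inclusion": `stored` is the enforced Include list, so columns
        that are new in the specification end up excluded.
    mode == "exclusion": `stored` is the enforced Exclude list, so columns
        that are new in the specification end up included.
    `unavailable` lists stored columns that are no longer in the input;
    this is the situation in which a warning would be displayed.
    """
    stored_set = set(stored)
    unavailable = [c for c in stored if c not in spec_columns]
    if mode == "inclusion":
        included = [c for c in spec_columns if c in stored_set]
    elif mode == "exclusion":
        included = [c for c in spec_columns if c not in stored_set]
    else:
        raise ValueError("mode must be 'inclusion' or 'exclusion'")
    return included, unavailable

# The input used to contain A, B, C; now C is gone and D is new.
new_spec = ["A", "B", "D"]
enforce_in = resolve_included(new_spec, ["A", "C"], "inclusion")
enforce_ex = resolve_included(new_spec, ["A", "C"], "exclusion")
```

With the same stored list, enforcing inclusion keeps only `A`, while enforcing exclusion keeps `B` and the new column `D`; in both cases `C` is reported as unavailable.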

- Type a search pattern which matches columns to move into the Include list. You can use either Wildcards ('?' matching any character, '*' matching a sequence of any characters) or Regex. You can specify whether your pattern should be case sensitive.
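A wildcard pattern of this kind can be translated into a regular expression. The following is a minimal sketch under stated assumptions: `?` matches exactly one character, the pattern must match the whole column name, and the helper names `wildcard_to_regex` and `matching_columns` are invented for the example. The node's actual matching rules may differ.

```python
import re

def wildcard_to_regex(pattern, case_sensitive=True):
    """Translate a '?'/'*' wildcard pattern into a compiled regex."""
    parts = []
    for ch in pattern:
        if ch == "?":
            parts.append(".")       # any single character
        elif ch == "*":
            parts.append(".*")      # any sequence of characters
        else:
            parts.append(re.escape(ch))  # everything else is literal
    flags = re.DOTALL
    if not case_sensitive:
        flags |= re.IGNORECASE
    return re.compile("".join(parts), flags)

def matching_columns(columns, pattern, case_sensitive=True):
    """Return the column names that match the wildcard pattern entirely."""
    regex = wildcard_to_regex(pattern, case_sensitive)
    return [c for c in columns if regex.fullmatch(c)]

columns = ["age", "Age2", "height", "weight"]
ends_with_eight = matching_columns(columns, "*eight")
age_plus_one = matching_columns(columns, "age?", case_sensitive=False)
literal_dot = matching_columns(columns, "a.e")
```

Escaping every non-wildcard character keeps regex metacharacters in column names, such as `.`, from being interpreted as patterns.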

- Possible Values Count
- Select an upper bound for the number of possible values for each of the nominal columns. If more values are encountered in a nominal column, the column will be ignored (no correlation values will be computed).

- Correlation variables in a square matrix
- A model containing the correlation measures. This model can be read by the Correlation Filter node.

- Correlation Matrix
- Square table view showing the pairwise correlation values of all columns. The color range varies from dark red (strong negative correlation) through white (no correlation) to dark blue (strong positive correlation). If a correlation value for a pair of columns is not available, the corresponding cell contains a missing value (shown as a cross in the color view).
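The color scale of the view can be approximated by a linear interpolation between white and one of the two end colors. This is only a sketch: the function `correlation_color` and the exact RGB values used for "dark red" and "dark blue" are assumptions, since the concrete colors of the view are not specified here.

```python
def correlation_color(value, negative=(139, 0, 0), positive=(0, 0, 139)):
    """Map a correlation in [-1, 1] to an RGB tuple.

    -1 maps to `negative` (assumed dark red), 0 to white and +1 to
    `positive` (assumed dark blue). None stands for an unavailable
    correlation and yields None; the view draws a cross in that case.
    """
    if value is None:
        return None
    v = max(-1.0, min(1.0, value))   # clamp against rounding noise
    end = positive if v >= 0 else negative
    t = abs(v)                        # 0 = white, 1 = full end color
    return tuple(round(255 + (channel - 255) * t) for channel in end)

white = correlation_color(0.0)
strong_positive = correlation_color(1.0)
strong_negative = correlation_color(-1.0)
unavailable = correlation_color(None)
```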


- 01_Analytics (KNIME Hub)
- 01_Correlation_Matrix (KNIME Hub)
- 01_Data_Preparation (KNIME Hub)
- 01_Forest_Fire_Prediction (KNIME Hub)
- 01_Guided_Analytics_for_ML_Automation (KNIME Hub)

To use this node in KNIME, install KNIME Base nodes from the following update site:

KNIME 4.3
