1 ×

k-Means

KNIME Base Nodes version 4.2.3.v202011031328 by KNIME AG, Zurich, Switzerland

This node outputs the cluster centers for a predefined number of clusters (no dynamic number of clusters). K-means performs a crisp clustering that assigns a data vector to exactly one cluster. The algorithm terminates when the cluster assignments do not change anymore.
The clustering algorithm uses the Euclidean distance on the selected attributes. The data is not normalized by the node (if required, you should consider to use the "Normalizer" as a preprocessing step).

Options

Number of clusters
The number of clusters (cluster centers) to be created.
Centroid initialization
  • First k rows: Initializes the centroids using the first rows of the input table.
  • Random initialization: Initializes the centroids with random rows of the input table. Checking the Use static random seed it is possible to get reproducible results.
Max number of iterations
The maximum number of iterations after which the algorithm terminates if it hasn't found a stable solution before.
Numeric Column Selection
Move the numeric columns of interest to the "Include" list. Always include all columns option moves all numeric columns to the "Include" list by default.
Enable Hilite Mapping
If enabled, the hiliting of a cluster row (2nd output) will hilite all rows of this cluster in the input table and the 1st output table. Depending on the number of rows, enabling this feature might consume a lot of memory.

Input Ports

Icon
Input to clustering. All numerical values and only these are considered for clustering.

Output Ports

Icon
The input data labeled with the cluster they are contained in.
Icon
The created clusters
Icon
PMML cluster model

Views

Cluster View
Displays the cluster prototypes in a tree-like structure, with each node containing the coordinates of the cluster center.

Best Friends (Incoming)

Best Friends (Outgoing)

Workflows

Installation

To use this node in KNIME, install KNIME Base nodes from the following update site:

KNIME 4.2

A zipped version of the software site can be downloaded here.

You don't know what to do with this link? Read our NodePit Product and Node Installation Guide that explains you in detail how to install nodes to your KNIME Analytics Platform.

Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform. Browse NodePit from within KNIME, install nodes with just one click and share your workflows with NodePit Space.

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.