H2O k-Means

Build and apply a K-Means model using H2O.

Options

General Settings

Number of clusters
Specify the number of clusters (k).
Maximum number of iterations
Specify the number of training iterations (max_iterations).
Column selection
Select columns used for model training.
Ignore constant columns
Select to ignore constant columns.

Algorithm Settings

Estimate number of clusters
Specify whether to estimate the number of clusters (<=k) iteratively (independent of the seed) and deterministically (beginning with k=1,2,3...) (estimate_k).
Initialization mode
Specify the initialization mode. If mode "User" is selected, user-specified initial cluster centers should be given in the second in-port (init).
Standardize numeric columns
Specify whether to standardize the numeric columns to have a mean of zero and unit variance (recommended) (standardize).
Select categorical encoding
Specify one of the following encoding schemes for handling categorical features (categorical_encoding).

Algorithm Settings

Max runtime in seconds
Maximum allowed runtime in seconds for model training (max_runtime_secs).
Use static random seed
Select to use static seed for randomization.

Input Ports

Icon
H2O Frame with training data.
Icon
H2O Frame with user-specified initial cluster centers.

Output Ports

Icon
H2O Frame with the K-Means cluster assignment.
Icon
H2O Frame with the K-Means cluster centers.
Icon
H2O K-Means model.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.