H2O PCA

Options

Dimensions to reduce to: Specify the rank of the matrix approximation, that is, the number of principal components to compute (k). Note that this value cannot be larger than the dimension of the input data.
Column selection: Select columns used for model training.
Ignore constant columns: Select to ignore constant columns.
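
As a rough illustration of the first three options, the sketch below drops constant columns and then checks that the requested rank fits the remaining data. It is a minimal standard-library example that assumes purely numeric columns; the helper names `drop_constant_columns` and `check_rank` are hypothetical and are not part of KNIME or H2O.

```python
def drop_constant_columns(columns):
    """Return only the columns whose values are not all identical."""
    return {name: values for name, values in columns.items()
            if len(set(values)) > 1}


def check_rank(k, columns):
    """Raise ValueError unless 1 <= k <= number of available columns."""
    n_cols = len(columns)
    if not 1 <= k <= n_cols:
        raise ValueError(f"k={k} must be between 1 and {n_cols}")
    return k


# Toy input: column name -> list of numeric values (assumed layout).
data = {
    "height": [1.6, 1.7, 1.8],
    "weight": [60.0, 72.0, 80.0],
    "unit": [1.0, 1.0, 1.0],  # constant column, contributes no variance
}

usable = drop_constant_columns(data)  # keeps "height" and "weight"
check_rank(2, usable)                 # passes; k=3 would raise ValueError
```

Categorical columns are expanded into several indicator columns before PCA, so for such data the effective input dimension can differ from the column count used in this sketch.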

Transformation method: Specify the transformation method for the training data (None, Standardize, Normalize, Demean, or Descale) (transform).
PCA method: Specify the algorithm to use for computing the principal components (pca_method).
Number of max iterations: Specify the maximum number of training iterations (max_iterations).
Use all factor levels: Specify whether to use all factor levels in the possible set of predictors (use_all_factor_levels).
Impute missing values: Specify whether to impute missing entries with the column mean value (impute_missing). Note that if this option is set to false, any rows with missing values will be removed.
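
The transformation and imputation options above can be sketched per column as follows. This is an illustrative standard-library example, not the H2O implementation: it assumes that Standardize subtracts the mean and divides by the standard deviation, Normalize subtracts the mean and divides by the range (max minus min), Demean only subtracts the mean, and Descale only divides by the standard deviation. The use of the sample standard deviation and the helper names are assumptions as well.

```python
from statistics import mean, stdev


def transform_column(values, method="NONE"):
    """Apply one of the assumed transformation methods to a numeric column."""
    if method == "NONE":
        return list(values)
    mu = mean(values)
    if method == "DEMEAN":
        return [v - mu for v in values]
    if method == "NORMALIZE":
        spread = max(values) - min(values)  # zero for a constant column
        return [(v - mu) / spread for v in values]
    sigma = stdev(values)  # sample standard deviation (assumption)
    if method == "DESCALE":
        return [v / sigma for v in values]
    if method == "STANDARDIZE":
        return [(v - mu) / sigma for v in values]
    raise ValueError(f"unknown transformation method: {method}")


def impute_mean(values):
    """Replace missing entries (None) with the mean of the present values."""
    present = [v for v in values if v is not None]
    mu = mean(present)
    return [mu if v is None else v for v in values]


def drop_rows_with_missing(rows):
    """Keep only rows without missing entries (imputation switched off)."""
    return [row for row in rows if None not in row]


column = [1.0, 2.0, 3.0]
demeaned = transform_column(column, "DEMEAN")
normalized = transform_column(column, "NORMALIZE")
standardized = transform_column(column, "STANDARDIZE")
imputed = impute_mean([1.0, None, 3.0])
complete_rows = drop_rows_with_missing([[1.0, 2.0], [None, 5.0], [3.0, 4.0]])
```

With imputation enabled every row is kept, while with it disabled the second row of the toy table is discarded, which can shrink small data sets noticeably.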

Max runtime in seconds: Maximum allowed runtime in seconds for model training (max_runtime_secs).
Use static random seed: Select to use a static seed for randomization.
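
For readers who script H2O directly, the node options can be collected in a plain parameter dictionary, as in the sketch below. The keys `k`, `transform`, `pca_method`, `max_iterations`, `use_all_factor_levels`, `impute_missing`, and `max_runtime_secs` are the parameter names given in parentheses above. The keys `ignore_const_cols` and `seed`, as well as all values, are assumptions chosen for illustration.

```python
# Node option -> H2O parameter, with illustrative values.
pca_params = {
    "k": 2,                          # Dimensions to reduce to
    "ignore_const_cols": True,       # Ignore constant columns (assumed name)
    "transform": "STANDARDIZE",      # Transformation method
    "pca_method": "GramSVD",         # PCA method
    "max_iterations": 1000,          # Number of max iterations
    "use_all_factor_levels": False,  # Use all factor levels
    "impute_missing": True,          # Impute missing values
    "max_runtime_secs": 0,           # Max runtime in seconds (0 assumed to mean no limit)
    "seed": 42,                      # Use static random seed (assumed name)
}

for name, value in sorted(pca_params.items()):
    print(f"{name} = {value!r}")
```

With the `h2o` Python package installed and a cluster started through `h2o.init()`, such a dictionary could be passed as `H2OPrincipalComponentAnalysisEstimator(**pca_params)` and trained on an H2OFrame. Whether the KNIME node forwards its settings in exactly this way is not stated on this page.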

To use this node in KNIME, install the extension KNIME H2O Machine Learning Integration from the v5.5 update site.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 5.5.0.v202504171027

On NodePit since: 2025-07-02

Last update: 2025-07-23

KNIME versions: Since v3.6
