Spark PCA

This Node Is Deprecated: this version of the node has been replaced by a new and improved version. The old version is kept for backwards compatibility, but for all new workflows we suggest using the version linked below.
Go to Suggested Replacement: Spark PCA

This node uses the Apache Spark implementation of principal component analysis (PCA).
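
For orientation, the textbook formulation of PCA can be written as follows. This is a general sketch, not a statement about the node's internals: the symbols X (the n-by-d matrix of feature rows), mu (the column means), and k (the number of components) are notation introduced here, and conventions such as whether rows are centered before projection are not stated on this page.

```latex
\begin{aligned}
C &= \frac{1}{n-1}\,(X - \mathbf{1}\mu^{\top})^{\top}(X - \mathbf{1}\mu^{\top})
  && \text{(sample covariance of the feature columns)} \\
C\,v_i &= \lambda_i v_i, \quad \lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_d \ge 0
  && \text{(eigenvectors ordered by explained variance)} \\
V_k &= [\,v_1 \;\; v_2 \;\; \cdots \;\; v_k\,] \in \mathbb{R}^{d \times k}
  && \text{(principal component matrix)} \\
Y &= X\,V_k \in \mathbb{R}^{n \times k}
  && \text{(data projected onto the top } k \text{ components)}
\end{aligned}
```

In these terms, the "Number of components" option corresponds to k, and the two outputs described under Output Ports correspond to Y and V_k.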

Options

Number of components
The number of top principal components to compute.
Feature Columns
The feature columns to use during computation. Supports only numeric columns.

Input Ports

Input Spark DataFrame/RDD

Output Ports

The input DataFrame/RDD projected onto the principal components.
The principal component matrix.
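
To make the relationship between the options and the two outputs concrete, here is a minimal pure-Python sketch on a tiny in-memory table. It is not the Spark implementation and does not call Spark: the helper names (`covariance`, `top_eigenvector`, `pca`), the sample table, and the use of power iteration with deflation are all assumptions chosen for illustration. The sketch also projects the rows without mean-centering them, which is a convention chosen here and not something this page specifies.

```python
import math


def covariance(rows):
    """Sample covariance matrix (d x d) of row-oriented numeric data."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    centered = [[x - m for x, m in zip(row, means)] for row in rows]
    d = len(means)
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def top_eigenvector(matrix, iterations=500):
    """Power iteration: dominant eigenvector and eigenvalue of a symmetric matrix."""
    d = len(matrix)
    # Asymmetric start vector; it must not be orthogonal to the dominant eigenvector.
    v = [float(i + 1) for i in range(d)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm < 1e-12:  # no variance left in any direction
            break
        v = [x / norm for x in w]
    eigenvalue = sum(v[i] * sum(matrix[i][j] * v[j] for j in range(d))
                     for i in range(d))
    # Eigenvectors are defined up to sign; make the largest entry positive.
    if max(v, key=abs) < 0:
        v = [-x for x in v]
    return v, eigenvalue


def pca(rows, k):
    """Return (projected rows, d x k principal component matrix)."""
    cov = covariance(rows)
    d = len(cov)
    components = []
    for _ in range(k):
        v, lam = top_eigenvector(cov)
        components.append(v)
        # Deflation: remove the variance explained by the component just found.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)] for i in range(d)]
    # Columns of pc_matrix are the principal components.
    pc_matrix = [[components[c][i] for c in range(k)] for i in range(d)]
    # Assumption of this sketch: rows are projected as-is, without centering.
    projected = [[sum(row[i] * pc_matrix[i][c] for i in range(d)) for c in range(k)]
                 for row in rows]
    return projected, pc_matrix


# Hypothetical input table: two numeric feature columns and one non-numeric column.
table = [
    {"x": 1.0, "y": 2.0, "label": "a"},
    {"x": 2.0, "y": 4.0, "label": "b"},
    {"x": 3.0, "y": 6.0, "label": "c"},
    {"x": 4.0, "y": 8.0, "label": "d"},
]
feature_columns = ["x", "y"]  # "Feature Columns": numeric columns only
rows = [[float(record[c]) for c in feature_columns] for record in table]

projected, pc_matrix = pca(rows, k=1)  # "Number of components" = 1
print("projected rows:", projected)
print("principal component matrix:", pc_matrix)
```

Here `projected` plays the role of the first output port (one row per input row, one value per component) and `pc_matrix` the role of the second (one row per feature column, one column per component). For data of realistic size, the Spark implementation should be used instead of a sketch like this.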

Views

This node has no views
