Spark Statistics

This node computes summary statistics for the selected input columns using the MLlib Statistics package.

Computed statistics:

  • Minimum value
  • Maximum value
  • Sample mean
  • Sample variance
  • L1 norm
  • L2 norm
  • Number of nonzero elements
  • Number of zero elements
  • Row count
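
As an illustration of what each statistic means, here is a minimal plain-Python sketch for a single numeric column. It is not the MLlib code the node runs. The function name column_statistics and the dictionary keys are hypothetical, and it assumes that the sample variance uses the n - 1 denominator and that the zero count is the row count minus the nonzero count.

```python
import math

def column_statistics(values):
    """Compute the statistics listed above for one numeric column.

    Hypothetical helper for illustration only; the node itself delegates
    to Spark MLlib and operates on distributed data.
    """
    n = len(values)
    if n == 0:
        raise ValueError("column must contain at least one value")
    mean = sum(values) / n
    # Assumption: sample variance with the n - 1 denominator, 0.0 for a single row.
    variance = sum((v - mean) ** 2 for v in values) / (n - 1) if n > 1 else 0.0
    nonzeros = sum(1 for v in values if v != 0)
    return {
        "min": min(values),
        "max": max(values),
        "mean": mean,
        "variance": variance,
        "l1_norm": sum(abs(v) for v in values),            # sum of absolute values
        "l2_norm": math.sqrt(sum(v * v for v in values)),  # Euclidean norm
        "nonzeros": nonzeros,
        "zeros": n - nonzeros,
        "count": n,
    }

stats = column_statistics([3.0, 0.0, -4.0, 0.0])
```

For the column [3.0, 0.0, -4.0, 0.0], the L1 norm is 7 and the L2 norm is 5, with two nonzero and two zero elements across four rows.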

Options

Column filter
Select the columns to include in the calculation.

Input Ports

Spark DataFrame/RDD to compute statistics for.

Output Ports

Table with the computed statistics for the selected columns.

Views

This node has no views.
