Node Connectivity

There are 5462 nodes that can be used as predessesor for a node with an input port of type Generic Port.

Spark Partitioning 

Splits input data into two partitions.

Spark Pivot 

Pivots and groups the given Spark DataFrame/RDD by the selected columns for pivoting and grouping. Also performs aggregations for each pivot value.

Spark Row Filter 

The Spark Row Filter allows rows to be excluded from the input Spark DataFrame/RDD.

Spark Row Sampling 

Extracts a sample from the input data.

Spark Sorter 

Sorts the rows according to user-defined criteria.

Spark Correlation Filter 

Filters out correlated columns.

Spark Correlation Matrix 

Computes the correlation matrix for the selected input columns.

Spark Linear Correlation 

Computes the correlation coefficient between two columns.

Spark Statistics 

Computes summary statistics for the selected input columns.

Create Big Data Test Environment 

Creates big data test environment based on flow variable configuration.