This category contains 7 nodes.

Spark Concatenate 

Concatenates Spark DataFrame/RDDs row wise, inputs are optional.

Spark GroupBy 

The Spark GroupBy allows to group by the selected columns and output aggregated data to the generated groups.

Spark Partitioning 

Splits input data into two partitions.

Spark Pivot 

Pivots and groups the given Spark DataFrame/RDD by the selected columns for pivoting and grouping. Also performs aggregations for each pivot value.

Spark Row Filter 

The Spark Row Filter allows rows to be excluded from the input Spark DataFrame/RDD.

Spark Row Sampling 

Extracts a sample from the input data.

Spark Sorter 

Sorts the rows according to user-defined criteria.