There are 5547 nodes that can be used as predessesor
for a node with an input port of type Generic Port.
The Spark GroupBy allows to group by the selected columns and output aggregated data to the generated groups.
Splits input data into two partitions.
Pivots and groups the given Spark DataFrame/RDD by the selected columns for pivoting and grouping. Also performs aggregations for each pivot value.
The Spark Row Filter allows rows to be excluded from the input Spark DataFrame/RDD.
Extracts a sample from the input data.
Sorts the rows according to user-defined criteria.
Filters out correlated columns.
Computes the correlation matrix for the selected input columns.
Computes the correlation coefficient between two columns.
Computes summary statistics for the selected input columns.