Node Connectivity

There are 85 nodes that can be used as predessesor for a node with an input port of type Spark Data.

PySpark Script (1 to 2) 

Execute arbitrary Python code in Spark.

PySpark Script (2 to 1) 

Execute arbitrary Python code in Spark.

PySpark Script (2 to 2) 

Execute arbitrary Python code in Spark.

PySpark Script Source 

Execute arbitrary Python code in Spark.

Spark Repartition 

Repartitions a Spark DataFrame.

Spark SQL Query 

Executes a Spark SQL query statement

Spark Concatenate 

Concatenates Spark DataFrame/RDDs row wise, inputs are optional.

Spark GroupBy 

The Spark GroupBy allows to group by the selected columns and output aggregated data to the generated groups.

Spark Partitioning 

Splits input data into two partitions.

Spark Pivot 

Pivots and groups the given Spark DataFrame/RDD by the selected columns for pivoting and grouping. Also performs aggregations for each pivot value.