Spark Row Filter

This node allows rows to be filtered from the input Spark DataFrame/RDD by adding and grouping conditions. Rows that match the conditions are included in the output DataFrame/RDD.

This node requires at least Apache Spark 2.0.

Options

Preview: This list contains the conditions and groups.
Add Condition: Add condition to the list. If a logical operator is selected it will be added to that, if a condition is selected it will be added to the parent logical operator. All rows that match the conditions are included in the output DataFame/RDD.
Group: Create a new logical operator and put the selected condition below.
Ungroup: Delete the selected logical operator and put the conditions from it to the parent logical operator.
Delete: Delete the selected element from the list.

Input Ports

: Spark DataFrame/RDD from which rows are to be excluded.

Output Ports

: Spark DataFrame/RDD including rows that match the defined conditions.

Popular Predecessors

Popular Successors

Views

This node has no views

Workflows

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Installation

To use this node in KNIME, install the extension KNIME Extension for Apache Spark (legacy) from the below update site following our NodePit Product and Node Installation Guide:

v5.5

A zipped version of the software site can be downloaded here.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 5.5.0.v202506051107

On NodePit since: 2025-07-02

Last update: 2025-08-01

KNIME versions: Since v3.7

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!