
Create Big Data Test Environment

Creates a fully functional big data environment for testing purposes, including Apache Hive, Apache Spark and a remote file system. This node has no configuration of its own; instead, it reads its configuration from a file called flowvariables.csv in the root of the KNIME workspace. This file is expected to provide keys and values, which control what the node does.
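
As a rough illustration of the key/value file described above, the following Python sketch parses such a file into a dictionary. It assumes a plain two-column, comma-separated layout without a header row; this page does not document the exact format or the supported keys, so the function name read_flow_variables and the keys in SAMPLE are hypothetical.

```python
import csv
import io


def read_flow_variables(text):
    """Parse key,value rows into a dict, skipping blank or malformed rows.

    Assumption: two comma-separated columns and no header row. The real
    flowvariables.csv layout is not specified on this page.
    """
    variables = {}
    for row in csv.reader(io.StringIO(text)):
        # Ignore empty lines and rows that lack a key or a value column.
        if len(row) < 2 or not row[0].strip():
            continue
        variables[row[0].strip()] = row[1].strip()
    return variables


# Hypothetical content: these keys are placeholders, not real settings.
SAMPLE = "example.key.one,value1\nexample.key.two,value2\n"

if __name__ == "__main__":
    print(read_flow_variables(SAMPLE))
```

To inspect a real file, its contents could be read from the workspace root and passed to read_flow_variables in place of SAMPLE.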

Note: This node only creates a new Spark context upon its first execution after KNIME has started, or after the context has been destroyed. The Spark context created during its first execution is meant to be shared between KNIME testflows.
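
The create-once, share-afterwards behaviour in the note can be pictured with a small Python sketch. This only illustrates the pattern and is not the node's actual implementation; SharedContext and get_or_create_context are hypothetical names, not KNIME or Spark APIs.

```python
class SharedContext:
    """Hypothetical stand-in for a Spark context; not a KNIME or Spark API."""

    def __init__(self):
        self.alive = True

    def destroy(self):
        self.alive = False


# One context per process, standing in for one per running KNIME instance.
_shared = None


def get_or_create_context():
    """Return the shared context, creating a new one only if none is alive."""
    global _shared
    if _shared is None or not _shared.alive:
        _shared = SharedContext()
    return _shared


if __name__ == "__main__":
    first = get_or_create_context()
    second = get_or_create_context()   # reuses the existing context
    first.destroy()
    third = get_or_create_context()    # a new context after destruction
    print(first is second, third is first)
```

In this sketch, repeated calls return the same object until it is destroyed, which mirrors how several testflows would end up sharing one context.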

Input Ports

This node has no input ports

Output Ports

JDBC connection to a Hive instance. This port can be connected to the KNIME database nodes.
Remote file system connection that can be used with the Spark nodes that read/write files.
Spark context that can be connected to all Spark nodes.

Views

This node has no views

Installation

To use this node in KNIME, install the extension KNIME Extension for Apache Spark from the update site for KNIME v3.7, following the NodePit Product and Node Installation Guide. A zipped version of the software site is also available for download.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 2.4.0.v201811301556

On NodePit since: 2018-12-08

Last update: 2026-06-12

KNIME versions: From v3.7 to v4.0
