0 ×

Create Big Data Test Environment (legacy)

DeprecatedKNIME Extension for Apache Spark core infrastructure version 4.3.1.v202101261633 by KNIME AG, Zurich, Switzerland

Creates a fully functional big data environment for testing purposes, including Apache Hive, Apache Spark and a remote file system. This node has no own configuration, instead it will read its configuration from a file called flowvariables.csv from the root of the KNIME workspace. This file is expected two provide keys and values. These can be used to control what this node does.

Note: This node only creates a new Spark context upon its first execution after KNIME has started, or after the context has been destroyed. The Spark context created by during its first execution is meant to be shared between KNIME testflows.

Note: This node uses the old database connection based Hive output port.

Output Ports

JDBC connection to a Hive instance. This port can be connected to the KNIME database nodes.
Remote file system connection that can be used with the Spark nodes that read/write files.
Spark context, that can be connected to all Spark nodes.


To use this node in KNIME, install KNIME Extension for Apache Spark from the following update site:


A zipped version of the software site can be downloaded here.

You don't know what to do with this link? Read our NodePit Product and Node Installation Guide that explains you in detail how to install nodes to your KNIME Analytics Platform.

Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform. Browse NodePit from within KNIME, install nodes with just one click and share your workflows with NodePit Space.


You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.