Icon

01_​Big_​Data_​Preprocessing_​Example

Big Data preprocessing

This workflow demonstrates the usage of the DB nodes in conjunction with the Create Local Big Data Environment node, which is part of the KNIME Big Data Extension. This node, together with the DB nodes, allows complex data preprocessing without the need of manual SQL coding.

To run this workflow on a remote cluster, use an HDFS Connection node and Hive Connector node (available in the KNIME Big Data Connectors Extension) in place of the Create Local Big Data Environment node.

The table name is controlled by a workflow variable which can be altered via the context menu of the workflow in the KNIME explorer.

Requirements:
- KNIME File Handling Nodes
- KNIME Extension for Local Big Data Environments

Nodes

Extensions

Links