This workflow demonstrates the usage of the DB nodes in conjunction with the Create Local Big Data Environment node, which is part of the KNIME Big Data Extension. This node, together with the DB nodes, allows complex data preprocessing without the need of manual SQL coding.
To run this workflow on a remote cluster, use an HDFS Connection node and Hive Connector node (available in the KNIME Big Data Connectors Extension) in place of the Create Local Big Data Environment node.
The table name is controlled by a workflow variable which can be altered via the context menu of the workflow in the KNIME explorer.
- KNIME File Handling Nodes
- KNIME Extension for Local Big Data Environments
Get this workflow from the following link: Download
01_Big_Data_Preprocessing_Example consists of the following 28 nodes(s):
01_Big_Data_Preprocessing_Example contains nodes provided by the following 4 plugin(s):
Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to firstname.lastname@example.org, follow @NodePit on Twitter, or chat on Gitter!
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.