School of Hive - with KNIME's local Big Data environment (SQL for Big Data)
Demonstrates a collection of Hive functions using KNIME's local Big Data environment including creating table structures from scratch and from an existing file and working with partitions.
Partitions are an essential organizing principle of Big Data systems. They will make is easier to store and handel big data tables.
All examples are fully functional. You could switch out the local big data environment for your own one (Cloudera e.g.).
This example focusses on Hive-SQL script in executors. Similar effects could be achieved by using KNIME's DB nodes.
https://hub.knime.com/mlauber71/spaces/Public/latest/_db_sql_bigdata_hive_spark_meta_collection#externalresources
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!