m_001_hive_partitions

Create a Big Data Hive/Parquet table with a partition based on an existing KNIME table and add more partitions later

You can create a Hive table in Parquet format with the DB Table Creator node by adding options that specify the PARQUET format and a PARTITION clause. Leave the partition column out when you create the table from the example data, then use that column as the partition when you insert the KNIME table into the newly created Hive table. You can add more partitions later simply by uploading further data through the "DB Loader" node: missing partitions are created automatically, and data for an existing partition is appended to it.
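The idea above can be sketched as the kind of HiveQL that ends up being executed. This is a minimal illustration, not the workflow's actual configuration: the table names `sample_db.census_part` and `sample_db.census_staging` and all column names are hypothetical, and the statements that the KNIME nodes generate may differ. It shows the partition column being kept out of the regular column list and placed last in the SELECT of the insert, which is how Hive dynamic partitioning expects it.

```python
# Sketch: build HiveQL for a partitioned Parquet table.
# All table and column names below are hypothetical examples.

def quote(name):
    """Quote an identifier the Hive way (backticks)."""
    return f"`{name}`"


def build_create_table(table, columns, partition_columns):
    """Return a CREATE TABLE statement; partition columns must not
    also appear in the regular column list."""
    overlap = {n for n, _ in columns} & {n for n, _ in partition_columns}
    if overlap:
        raise ValueError(f"partition column(s) also listed as regular columns: {sorted(overlap)}")
    col_sql = ",\n  ".join(f"{quote(n)} {t}" for n, t in columns)
    part_sql = ", ".join(f"{quote(n)} {t}" for n, t in partition_columns)
    return (
        f"CREATE TABLE IF NOT EXISTS {table} (\n  {col_sql}\n)\n"
        f"PARTITIONED BY ({part_sql})\n"
        "STORED AS PARQUET"
    )


def build_insert(table, source, columns, partition_columns):
    """Return an INSERT with dynamic partitioning: the partition
    columns come last in the SELECT list."""
    part_names = ", ".join(quote(n) for n, _ in partition_columns)
    select_names = ", ".join(
        quote(n) for n, _ in list(columns) + list(partition_columns)
    )
    return (
        f"INSERT INTO TABLE {table} PARTITION ({part_names})\n"
        f"SELECT {select_names}\n"
        f"FROM {source}"
    )


columns = [("age", "INT"), ("workclass", "STRING"), ("income", "STRING")]
partition_columns = [("education", "STRING")]

create_stmt = build_create_table("sample_db.census_part", columns, partition_columns)
insert_stmt = build_insert(
    "sample_db.census_part", "sample_db.census_staging", columns, partition_columns
)

# Depending on the cluster, dynamic partitioning may need to be enabled first.
session_settings = [
    "SET hive.exec.dynamic.partition=true",
    "SET hive.exec.dynamic.partition.mode=nonstrict",
]

print(create_stmt)
print(insert_stmt)
```

Running the insert again with rows for a new `education` value would create a new partition, while rows for an existing value would be appended to that partition, which matches the behaviour described for later uploads.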

=> please download the whole workflow group "kn_example_bigdata_hive_partitions"

Nodes

Create Folder (6 ×)
DB GroupBy (Labs) (4 ×)
DB Reader (Labs) (4 ×)
DB Sorter (Labs) (4 ×)
DB Table Selector (Labs) (4 ×)
(5 of 39 nodes listed)

Extensions

KNIME Base nodes
KNIME Column Expressions (legacy)
KNIME Database
KNIME Extension for Apache Spark
KNIME Extension for Apache Spark (legacy)
(5 of 8 modules listed)
