
m_001_import_hive_parquet

KNIME and Hive - load multiple Parquet files at once via external table

This workflow demonstrates how to import several Parquet files at once without iteration using an external HIVE table.

The initial structure will be derived from a sample of one of the files. The rules are very basic: String, Double and Int. You might add rules for BIGINT if you need them.
You could use a column as a partition.
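For illustration only (the column names here are hypothetical), the structure derived from the sample file is a comma-separated list of column definitions that gets injected into the CREATE EXTERNAL TABLE statement shown further down, for example:

`customer_id` INT,
`amount` DOUBLE,
`order_date` STRING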

Please download the complete folder at: https://hub.knime.com/mlauber71/spaces/Public/latest/kn_example_bigdata_hive_parquet_loader/

Below is the SQL used to create the external table:

DROP TABLE IF EXISTS `default`.`df_external`;

-- create an external table, that means the Parquet files will stay where they are
-- in HDFS and this table is basically a view on them
CREATE EXTERNAL TABLE IF NOT EXISTS `default`.`df_external` (
-- this is the SQL string that defines the structure of the external files
$${Sv_sql_string2}$$
)
COMMENT 'import data from Parquet files with external table'
-- give the path location where the Parquet files are stored:
-- /local_big_data_/data_upload
STORED AS PARQUET LOCATION '$${Sv_path_upload_big_data}$$'
-- this setting was not respected by KNIME/Hive at some point,
-- but it now seems to work for Parquet files
TBLPROPERTIES ("skip.header.line.count"="1");

With Impala it is possible to use a LIKE PARQUET command for external tables
(https://docs.cloudera.com/documentation/enterprise/5-8-x/topics/impala_parquet.html):

CREATE EXTERNAL TABLE ingest_existing_files LIKE PARQUET '/user/etl/destination/datafile1.dat'
STORED AS PARQUET LOCATION '/user/etl/destination';

The workflow goes through the following steps:

- Create a local big data context. If you encounter any problems, close KNIME, delete all data from the /big_data/ folder and start over.
- Create sample Parquet files (metanode), list all Parquet files in /data/ and upload them to the /big_data/data_upload folder (HDFS). (Reading the entire path is a problem because of the workflowset.meta file.)
- Read the first Parquet file and extract its structure (Extract Table Spec).
- Generate the SQL syntax for the external table with a rough rule to derive the column types (you might change that to your needs):
  $Column Type$ LIKE "String*" => "STRING"
  $Column Type$ LIKE "*long*" => "BIGINT"
  $Column Type$ LIKE "*double*" => "DOUBLE"
  $Column Type$ LIKE "*integer*" => "INTEGER"
  $Column Type$ LIKE "*Time*" => "TIMESTAMP"
  $Column Type$ LIKE "*Date*" => "DATE"
  You might have to extend these rules if you have timestamps, BIGINT columns or unknown types.
- Filter out the variable "v_partition" (^(?!v_partition).*$), which will serve as (well) the partition later.
- Create the external table `default`.`df_external` and check that it worked (DESCRIBE EXTENDED `default`.`df_external`).
- DROP TABLE IF EXISTS `default`.`df_internal`, then create the internal table df_internal with the DB Table Creator, setting PARTITIONED BY (v_partition INT) under additional options.
- INSERT the data from `default`.`df_external` INTO `default`.`df_internal` using v_partition (it has to be the last column).
- Check the internal table (DESCRIBE EXTENDED `default`.`df_internal`).

Nodes used in the workflow: String to URI, DB Reader, DB SQL Executor, Column Filter, DB Query Reader, Row Filter, Extract Table Spec, DB Table Selector, List Files/Folders, Transfer Files (Table), Path to String, Table Row to Variable, Parquet Reader, Rule Engine, Merge Variables and DB Table Creator, plus the metanodes "create sample parquet files", "SQL strings" and "local big data context create".
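The internal, partitioned table and the INSERT statement are generated inside the workflow (DB Table Creator and DB SQL Executor). As a rough HiveQL sketch of that step, with the same hypothetical column names standing in for the derived structure, it could look like this:

-- allow dynamic partitioning for the INSERT below
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;

-- internal (managed) table, partitioned by v_partition
CREATE TABLE IF NOT EXISTS `default`.`df_internal` (
  `customer_id` INT,
  `amount` DOUBLE,
  `order_date` STRING
)
PARTITIONED BY (v_partition INT)
STORED AS PARQUET;

-- copy the data over from the external table;
-- the partition column v_partition has to be the last column of the SELECT
INSERT INTO TABLE `default`.`df_internal` PARTITION (v_partition)
SELECT `customer_id`, `amount`, `order_date`, v_partition
FROM `default`.`df_external`;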
