This directory contains 2 workflows.
Demo workflow: Flight delay statistics using Impala. - Load the airline dataset into Hive. - Aggregate and visualize data (flight delay per day-of-week). […]
In this use case, we will use the NYC taxi dataset and a Random Forest to train a simple time series prediction model to predict taxi demand in the next […]