Big_Data_Workshop_2020

This directory contains 2 workflows.

01_Flight_Delay_Statistics_Impala

Demo workflow: Flight delay statistics using Impala.
- Connect to Kerberized Impala.
- Load the airline dataset into Impala.
- Aggregate and visualize data […]

02_Taxi_Demand_Prediction_Training_workflow

This use case uses the NYC taxi dataset and a Random Forest to train a simple time series model that predicts taxi demand in the next […]
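
The description does not say how the training table is built, so the following is only a sketch of one common way to frame time series prediction for a regressor such as a Random Forest: each row holds the previous few demand values as features and the following value as the target. The function name `make_lagged_rows`, the lag count, and the demand numbers are all hypothetical and are not taken from the workflow.

```python
def make_lagged_rows(series, n_lags):
    """Turn a demand series into (features, target) pairs.

    Each feature list holds the n_lags values that precede the target.
    """
    rows = []
    for t in range(n_lags, len(series)):
        features = series[t - n_lags:t]  # the n_lags values before position t
        target = series[t]               # the value to predict
        rows.append((features, target))
    return rows


# Made-up demand counts, for illustration only (not from the NYC taxi dataset).
demand = [120, 135, 150, 160, 155, 140]
rows = make_lagged_rows(demand, n_lags=3)

for features, target in rows:
    print(features, "->", target)
```

The resulting feature lists and targets could then be passed to any regression learner; which lag window the workflow actually uses is not stated here.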