Taxi_Time_Series_Prediction

Taxi demand prediction training workflow

In this use case, we use the NYC taxi dataset to train a simple Random Forest time series model that predicts taxi demand in the next hour based on data from past hours.

Given the large size of the dataset, we train and deploy the machine learning model of choice on a Spark cluster. The KNIME Big Data Extension allows you to run a KNIME workflow on the big data platform you prefer, via in-database processing or via Spark.
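
The "past hours to next hour" setup described above can be illustrated with a minimal, local Python sketch. It is not the workflow's actual implementation, which runs as KNIME nodes on Spark. It only shows the general idea of aggregating pickups into hourly counts and turning them into lagged feature rows. The function names `hourly_counts` and `make_lagged_rows`, the choice of two lags, and the toy timestamps are all assumptions made for illustration.

```python
from collections import Counter
from datetime import datetime, timedelta


def hourly_counts(pickup_times):
    """Count pickups per hour; hours with no pickups get a count of 0."""
    counts = Counter(
        t.replace(minute=0, second=0, microsecond=0) for t in pickup_times
    )
    if not counts:
        return []
    hour, last = min(counts), max(counts)
    series = []
    while hour <= last:
        series.append((hour, counts.get(hour, 0)))
        hour += timedelta(hours=1)
    return series


def make_lagged_rows(values, n_lags):
    """Build (features, target) rows for supervised learning.

    features: the n_lags values preceding position i (oldest first)
    target:   the value at position i (the "next hour" to predict)
    """
    return [
        (values[i - n_lags:i], values[i])
        for i in range(n_lags, len(values))
    ]


# Toy pickup timestamps, made up for illustration (not from the NYC dataset).
pickups = [
    datetime(2019, 1, 1, 0, 5),
    datetime(2019, 1, 1, 0, 40),
    datetime(2019, 1, 1, 1, 15),
    datetime(2019, 1, 1, 3, 2),
    datetime(2019, 1, 1, 3, 30),
    datetime(2019, 1, 1, 3, 59),
    datetime(2019, 1, 1, 4, 10),
]

series = hourly_counts(pickups)
demand = [count for _, count in series]    # hourly demand, gaps filled with 0
rows = make_lagged_rows(demand, n_lags=2)  # assumed: 2 past hours as features
```

Each row pairs the demand of the previous hours with the demand of the following hour, which is the tabular shape a Random Forest regressor would typically be trained on. The number of lags is a tuning choice and is not stated on this page.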



Created by: dewi

Created at: 2019-02-07

On NodePit since: 2026-02-04

Last update: 2026-07-21

Created with KNIME version: v5.9.0

Tags: Demand prediction, random forest, time series prediction, Spark cluster, NYC taxi dataset, IoT, Internet of Things, Practicing Data Science
