The challenge is to blend models from different analytics platforms (Python, R, and KNIME) into a single ensemble model. The data is the “airline data set” (http://stat-computing.org/dataexpo/2009/the-data.html), enriched with additional external data such as cities, daily weather (https://www.ncdc.noaa.gov/cdo-web/datasets/), US holidays, geo-coordinates, and airplane maintenance. DepDelay is used as the target variable. The individual models are an SVM in R, a logistic regression in Python, and a decision tree in KNIME. Will they blend into a single ensemble model? Yes, they blend.
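As a rough illustration of what blending can mean here, the sketch below averages the positive-class probabilities produced by three models and thresholds the result. It is a minimal sketch, not the workflow's actual combination logic: the function name `blend_probabilities`, the binary "delayed" class, the 0.5 threshold, and the example scores are all assumptions made for illustration.

```python
from statistics import mean


def blend_probabilities(prob_lists, threshold=0.5):
    """Average per-model probabilities of the positive class, then threshold.

    prob_lists: one list of probabilities per model, all scoring the same rows.
    Returns (blended probabilities, predicted 0/1 labels).
    """
    if not prob_lists:
        raise ValueError("need predictions from at least one model")
    n_rows = len(prob_lists[0])
    if any(len(p) != n_rows for p in prob_lists):
        raise ValueError("all models must score the same rows")
    # zip(*prob_lists) groups the scores that each model gave to the same row
    blended = [mean(row) for row in zip(*prob_lists)]
    labels = [int(p >= threshold) for p in blended]
    return blended, labels


# Hypothetical "flight is delayed" scores for four flights (made-up values)
r_svm = [0.9, 0.2, 0.6, 0.4]          # assumed output of the R SVM
py_logreg = [0.8, 0.1, 0.4, 0.3]      # assumed output of the Python logistic regression
knime_tree = [1.0, 0.0, 1.0, 0.5]     # assumed output of the KNIME decision tree

blended, labels = blend_probabilities([r_svm, py_logreg, knime_tree])
print(labels)  # [1, 0, 1, 0]
```

Averaging probabilities is only one option. Majority voting on the predicted classes or weighting each model by its validation accuracy are common alternatives, and the workflow itself may use a different scheme.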
To use this workflow, download it from the URL below and open it in KNIME:
Download Workflow