This workflow is designed to illustrate how to select an optimal value for a model parameter using cross validation.
The workflow reads in the dataset and tries to find the optimal polynomial regression model (i.e. the polynomial that will have the minimal test error). We check various degrees (from 1 to 10) using the "Parameter Optimization Loop". For each degree, we estimate the test error using cross-validation.
We plot the estimation of the test error vs the degree. We also identify the best degree (i.e., the one that minimizes the estimation of the test error) and we use all the data to train a polynomial of that degree.
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!