This is a workflow designed to illustrate the difference between training error and test error. It compares three different models:
A polynomial of order 1.
A polynomial of order 3.
A polynomial of order 10.
Naturally, the model with the lowest training error is the most flexible, i.e., the polynomial of order 10 (see this by clicking on the magnifier on the Training component). However, for this dataset, the model that gives the lowest test error is the polynomial of order 3 (see this by clicking on the magnifier on the Test component). Therefore, this latter model is expected to predict better on new data, which is the whole point of Machine Learning.
P.S. The estimation of the test error is conducted using validation.
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!