Use both SMOTE (Synthetic Minority Over-sampling Technique) and ROSE (Random Over-Sampling Examples) algorithms to balance data. SMOTE is implemented within KNIME. ROSE can be accessed via R.
It is advisable to balace only your training data and leave the test/validation data as they are or you run the risk of greatly inflated values on your precision statistics.
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!