This workflow uses the KNIME H2O Machine Learning Integration to train an isolation forest model for fraud detection, a method particularly effective for anomaly detection, to spot potential fraud in transactional data. Starting with data intake and preprocessing, it segments the data into training and validation sets. We then use the H2O Isolation Forest Learner to train the model on the 2/3 of 'good' transaction patterns. The trained model is validated against the training set from the split, with its performance measured and the model saved for use in the deployment workflow.
Here are the steps taken to train the mode;
1. Read Training Data
2. Data Preprocessing: Partition data into a training, test, and validation set
3. Train/Validate the model and Evaluate results
4. Save model for deployment
5. Evaluate Results
URL: Kaggle Dataset https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com.
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.