Icon

fIX 01_​Training Random Forest for Fraud Detection

Fraud Detection: Random Forest Model Training

The workflow reads in the creditcard.csv file and trains and evaluates a Random Forest model to classify transactions as either fraudulent or not. Notice the Rule Engine node classifies all transactions with fraud probability above 0.3 as fraudulent. We apply a threshold of 0.3 to the probability of being a fraudulent transaction (default is 0.5). Adopting a lower threshold makes the algorithm more responsive in classifying frauds. You can evaluate the results by opening 'Evaluation' component view. After training, the model is saved for deployment. In our case we use Random Forest Learner, but we can use any other Supervised model.


This workflow demonstrates how we can train the model on the provided data:
1. Read training data
2. Train the Random Forest Model
3. Evaluate model results
4. Save trained model for deployment

URL: Kaggle Dataset https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud

Read training data

Train the Random Forest Model

Save trained model for deployment

Fraud Detection: Random Forest Model Training


The workflow reads in the creditcard.csv file and trains and evaluates a Random Forest model to classify transactions as either fraudulent or not. Notice the Rule Engine node classifies all transactions with fraud probability above 0.3 as fraudulent. We apply a threshold of 0.3 to the probability of being a fraudulent transaction (default is 0.5). Adopting a lower threshold makes the algorithm more responsive in classifying frauds. You can evaluate the results by opening 'Evaluation' component view. After training, the model is saved for deployment. In our case we use Random Forest Learner, but we can use any other Supervised model.

https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud?resource=download

This workflow demonstrates how we can train the model on the provided data:

  1. Read training data

  2. Train the Random Forest Model

  3. Evaluate model results

  4. Save trained model for deployment

Evaluate model results

Open Viewto see results
Evaluation
Class = Fraud
Random Forest Learner
Read credit card data (sample)Download full dataset via the Kaggle link in the description
CSV Reader
Random Forest Predictor
Convert Class to string
Number to String
70 % Training30 % Testing
Table Partitioner
Save model
Model Writer
P(Class =1)>0.3=>1P(Class=1)<= 1
Expression

Nodes

Extensions

Links