Have a personal or library account? Click to login
Explainable deep learning model for predicting money laundering transactions Cover

Explainable deep learning model for predicting money laundering transactions

Open Access
|Jul 2024

Figures & Tables

Figure 1:

Research development methodology for generating synthetic data, predicting money laundering transactions using CNN, and interpreting the predictions using SHAP. AI, artificial intelligence; CNN, convolutional neural network; ML, machine learning; RF, random forest; SHAP, SHapley Additive exPlanations; SVM, support vector machine.
Research development methodology for generating synthetic data, predicting money laundering transactions using CNN, and interpreting the predictions using SHAP. AI, artificial intelligence; CNN, convolutional neural network; ML, machine learning; RF, random forest; SHAP, SHapley Additive exPlanations; SVM, support vector machine.

Figure 2:

Synthetic financial transaction data generation methodology.
Synthetic financial transaction data generation methodology.

Figure 3:

CNN architecture to predict suspicious money laundering transactions. CNN, convolutional neural network.
CNN architecture to predict suspicious money laundering transactions. CNN, convolutional neural network.

Figure 4:

ROC curve for (A) CNN, (B) RF, (C) XGBoost, and (D) SVM classifiers. CNN, convolutional neural network; RF, random forest; SVM, support vector machine; XGBoost, extreme gradient boosting.
ROC curve for (A) CNN, (B) RF, (C) XGBoost, and (D) SVM classifiers. CNN, convolutional neural network; RF, random forest; SVM, support vector machine; XGBoost, extreme gradient boosting.

Figure 5:

Interpretation of CNN predictions using SHAP force plot. (A) Force plot of a transaction predicted as suspicious by CNN. (B) Force plot of a transaction predicted as legitimate by CNN. CNN, convolutional neural network; SHAP, SHapley Additive exPlanations.
Interpretation of CNN predictions using SHAP force plot. (A) Force plot of a transaction predicted as suspicious by CNN. (B) Force plot of a transaction predicted as legitimate by CNN. CNN, convolutional neural network; SHAP, SHapley Additive exPlanations.

Figure 6:

Global interpretation of predictions made by RF, XGBoost, and SVM using feature importance score. RF, random forest; SVM, support vector machine; XGBoost, extreme gradient boosting.
Global interpretation of predictions made by RF, XGBoost, and SVM using feature importance score. RF, random forest; SVM, support vector machine; XGBoost, extreme gradient boosting.

Synthetic financial transaction dataset summary

ParameterValue
Number of customers442
Number of accounts442
Approximate number of transactions per customer184
Approximate time period of transactions12 months
Total number of transactions92,824
Labeled suspicious transactions4,054
Labeled legitimate transactions88,770

Sample of two original transaction records considered for prediction by CNN and interpretation by SHAP

FeaturesSuspicious transactionLegitimate transaction
Transaction date7/12/20176/02/2018
Transaction number339549359932
Transaction account1030001510202449
Transaction amount6,000.00322.00
Credit6,000.00
Debit322.00
Balance52,659.0016,054.00
Transaction typeCreditDebit
Transaction subtypeCash depositAuto-debit
Transaction descriptionCash depositHealth insurance
Transaction currencyAUDAUD
Transaction location typeATMOnline
Transaction location code448222
Target account0891141
Target country code0Australia
Target bank code0559059
Customer ID2000073620002452
Customer typeStudentIndividual
GenderMaleFemale
Date of birth24/09/199226/05/1965
Age2855
Marital statusSingleMarried
Residence countryAustraliaAustralia
StateNew South WalesNew South Wales
CitySydneyNew Castle
Postcode23582361
Tax resident countryAustraliaAustralia
Birth countryOverseas countryAustralia
Nationality countryOverseas countryAustralia
ProfessionStudentLaborers
Income category400077668
KYC updated on date22/04/201713/09/2019
KYC stateActiveActive
Risk rating00.463290428
Account number1030001510202449
BSB number203901201807
Account created on date22/04/201723/08/2017
Account typeSavingsSavings
Daily transaction limit3,0002,000
TFN999528645968305061
Statement delivery methodNot setOnline

CNN architecture hyperparameters

LayerParameters
Conv1DFilters = 32, Kernel size = 2, Input shape = 51,980 × 40, Activation = ReLU
Batch normalizationAxis = −1, momentum = 0.99, center = true, scale = true
Dropout0.3
Conv1DFilters = 64, Kernel size = 2, Activation = ReLU
Batch normalizationAxis = −1, momentum = 0.99, center = true, scale = true
Dropout0.3
Conv1DFilters = 128, Kernel size = 2, Activation = ReLU
Batch normalizationAxis = −1, momentum = 0.99, center = true, scale = true
Dropout0.3
FlattenAxis = −1, momentum = 0.99, center = true, scale = true
Dropout0.3
DenseUnits = 512, Activation = ReLU
Dropout0.3
DenseUnits = 1, Activation = Sigmoid

Overlapping transaction scenarios that shares the characteristics of legitimate and suspicious transactions

S. No.Scenario description
OL-1Wire transfer of money to offshore accounts from savings account
OL-2Cash withdrawal from the account in the range of AU $2,000 to AU $5,000
OL-3Wire transfer of money from offshore account into savings account
OL-4Shopping in the range of AU $10,000 to AU $30,000

Hyperparameters for RF, XGBoost, and SVM

ClassifierHyperparameterValue
RFNumber of trees in the forest100
RFMinimum number of data points in a node prior splitting2
RFMinimum number of data points allowed in a leaf node1
RFMaximum number of features for splitting a nodesqrt
RFMethod for sampling data pointsTrue
RFClass weight0:1, 1:100
XGBoostMinimum number of data points in a node prior splitting2
XGBoostMinimum number of data points allowed in a leaf node1
XGBoostLearning rate0.1
XGBoostNumber of decision trees to be boosted100
XGBoostSubsample ratio of training data1
XGBoostMaximum depth3
SVMC1.0
SVMKernelLinear
SVMGammaScale

Scenarios to develop money laundering transactions

S. No.Scenario description
ML-1Small deposits (<AU $5,000) of money through ATM by multiple people into a single account (<AU $10,000 per day) over a month. Then the same money is transferred in batches of AU $10,000 to $30,000 to multiple overseas accounts in different countries.
ML-2Small deposits (<AU $5,000) of money through ATM by multiple people into a single account (<AU $10,000 per day) over a month. Then the same money is used to buy luxurious items locally in the range of AU $10,000 to AU $90,000 (vehicles, gold, property, etc.).
ML-3Transfer of money from multiple overseas accounts from multiple countries and using the same to buy luxurious items in the range of AU $10,000 to AU $90,000 (vehicles, gold, property).
ML-4Transfer of money from multiple overseas accounts from multiple countries and withdraw the same through ATM over next couple of months in a small quantity in the range of AU $2,000 to AU $4,900.
ML-5Deposit a small amount of money in the range of AU $2,000 to AU $4,500 each month to ATM deposit machine and transfer the deposited amount online to an account in a different local bank (but same account) the next day.

Suspicious transaction prediction results of CNN, RF, XGBoost, and SVM models

MetricsCNNRFXGBoostSVM
fβ score78.23%61.97%62.09%30.86%
Recall91.01%59.82%60.14%29.10%
Precision34.56%91.53%87.72%67.47%
Accuracy92.03%97.95%97.84%96.20%
AUC98.00%79.80%98.40%83.60%
TPs1,114746750363
TNs24,51526,53226,49626,426
FPs2,10969105175
FNs110501497884
Training time70 min6 s16 s4.4 min
Language: English
Submitted on: Apr 10, 2024
|
Published on: Jul 24, 2024
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2024 Dattatray Vishnu Kute, Biswajeet Pradhan, Nagesh Shukla, Abdullah Alamri, published by Professor Subhas Chandra Mukhopadhyay
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.