Lim Zheng You, Pang Ying Han, Kamarudin Khairul Zaqwan Bin, Ooi Shih Yin, Hiew Fu San
Faculty of Information Science and Technology, Multimedia University, Ayer Keroh, Melaka 75450, Malaysia.
Infineon Technologies, Free Trade Zone, Batu Berendam, Melaka 75350, Malaysia.
MethodsX. 2024 Nov 16;13:103055. doi: 10.1016/j.mex.2024.103055. eCollection 2024 Dec.
Credit card usage has surged, heightening concerns about fraud. To address this, advanced credit card fraud detection (CCFD) technology employs machine learning algorithms to analyze transaction behavior. Credit card data's complexity and imbalance can cause overfitting in conventional models. We propose a Bayesian-optimized Extremely Randomized Trees via Tree-structured Parzen Estimator (TP-ERT) to detect fraudulent transactions. TP-ERT uses higher randomness in split points and feature selection to capture diverse transaction patterns, improving model generalization. The performance of the model is assessed using real-world credit card transaction data. Experimental results demonstrate the superiority of TP-ERT over the other CCFD systems. Furthermore, our validation exhibits the effectiveness of TPE compared to other optimization techniques with higher F1 score.•The optimized Extremely Randomized Trees model is a viable artificial intelligence tool for detecting credit card fraud.•Model hyperparameter tuning is conducted using Tree-structured Parzen Estimator, a Bayesian optimization strategy, to efficiently explore the hyperparameter space and identify the best combination of hyperparameters. This facilitates the model to capture intricate patterns in the transactions, resulting in enhanced model performance.•The empirical findings exhibit that the proposed approach is superior to the other machine learning models on a real-world credit card transaction dataset.
信用卡使用量激增,引发了对欺诈行为的担忧。为解决这一问题,先进的信用卡欺诈检测(CCFD)技术采用机器学习算法来分析交易行为。信用卡数据的复杂性和不平衡性可能导致传统模型出现过拟合。我们提出了一种通过树结构帕曾估计器(TP-ERT)进行贝叶斯优化的极端随机树来检测欺诈交易。TP-ERT在分割点和特征选择上使用更高的随机性来捕捉不同的交易模式,提高模型的泛化能力。使用真实世界的信用卡交易数据评估模型的性能。实验结果证明了TP-ERT相对于其他CCFD系统的优越性。此外,我们的验证表明,与其他优化技术相比,TPE具有更高的F1分数,验证了其有效性。
•优化后的极端随机树模型是一种用于检测信用卡欺诈的可行人工智能工具。
•使用树结构帕曾估计器(一种贝叶斯优化策略)对模型超参数进行调整,以有效探索超参数空间并确定超参数的最佳组合。这有助于模型捕捉交易中的复杂模式,从而提高模型性能。
•实证结果表明,在真实世界的信用卡交易数据集上,所提出的方法优于其他机器学习模型。