Dal Pozzolo Andrea, Boracchi Giacomo, Caelen Olivier, Alippi Cesare, Bontempi Gianluca
IEEE Trans Neural Netw Learn Syst. 2018 Aug;29(8):3784-3797. doi: 10.1109/TNNLS.2017.2736643. Epub 2017 Sep 14.
Detecting frauds in credit card transactions is perhaps one of the best testbeds for computational intelligence algorithms. In fact, this problem involves a number of relevant challenges, namely: concept drift (customers' habits evolve and fraudsters change their strategies over time), class imbalance (genuine transactions far outnumber frauds), and verification latency (only a small set of transactions are timely checked by investigators). However, the vast majority of learning algorithms that have been proposed for fraud detection rely on assumptions that hardly hold in a real-world fraud-detection system (FDS). This lack of realism concerns two main aspects: 1) the way and timing with which supervised information is provided and 2) the measures used to assess fraud-detection performance. This paper has three major contributions. First, we propose, with the help of our industrial partner, a formalization of the fraud-detection problem that realistically describes the operating conditions of FDSs that everyday analyze massive streams of credit card transactions. We also illustrate the most appropriate performance measures to be used for fraud-detection purposes. Second, we design and assess a novel learning strategy that effectively addresses class imbalance, concept drift, and verification latency. Third, in our experiments, we demonstrate the impact of class unbalance and concept drift in a real-world data stream containing more than 75 million transactions, authorized over a time window of three years.
检测信用卡交易中的欺诈行为或许是计算智能算法的最佳试验平台之一。事实上,这个问题涉及诸多相关挑战,具体如下:概念漂移(客户习惯不断演变,欺诈者也会随时间改变策略)、类别不平衡(真实交易数量远远超过欺诈交易)以及验证延迟(调查人员仅对一小部分交易进行及时检查)。然而,针对欺诈检测所提出的绝大多数学习算法都依赖于一些在现实世界的欺诈检测系统(FDS)中几乎不成立的假设。这种缺乏现实性涉及两个主要方面:1)提供监督信息的方式和时机,以及2)用于评估欺诈检测性能的指标。本文有三大贡献。首先,在我们的行业合作伙伴的帮助下,我们提出了一种对欺诈检测问题的形式化描述,它切实地描绘了日常分析海量信用卡交易流的欺诈检测系统的运行状况。我们还阐述了用于欺诈检测目的的最合适的性能指标。其次,我们设计并评估了一种新颖的学习策略,它能有效应对类别不平衡、概念漂移和验证延迟问题。第三,在我们的实验中,我们在一个包含超过7500万笔交易(在三年的时间窗口内授权)的真实世界数据流中展示了类别不平衡和概念漂移的影响。