• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用数据挖掘进行医疗保险欺诈检测。

Healthcare insurance fraud detection using data mining.

作者信息

Hamid Zain, Khalique Fatima, Mahmood Saba, Daud Ali, Bukhari Amal, Alshemaimri Bader

机构信息

Department of Computer Science, Bahria University, Islamabad, Pakistan.

Faculty of Resilience, Rabdan Academy, Abu Dhabi, United Arab Emirates.

出版信息

BMC Med Inform Decis Mak. 2024 Apr 26;24(1):112. doi: 10.1186/s12911-024-02512-4.

DOI:10.1186/s12911-024-02512-4
PMID:38671513
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11046758/
Abstract

BACKGROUND

Healthcare programs and insurance initiatives play a crucial role in ensuring that people have access to medical care. There are many benefits of healthcare insurance programs but fraud in healthcare continues to be a significant challenge in the insurance industry. Healthcare insurance fraud detection faces challenges from evolving and sophisticated fraud schemes that adapt to detection methods. Analyzing extensive healthcare data is hindered by complexity, data quality issues, and the need for real-time detection, while privacy concerns and false positives pose additional hurdles. The lack of standardization in coding and limited resources further complicate efforts to address fraudulent activities effectively.

METHODOLGY

In this study, a fraud detection methodology is presented that utilizes association rule mining augmented with unsupervised learning techniques to detect healthcare insurance fraud. Dataset from the Centres for Medicare and Medicaid Services (CMS) 2008-2010 DE-SynPUF is used for analysis. The proposed methodology works in two stages. First, association rule mining is used to extract frequent rules from the transactions based on patient, service and service provider features. Second, the extracted rules are passed to unsupervised classifiers, such as IF, CBLOF, ECOD, and OCSVM, to identify fraudulent activity.

RESULTS

Descriptive analysis shows patterns and trends in the data revealing interesting relationship among diagnosis codes, procedure codes and the physicians. The baseline anomaly detection algorithms generated results in 902.24 seconds. Another experiment retrieved frequent rules using association rule mining with apriori algorithm combined with unsupervised techniques in 868.18 seconds. The silhouette scoring method calculated the efficacy of four different anomaly detection techniques showing CBLOF with highest score of 0.114 followed by isolation forest with the score of 0.103. The ECOD and OCSVM techniques have lower scores of 0.063 and 0.060, respectively.

CONCLUSION

The proposed methodology enhances healthcare insurance fraud detection by using association rule mining for pattern discovery and unsupervised classifiers for effective anomaly detection.

摘要

背景

医疗保健项目和保险举措在确保人们能够获得医疗服务方面发挥着关键作用。医疗保险项目有诸多益处,但医疗保健领域的欺诈行为仍是保险行业面临的重大挑战。医疗保险欺诈检测面临着不断演变且复杂的欺诈计划带来的挑战,这些欺诈计划会适应检测方法。分析大量医疗保健数据受到复杂性、数据质量问题以及实时检测需求的阻碍,而隐私问题和误报则带来了额外的障碍。编码缺乏标准化以及资源有限,进一步使有效应对欺诈活动的努力变得复杂。

方法

在本研究中,提出了一种欺诈检测方法,该方法利用关联规则挖掘并结合无监督学习技术来检测医疗保险欺诈。使用了来自医疗保险和医疗补助服务中心(CMS)2008 - 2010年DE - SynPUF的数据集进行分析。所提出的方法分两个阶段工作。首先,关联规则挖掘用于根据患者、服务和服务提供者特征从交易中提取频繁规则。其次,将提取的规则传递给无监督分类器,如IF、CBLOF、ECOD和OCSVM,以识别欺诈活动。

结果

描述性分析显示了数据中的模式和趋势,揭示了诊断代码、程序代码和医生之间有趣的关系。基线异常检测算法在902.24秒内生成结果。另一个实验使用先验算法结合无监督技术的关联规则挖掘在868.18秒内检索到频繁规则。轮廓评分方法计算了四种不同异常检测技术的功效,显示CBLOF得分最高,为0.114,其次是孤立森林,得分为0.103。ECOD和OCSVM技术得分较低,分别为0.063和0.060。

结论

所提出的方法通过使用关联规则挖掘进行模式发现以及无监督分类器进行有效异常检测,增强了医疗保险欺诈检测能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/405ae6282c88/12911_2024_2512_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/d98bae003e3e/12911_2024_2512_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/3275d46f7411/12911_2024_2512_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/81c68d20149c/12911_2024_2512_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/07d358ab86f4/12911_2024_2512_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/7244e912a0e2/12911_2024_2512_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/16fc00aec31c/12911_2024_2512_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/766fc80dce0b/12911_2024_2512_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/d7f666056661/12911_2024_2512_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/82f4ee6466dd/12911_2024_2512_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/53b247d5a74b/12911_2024_2512_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/8c0af7450fd3/12911_2024_2512_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/3e45675b2167/12911_2024_2512_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/238456b707d6/12911_2024_2512_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/0142d3dce84b/12911_2024_2512_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/68985aa4ec0e/12911_2024_2512_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/405ae6282c88/12911_2024_2512_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/d98bae003e3e/12911_2024_2512_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/3275d46f7411/12911_2024_2512_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/81c68d20149c/12911_2024_2512_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/07d358ab86f4/12911_2024_2512_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/7244e912a0e2/12911_2024_2512_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/16fc00aec31c/12911_2024_2512_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/766fc80dce0b/12911_2024_2512_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/d7f666056661/12911_2024_2512_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/82f4ee6466dd/12911_2024_2512_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/53b247d5a74b/12911_2024_2512_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/8c0af7450fd3/12911_2024_2512_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/3e45675b2167/12911_2024_2512_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/238456b707d6/12911_2024_2512_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/0142d3dce84b/12911_2024_2512_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/68985aa4ec0e/12911_2024_2512_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7586/11046758/405ae6282c88/12911_2024_2512_Fig15_HTML.jpg

相似文献

1
Healthcare insurance fraud detection using data mining.利用数据挖掘进行医疗保险欺诈检测。
BMC Med Inform Decis Mak. 2024 Apr 26;24(1):112. doi: 10.1186/s12911-024-02512-4.
2
Detecting medical prescriptions suspected of fraud using an unsupervised data mining algorithm.使用无监督数据挖掘算法检测疑似欺诈的医疗处方。
Daru. 2018 Dec;26(2):209-214. doi: 10.1007/s40199-018-0227-z. Epub 2018 Nov 20.
3
Identifying frauds and anomalies in Medicare-B dataset.识别医疗保险B数据集里的欺诈行为和异常情况。
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:3664-3667. doi: 10.1109/EMBC.2017.8037652.
4
Approaches for identifying U.S. medicare fraud in provider claims data.识别美国医疗保险欺诈在医疗服务提供者索赔数据中的方法。
Health Care Manag Sci. 2020 Mar;23(1):2-19. doi: 10.1007/s10729-018-9460-8. Epub 2018 Oct 27.
5
Multi-stage methodology to detect health insurance claim fraud.用于检测医疗保险索赔欺诈的多阶段方法。
Health Care Manag Sci. 2016 Sep;19(3):249-60. doi: 10.1007/s10729-015-9317-3. Epub 2015 Jan 20.
6
Using data mining to detect health care fraud and abuse: a review of literature.利用数据挖掘检测医疗保健欺诈与滥用行为:文献综述
Glob J Health Sci. 2014 Aug 31;7(1):194-202. doi: 10.5539/gjhs.v7n1p194.
7
FraudAuditor: A Visual Analytics Approach for Collusive Fraud in Health Insurance. FraudAuditor:一种用于医疗保险共谋欺诈的可视分析方法。
IEEE Trans Vis Comput Graph. 2023 Jun;29(6):2849-2861. doi: 10.1109/TVCG.2023.3261910. Epub 2023 May 3.
8
Data mining application to healthcare fraud detection: a two-step unsupervised clustering method for outlier detection with administrative databases.数据挖掘在医疗保健欺诈检测中的应用:基于行政数据库的两步无监督聚类异常检测方法。
BMC Med Inform Decis Mak. 2020 Jul 14;20(1):160. doi: 10.1186/s12911-020-01143-9.
9
The evaluation of trustworthiness to identify health insurance fraud in dentistry.评估可信度以识别牙科医疗保险欺诈行为。
Artif Intell Med. 2017 Jan;75:40-50. doi: 10.1016/j.artmed.2016.12.002. Epub 2016 Dec 27.
10
Healthcare Fraud Data Mining Methods: A Look Back and Look Ahead.医疗保健欺诈数据挖掘方法:回顾与展望。
Perspect Health Inf Manag. 2022 Jan 1;19(1):1i. eCollection 2022 Winter.

引用本文的文献

1
Using large language models for enhanced fraud analysis and detection in blockchain based health insurance claims.利用大语言模型加强基于区块链的健康保险理赔中的欺诈分析与检测。
Sci Rep. 2025 Aug 13;15(1):29763. doi: 10.1038/s41598-025-15676-4.
2
Enhancing Healthcare Integrity Using Simple Statistical Methods: Detecting Irregularities in Historical Dermatology Services Payments.使用简单统计方法提高医疗保健诚信度:检测皮肤科历史服务支付中的违规行为。
Healthcare (Basel). 2025 Jun 18;13(12):1464. doi: 10.3390/healthcare13121464.
3
Anomaly-based threat detection in smart health using machine learning.

本文引用的文献

1
Procedure code overutilization detection from healthcare claims using unsupervised deep learning methods.利用无监督深度学习方法从医疗保健索赔中检测程序代码过度使用。
BMC Med Inform Decis Mak. 2023 Sep 28;23(1):196. doi: 10.1186/s12911-023-02268-3.
2
Health insurance fraud detection by using an attributed heterogeneous information network with a hierarchical attention mechanism.使用具有层次注意力机制的属性异质信息网络进行健康保险欺诈检测。
BMC Med Inform Decis Mak. 2023 Apr 6;23(1):62. doi: 10.1186/s12911-023-02152-0.
3
Determinants of Diabetes Disease Management, 2011-2019.
基于异常的智能健康中的威胁检测:机器学习方法
BMC Med Inform Decis Mak. 2024 Nov 19;24(1):347. doi: 10.1186/s12911-024-02760-4.
4
An imbalanced learning method based on graph tran-smote for fraud detection.一种基于图变换过采样技术的不平衡学习方法用于欺诈检测。
Sci Rep. 2024 Jul 17;14(1):16560. doi: 10.1038/s41598-024-67550-4.
2011 - 2019年糖尿病疾病管理的决定因素
Healthcare (Basel). 2021 Jul 26;9(8):944. doi: 10.3390/healthcare9080944.
4
Health and hard time.健康与艰难时期。
CMAJ. 2013 Feb 19;185(3):E139-40. doi: 10.1503/cmaj.109-4389. Epub 2013 Jan 21.
5
Eliminating waste in US health care.消除美国医疗保健中的浪费。
JAMA. 2012 Apr 11;307(14):1513-6. doi: 10.1001/jama.2012.362. Epub 2012 Mar 14.