• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用关联规则挖掘在管理健康数据中开发和验证数据质量规则。

Development and validation of data quality rules in administrative health data using association rule mining.

机构信息

Department of Community Health Sciences, University of Calgary, Calgary, Alberta, Canada.

Analytics, Alberta Health Services, Calgary, Alberta, Canada.

出版信息

BMC Med Inform Decis Mak. 2020 Apr 25;20(1):75. doi: 10.1186/s12911-020-1089-0.

DOI:10.1186/s12911-020-1089-0
PMID:32334599
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7183129/
Abstract

BACKGROUND

Data quality assessment presents a challenge for research using coded administrative health data. The objective of this study is to develop and validate a set of coding association rules for coded diagnostic data.

METHODS

We used the Canadian re-abstracted hospital discharge abstract data coded in International Classification of Disease, 10th revision (ICD-10) codes. Association rule mining was conducted on the re-abstracted data in four age groups (0-4, 20-44, 45-64; ≥ 65) to extract ICD-10 coding association rules at the three-digit (category of diagnosis) and four-digit levels (category of diagnosis with etiology, anatomy, or severity). The rules were reviewed by a panel of 5 physicians and 2 classification specialists using a modified Delphi rating process. We proposed and defined the variance and bias to assess data quality using the rules.

RESULTS

After the rule mining process and the panel review, 388 rules at the three-digit level and 275 rules at the four-digit level were developed. Half of the rules were from the age group of ≥65. Rules captured meaningful age-specific clinical associations, with rules at the age group of ≥65 being more complex and comprehensive than other age groups. The variance and bias can identify rules with high bias and variance in Alberta data and provides directions for quality improvement.

CONCLUSIONS

A set of ICD-10 data quality rules were developed and validated by a clinical and classification expert panel. The rules can be used as a tool to assess ICD-coded data, enabling the monitoring and comparison of data quality across institutions, provinces, and countries.

摘要

背景

数据质量评估对使用编码行政健康数据进行的研究提出了挑战。本研究的目的是开发和验证一组编码诊断数据的编码关联规则。

方法

我们使用加拿大重新摘录的国际疾病分类第 10 版(ICD-10)编码的住院病历摘要数据。在四个年龄组(0-4 岁、20-44 岁、45-64 岁和≥65 岁)对重新摘录的数据进行关联规则挖掘,以提取 ICD-10 编码的三位数(诊断类别)和四位数(诊断类别与病因、解剖或严重程度)编码关联规则。规则由一组 5 名医生和 2 名分类专家使用修改后的 Delphi 评分过程进行审查。我们使用规则提出并定义了方差和偏差来评估数据质量。

结果

经过规则挖掘过程和专家组审查,制定了 388 条三位数规则和 275 条四位数规则。其中一半的规则来自≥65 岁的年龄组。规则捕获了有意义的特定年龄的临床关联,≥65 岁年龄组的规则比其他年龄组更复杂和全面。方差和偏差可以识别艾伯塔省数据中具有高偏差和方差的规则,并为质量改进提供方向。

结论

由临床和分类专家小组开发和验证了一组 ICD-10 数据质量规则。这些规则可用作评估 ICD 编码数据的工具,能够在机构、省和国家之间监测和比较数据质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/5267e8cabe19/12911_2020_1089_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/ebaa51b7a4b6/12911_2020_1089_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/4fcdde831cf0/12911_2020_1089_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/fc486f50b513/12911_2020_1089_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/5d5b77c91f3e/12911_2020_1089_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/5267e8cabe19/12911_2020_1089_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/ebaa51b7a4b6/12911_2020_1089_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/4fcdde831cf0/12911_2020_1089_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/fc486f50b513/12911_2020_1089_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/5d5b77c91f3e/12911_2020_1089_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffb2/7183129/5267e8cabe19/12911_2020_1089_Fig5_HTML.jpg

相似文献

1
Development and validation of data quality rules in administrative health data using association rule mining.使用关联规则挖掘在管理健康数据中开发和验证数据质量规则。
BMC Med Inform Decis Mak. 2020 Apr 25;20(1):75. doi: 10.1186/s12911-020-1089-0.
2
Data on coding association rules from an inpatient administrative health data coded by International classification of disease - 10th revision (ICD-10) codes.来自按国际疾病分类第十次修订版(ICD-10)编码的住院管理健康数据的编码关联规则数据。
Data Brief. 2018 Feb 16;18:710-712. doi: 10.1016/j.dib.2018.02.043. eCollection 2018 Jun.
3
Exploration of association rule mining for coding consistency and completeness assessment in inpatient administrative health data.探索关联规则挖掘在住院行政健康数据中的编码一致性和完整性评估中的应用。
J Biomed Inform. 2018 Mar;79:41-47. doi: 10.1016/j.jbi.2018.02.001. Epub 2018 Feb 6.
4
Validation of the new diagnosis grouping system for pediatric emergency department visits using the International Classification of Diseases, 10th Revision.使用《国际疾病分类》第十版对儿科急诊科就诊新诊断分组系统进行验证。
Pediatr Emerg Care. 2013 Dec;29(12):1266-72. doi: 10.1097/PEC.0000000000000025.
5
Development of an expert based ICD-9-CM and ICD-10-CM map to AIS 2005 update 2008.基于专家的ICD - 9 - CM和ICD - 10 - CM到2008年更新版AIS 2005的映射开发。
Traffic Inj Prev. 2016 Sep;17 Suppl 1:1-5. doi: 10.1080/15389588.2016.1191069.
6
Development of an expert derived ICD-AIS map for serious AIS3+ injury identification.制定一个专家衍生的 ICD-AIS 图谱,用于识别严重 AIS3+损伤。
Traffic Inj Prev. 2020;21(3):181-187. doi: 10.1080/15389588.2020.1725494. Epub 2020 Mar 6.
7
Accuracy of claim data in the identification and classification of adults with congenital heart diseases in electronic medical records.电子病历中索赔数据对成人先天性心脏病的识别和分类的准确性。
Arch Cardiovasc Dis. 2019 Jan;112(1):31-43. doi: 10.1016/j.acvd.2018.07.002. Epub 2019 Jan 3.
8
Limitations of pulmonary embolism ICD-10 codes in emergency department administrative data: let the buyer beware.急诊科管理数据中肺栓塞ICD - 10编码的局限性:买家需谨慎。
BMC Med Res Methodol. 2017 Jun 8;17(1):89. doi: 10.1186/s12874-017-0361-1.
9
Improving Accuracy of Administrative Data for Perforated Appendicitis Classification.提高穿孔性阑尾炎分类的行政数据准确性。
J Surg Res. 2024 Jul;299:120-128. doi: 10.1016/j.jss.2024.03.041. Epub 2024 May 14.
10
Validation and optimisation of an ICD-10-coded case definition for sepsis using administrative health data.使用行政健康数据对脓毒症的国际疾病分类第十版(ICD - 10)编码病例定义进行验证和优化。
BMJ Open. 2015 Dec 23;5(12):e009487. doi: 10.1136/bmjopen-2015-009487.

引用本文的文献

1
Analyzing Radiation Protection Risks in Nuclear Medicine: Cause Investigation and WSR-Based Countermeasure Evaluation.分析核医学中的辐射防护风险:原因调查与基于WSR的对策评估
Risk Manag Healthc Policy. 2025 Feb 4;18:329-337. doi: 10.2147/RMHP.S508013. eCollection 2025.
2
Strengths and weaknesses of computerized clinical decision support systems: insights from a digital control center (C3 COVID-19) for early and personalized treatment for COVID-19.计算机化临床决策支持系统的优势与不足:来自一个用于新冠病毒病早期个性化治疗的数字控制中心(C3 COVID-19)的见解
Rev Esp Quimioter. 2025 Feb;38(1):1-7. doi: 10.37201/req/088.2024. Epub 2024 Oct 29.
3

本文引用的文献

1
Exploration of association rule mining for coding consistency and completeness assessment in inpatient administrative health data.探索关联规则挖掘在住院行政健康数据中的编码一致性和完整性评估中的应用。
J Biomed Inform. 2018 Mar;79:41-47. doi: 10.1016/j.jbi.2018.02.001. Epub 2018 Feb 6.
2
Barriers to data quality resulting from the process of coding health information to administrative data: a qualitative study.将健康信息编码为管理数据过程中导致数据质量问题的障碍:一项定性研究
BMC Health Serv Res. 2017 Nov 22;17(1):766. doi: 10.1186/s12913-017-2697-y.
3
Apriori Versions Based on MapReduce for Mining Frequent Patterns on Big Data.
Comorbidity patterns in cardiovascular diseases: the role of life-stage and socioeconomic status.
心血管疾病中的共病模式:生命阶段和社会经济地位的作用。
Front Cardiovasc Med. 2024 Feb 13;11:1215458. doi: 10.3389/fcvm.2024.1215458. eCollection 2024.
4
Data Quality in Health Research: Integrative Literature Review.卫生研究中的数据质量:综合文献综述。
J Med Internet Res. 2023 Oct 31;25:e41446. doi: 10.2196/41446.
5
CODE-EHR best practice framework for the use of structured electronic healthcare records in clinical research.临床研究中使用结构化电子健康记录的 CODE-EHR 最佳实践框架。
BMJ. 2022 Aug 29;378:e069048. doi: 10.1136/bmj-2021-069048.
6
CODE-EHR best practice framework for the use of structured electronic healthcare records in clinical research.用于临床研究中使用结构化电子健康记录的 CODE-EHR 最佳实践框架。
Eur Heart J. 2022 Oct 7;43(37):3578-3588. doi: 10.1093/eurheartj/ehac426.
7
Understanding challenges of using routinely collected health data to address clinical care gaps: a case study in Alberta, Canada.理解利用常规收集的健康数据来解决临床护理差距的挑战:以加拿大艾伯塔省为例的案例研究。
BMJ Open Qual. 2022 Jan;11(1). doi: 10.1136/bmjoq-2021-001491.
8
Definitions and Prevalence of Multimorbidity in Large Database Studies: A Scoping Review.大型数据库研究中多病种的定义和流行情况:范围综述。
Int J Environ Res Public Health. 2021 Feb 9;18(4):1673. doi: 10.3390/ijerph18041673.
基于 MapReduce 的频繁模式挖掘的先验版本。
IEEE Trans Cybern. 2018 Oct;48(10):2851-2865. doi: 10.1109/TCYB.2017.2751081. Epub 2017 Sep 27.
4
Coder perspectives on physician-related barriers to producing high-quality administrative data: a qualitative study.编码员对医生在生成高质量管理数据方面相关障碍的看法:一项定性研究。
CMAJ Open. 2017 Aug 15;5(3):E617-E622. doi: 10.9778/cmajo.20170036.
5
A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data.电子健康记录数据二次使用的统一数据质量评估术语和框架。
EGEMS (Wash DC). 2016 Sep 11;4(1):1244. doi: 10.13063/2327-9214.1244. eCollection 2016.
6
Under-coding of secondary conditions in coded hospital health data: Impact of co-existing conditions, death status and number of codes in a record.编码医院健康数据中次要条件的漏报:共存疾病、死亡状态和记录中编码数量的影响。
Health Informatics J. 2017 Dec;23(4):260-267. doi: 10.1177/1460458216647089. Epub 2016 May 8.
7
ICD-10: History and Context.国际疾病分类第十版:历史与背景。
AJNR Am J Neuroradiol. 2016 Apr;37(4):596-9. doi: 10.3174/ajnr.A4696. Epub 2016 Jan 28.
8
Administrative health data in Canada: lessons from history.加拿大的行政卫生数据:历史教训
BMC Med Inform Decis Mak. 2015 Aug 19;15:69. doi: 10.1186/s12911-015-0196-9.
9
An Introduction to Health Care Administrative Data.医疗保健管理数据简介。
Can J Hosp Pharm. 2015 May-Jun;68(3):232-7. doi: 10.4212/cjhp.v68i3.1457.
10
Defining a reference set to support methodological research in drug safety.定义参考集以支持药物安全性方法学研究。
Drug Saf. 2013 Oct;36 Suppl 1:S33-47. doi: 10.1007/s40264-013-0097-8.