• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从电子健康记录(EHR)数据中通过机器学习得出的编码比人工分配的国际疾病分类(ICD)编码能更好地预测严重后果。

Machine-Learned Codes from EHR Data Predict Hard Outcomes Better than Human-Assigned ICD Codes.

作者信息

Yin Ying, Shao Yijun, Ma Phillip, Zeng-Treitler Qing, Nelson Stuart J

机构信息

Biomedical Informatics Center, George Washington University, Washington, DC 20052, USA.

Veterans Administration Hospital, Washington, DC 20422, USA.

出版信息

Mach Learn Knowl Extr. 2025 Jun;7(2):36. doi: 10.3390/make7020036. Epub 2025 Apr 17.

DOI:10.3390/make7020036
PMID:40406594
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12093355/
Abstract

We used machine learning (ML) to characterize 894,154 medical records of outpatient visits from the Veterans Administration Central Data Warehouse (VA CDW) by the likelihood of assignment of 200 International Classification of Diseases (ICD) code blocks. Using four different predictive models, we found the ML-derived predictions for the code blocks were consistently more effective in predicting death or 90-day rehospitalization than the assigned code block in the record. We reviewed records of ICD chapter assignments. The review revealed that the ML-predicted chapter assignments were consistently better than those humanly assigned. Impact factor analysis, a method of explanation of AI findings that was developed in our group, demonstrated little effect on any one assigned ICD code block but a marked impact on the ML-derived code blocks of kidney disease as well as several other morbidities. In this study, machine learning was much better than human code assignment at predicting the relatively rare outcomes of death or rehospitalization. Future work will address generalizability using other datasets, as well as addressing coding that is more nuanced than that of the categorization provided by code blocks.

摘要

我们使用机器学习(ML),通过200个国际疾病分类(ICD)代码块的分配可能性,对退伍军人事务部中央数据仓库(VA CDW)的894,154份门诊医疗记录进行特征描述。使用四种不同的预测模型,我们发现,对于代码块,由机器学习得出的预测在预测死亡或90天内再次住院方面,始终比记录中分配的代码块更有效。我们审查了ICD章节分配的记录。审查显示,机器学习预测的章节分配始终优于人工分配的章节分配。影响因素分析是我们团队开发的一种解释人工智能结果的方法,它对任何一个分配的ICD代码块几乎没有影响,但对机器学习得出的肾病代码块以及其他几种疾病有显著影响。在本研究中,在预测相对罕见的死亡或再次住院结果方面,机器学习比人工代码分配要好得多。未来的工作将使用其他数据集解决可推广性问题,以及处理比代码块提供的分类更细微的编码问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/11f2b4d175b2/nihms-2081514-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/e4f02bf2a7a5/nihms-2081514-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/83cc2d332f31/nihms-2081514-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/0c532a69ed0f/nihms-2081514-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/11f2b4d175b2/nihms-2081514-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/e4f02bf2a7a5/nihms-2081514-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/83cc2d332f31/nihms-2081514-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/0c532a69ed0f/nihms-2081514-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379c/12093355/11f2b4d175b2/nihms-2081514-f0004.jpg

相似文献

1
Machine-Learned Codes from EHR Data Predict Hard Outcomes Better than Human-Assigned ICD Codes.从电子健康记录(EHR)数据中通过机器学习得出的编码比人工分配的国际疾病分类(ICD)编码能更好地预测严重后果。
Mach Learn Knowl Extr. 2025 Jun;7(2):36. doi: 10.3390/make7020036. Epub 2025 Apr 17.
2
Artificial intelligence approaches for phenotyping heart failure in U.S. Veterans Health Administration electronic health record.美国退伍军人事务部电子健康记录中基于人工智能的心力衰竭表型分析方法。
ESC Heart Fail. 2024 Oct;11(5):3155-3166. doi: 10.1002/ehf2.14787. Epub 2024 Jun 14.
3
Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study.无监督特征选择以识别冠心病患者队列机器学习中的重要国际疾病分类第十版(ICD - 10)和解剖治疗化学分类系统(ATC)编码:回顾性研究
JMIR Med Inform. 2024 Jul 26;12:e52896. doi: 10.2196/52896.
4
Are ICD codes reliable for observational studies? Assessing coding consistency for data quality.国际疾病分类代码用于观察性研究是否可靠?评估数据质量的编码一致性。
Digit Health. 2024 Oct 29;10:20552076241297056. doi: 10.1177/20552076241297056. eCollection 2024 Jan-Dec.
5
A keyphrase-based approach for interpretable ICD-10 code classification of Spanish medical reports.基于关键词的方法用于对西班牙语医疗报告进行可解释的 ICD-10 编码分类。
Artif Intell Med. 2021 Nov;121:102177. doi: 10.1016/j.artmed.2021.102177. Epub 2021 Sep 22.
6
Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record.基于规则和机器学习算法可在电子健康记录中准确识别系统性硬化症患者。
Arthritis Res Ther. 2019 Dec 30;21(1):305. doi: 10.1186/s13075-019-2092-7.
7
Predictive Value of International Classification of Diseases Codes for Idiopathic Intracranial Hypertension in a University Health System.国际疾病分类代码对大学健康系统中特发性颅内高压的预测价值。
J Neuroophthalmol. 2021 Dec 1;41(4):e679-e683. doi: 10.1097/WNO.0000000000000992.
8
Predictive Value of International Classification of Diseases Codes for Idiopathic Intracranial Hypertension in a University Health System.国际疾病分类编码对大学健康系统特发性颅内高压的预测价值。
J Neuroophthalmol. 2022 Mar 1;42(1):6-10. doi: 10.1097/WNO.0000000000001563.
9
Determining Distinct Suicide Attempts From Recurrent Electronic Health Record Codes: Classification Study.通过电子健康记录重复代码确定不同的自杀未遂事件:分类研究
JMIR Form Res. 2024 Jan 8;8:e46364. doi: 10.2196/46364.
10
Seeing the unseen: how can we best identify transgender women within the Veterans Affairs healthcare system's electronic medical record?见所未见:我们如何才能在退伍军人事务部医疗保健系统的电子病历中最好地识别跨性别女性?
J Sex Med. 2023 Mar 31;20(4):559-567. doi: 10.1093/jsxmed/qdac033.

本文引用的文献

1
Are ICD codes reliable for observational studies? Assessing coding consistency for data quality.国际疾病分类代码用于观察性研究是否可靠?评估数据质量的编码一致性。
Digit Health. 2024 Oct 29;10:20552076241297056. doi: 10.1177/20552076241297056. eCollection 2024 Jan-Dec.
2
Machine learning approaches for electronic health records phenotyping: a methodical review.基于机器学习的电子健康记录表型分析方法:系统评价
J Am Med Inform Assoc. 2023 Jan 18;30(2):367-381. doi: 10.1093/jamia/ocac216.
3
Shedding Light on the Black Box: Explaining Deep Neural Network Prediction of Clinical Outcomes.
揭开黑箱之谜:解释深度神经网络对临床结果的预测
J Med Syst. 2021 Jan 4;45(1):5. doi: 10.1007/s10916-020-01701-8.
4
Problems and Barriers during the Process of Clinical Coding: a Focus Group Study of Coders' Perceptions.临床编码过程中的问题和障碍:编码员认知的焦点小组研究。
J Med Syst. 2020 Feb 8;44(3):62. doi: 10.1007/s10916-020-1532-x.
5
A qualitative evaluation of clinically coded data quality from health information manager perspectives.从健康信息管理员的角度对临床编码数据质量进行定性评估。
Health Inf Manag. 2020 Jan;49(1):19-27. doi: 10.1177/1833358319855031. Epub 2019 Jul 8.
6
Impact of ICD-10-CM Transition on Mental Health Diagnoses Recording.国际疾病分类第十次修订本临床修正版(ICD-10-CM)转换对心理健康诊断记录的影响。
EGEMS (Wash DC). 2019 Apr 12;7(1):14. doi: 10.5334/egems.281.
7
Using Electronic Health Records To Generate Phenotypes For Research.利用电子健康记录生成用于研究的表型。
Curr Protoc Hum Genet. 2019 Jan;100(1):e80. doi: 10.1002/cphg.80. Epub 2018 Dec 5.
8
ICD-10 Coding Will Challenge Researchers: Caution and Collaboration may Reduce Measurement Error and Improve Comparability Over Time.ICD-10 编码将对研究人员构成挑战:谨慎和协作可能会减少测量误差,并随着时间的推移提高可比性。
Med Care. 2019 Jul;57(7):e42-e46. doi: 10.1097/MLR.0000000000001010.
9
Evaluating Coding Accuracy in General Surgery Residents' Accreditation Council for Graduate Medical Education Procedural Case Logs.评估普通外科住院医师毕业后医学教育认证委员会程序病例日志中的编码准确性。
J Surg Educ. 2016 Nov-Dec;73(6):e59-e63. doi: 10.1016/j.jsurg.2016.07.017.
10
Electronic health records to facilitate clinical research.电子健康记录助力临床研究。
Clin Res Cardiol. 2017 Jan;106(1):1-9. doi: 10.1007/s00392-016-1025-6. Epub 2016 Aug 24.