• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用电子健康记录数据的机器学习算法中的潜在偏差。

Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data.

机构信息

Division of Rheumatology, Department of Medicine, University of California, San Francisco.

Center for Population Health Sciences, Stanford University, Palo Alto, California.

出版信息

JAMA Intern Med. 2018 Nov 1;178(11):1544-1547. doi: 10.1001/jamainternmed.2018.3763.

DOI:10.1001/jamainternmed.2018.3763
PMID:30128552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6347576/
Abstract

A promise of machine learning in health care is the avoidance of biases in diagnosis and treatment; a computer algorithm could objectively synthesize and interpret the data in the medical record. Integration of machine learning with clinical decision support tools, such as computerized alerts or diagnostic support, may offer physicians and others who provide health care targeted and timely information that can improve clinical decisions. Machine learning algorithms, however, may also be subject to biases. The biases include those related to missing data and patients not identified by algorithms, sample size and underestimation, and misclassification and measurement error. There is concern that biases and deficiencies in the data used by machine learning algorithms may contribute to socioeconomic disparities in health care. This Special Communication outlines the potential biases that may be introduced into machine learning-based clinical decision support tools that use electronic health record data and proposes potential solutions to the problems of overreliance on automation, algorithms based on biased data, and algorithms that do not provide information that is clinically meaningful. Existing health care disparities should not be amplified by thoughtless or excessive reliance on machines.

摘要

机器学习在医疗保健中的一个承诺是避免诊断和治疗中的偏见;计算机算法可以客观地综合和解释医疗记录中的数据。将机器学习与临床决策支持工具(如计算机警报或诊断支持)集成,可为医生和其他提供医疗保健的人员提供有针对性和及时的信息,从而改善临床决策。然而,机器学习算法也可能存在偏见。这些偏见包括与数据缺失和算法未识别的患者、样本量和低估、分类错误和测量误差有关的偏见。人们担心机器学习算法使用的数据中的偏差和缺陷可能导致医疗保健中的社会经济差异。本特别通讯概述了可能引入基于机器学习的临床决策支持工具的潜在偏差,这些工具使用电子健康记录数据,并提出了一些潜在的解决方案,以解决过度依赖自动化、基于有偏差数据的算法以及不提供有临床意义的信息的算法等问题。现有的医疗保健差异不应因盲目或过度依赖机器而加剧。

相似文献

1
Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data.利用电子健康记录数据的机器学习算法中的潜在偏差。
JAMA Intern Med. 2018 Nov 1;178(11):1544-1547. doi: 10.1001/jamainternmed.2018.3763.
2
The Sociodemographic Biases in Machine Learning Algorithms: A Biomedical Informatics Perspective.机器学习算法中的社会人口统计学偏差:生物医学信息学视角
Life (Basel). 2024 May 21;14(6):652. doi: 10.3390/life14060652.
3
Natural language processing and machine learning to enable automatic extraction and classification of patients' smoking status from electronic medical records.自然语言处理和机器学习可实现从电子病历中自动提取和分类患者的吸烟状况。
Ups J Med Sci. 2020 Nov;125(4):316-324. doi: 10.1080/03009734.2020.1792010. Epub 2020 Jul 22.
4
A Racially Unbiased, Machine Learning Approach to Prediction of Mortality: Algorithm Development Study.一种基于机器学习的种族公平死亡率预测方法:算法开发研究。
JMIR Public Health Surveill. 2020 Oct 22;6(4):e22400. doi: 10.2196/22400.
5
Identifying the presence and severity of dementia by applying interpretable machine learning techniques on structured clinical records.通过在结构化临床记录上应用可解释的机器学习技术来识别痴呆的存在和严重程度。
BMC Med Inform Decis Mak. 2022 Oct 17;22(1):271. doi: 10.1186/s12911-022-02004-3.
6
Machine Learning Algorithm Helps Identify Non-Diagnosed Prodromal Alzheimer's Disease Patients in the General Population.机器学习算法有助于在普通人群中识别未被诊断的前驱期阿尔茨海默病患者。
J Prev Alzheimers Dis. 2019;6(3):185-191. doi: 10.14283/jpad.2019.10.
7
Machine learning applied to electronic health record data in home healthcare: A scoping review.机器学习在家庭医疗保健中的电子健康记录数据中的应用:范围综述。
Int J Med Inform. 2023 Feb;170:104978. doi: 10.1016/j.ijmedinf.2022.104978. Epub 2022 Dec 30.
8
Applying machine learning to continuously monitored physiological data.将机器学习应用于连续监测的生理数据。
J Clin Monit Comput. 2019 Oct;33(5):887-893. doi: 10.1007/s10877-018-0219-z. Epub 2018 Nov 11.
9
Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record.基于规则和机器学习算法可在电子健康记录中准确识别系统性硬化症患者。
Arthritis Res Ther. 2019 Dec 30;21(1):305. doi: 10.1186/s13075-019-2092-7.
10
Automation of penicillin adverse drug reaction categorisation and risk stratification with machine learning natural language processing.利用机器学习自然语言处理实现青霉素药物不良反应分类和风险分层的自动化。
Int J Med Inform. 2021 Dec;156:104611. doi: 10.1016/j.ijmedinf.2021.104611. Epub 2021 Oct 5.

引用本文的文献

1
Attitudes, Willingness, and Barriers Among Hospital Pharmacists Toward Artificial Intelligence Integration in Pharmacy Practice: A Cross-Sectional Survey.医院药剂师对药学实践中人工智能整合的态度、意愿和障碍:一项横断面调查。
Cureus. 2025 Aug 13;17(8):e89990. doi: 10.7759/cureus.89990. eCollection 2025 Aug.
2
Machine Learning Models for Predicting Mental Health Crises in Adolescents Using Electronic Health Records: A Systematic Review.使用电子健康记录预测青少年心理健康危机的机器学习模型:一项系统综述
Cureus. 2025 Aug 12;17(8):e89873. doi: 10.7759/cureus.89873. eCollection 2025 Aug.
3
Documentation of social determinants of health for patients with type 2 diabetes in Epic Cosmos.在Epic Cosmos系统中记录2型糖尿病患者的健康社会决定因素。
JAMIA Open. 2025 Sep 4;8(5):ooaf095. doi: 10.1093/jamiaopen/ooaf095. eCollection 2025 Oct.
4
Real World Performance of the Model for End-Stage Liver Disease Score Across Different Races and Ethnicities.不同种族和族裔中终末期肝病模型评分的真实世界表现。
Dig Dis Sci. 2025 Sep 3. doi: 10.1007/s10620-025-09362-8.
5
Misleading Results in Posttraumatic Stress Disorder Predictive Models Using Electronic Health Record Data: Algorithm Validation Study.使用电子健康记录数据的创伤后应激障碍预测模型中的误导性结果:算法验证研究
J Med Internet Res. 2025 Aug 27;27:e63352. doi: 10.2196/63352.
6
Lost in .*VCF Translation. From Data Fragmentation to Precision Genomics: Technical, Ethical, and Interpretive Challenges in the Post-Sequencing Era.迷失在.*VCF 翻译中。从数据碎片化到精准基因组学:测序后时代的技术、伦理和解释挑战。
J Pers Med. 2025 Aug 20;15(8):390. doi: 10.3390/jpm15080390.
7
Label Accuracy in Electronic Health Records and Its Impact on Machine Learning Models for Early Prediction of Gestational Diabetes: 3-Step Retrospective Validation Study.电子健康记录中的标签准确性及其对妊娠期糖尿病早期预测机器学习模型的影响:三步回顾性验证研究
JMIR Med Inform. 2025 Aug 21;13:e72938. doi: 10.2196/72938.
8
Artificial Intelligence-Enabled ECG Screening for LVSD in LBBB: Evaluating Model Development and Transfer Learning Approaches.用于左束支传导阻滞中左心室收缩功能障碍的人工智能心电图筛查:评估模型开发和迁移学习方法
JACC Adv. 2025 Aug 21;4(9):102089. doi: 10.1016/j.jacadv.2025.102089.
9
Establishing a policy statement on the use of artificial intelligence in neurosurgery.制定关于神经外科中人工智能使用的政策声明。
Neurosurg Rev. 2025 Aug 19;48(1):606. doi: 10.1007/s10143-025-03745-1.
10
Ensemble learning to enhance accurate identification of patients with glaucoma using electronic health records.使用电子健康记录的集成学习以提高青光眼患者的准确识别
JAMIA Open. 2025 Aug 10;8(4):ooaf080. doi: 10.1093/jamiaopen/ooaf080. eCollection 2025 Aug.

本文引用的文献

1
Algorithms of Oppression: How Search Engines Reinforce Racism NYU Press, 2018. 256 pp.《压迫的算法:搜索引擎如何强化种族主义》 纽约大学出版社,2018年。256页。
Science. 2021 Oct 29;374(6567):542. doi: 10.1126/science.abm5861. Epub 2021 Oct 28.
2
Scalable and accurate deep learning with electronic health records.借助电子健康记录实现可扩展且准确的深度学习。
NPJ Digit Med. 2018 May 8;1:18. doi: 10.1038/s41746-018-0029-1. eCollection 2018.
3
Implementing Machine Learning in Health Care - Addressing Ethical Challenges.在医疗保健中实施机器学习——应对伦理挑战。
N Engl J Med. 2018 Mar 15;378(11):981-983. doi: 10.1056/NEJMp1714229.
4
Big Data and Machine Learning in Health Care.医疗保健中的大数据与机器学习
JAMA. 2018 Apr 3;319(13):1317-1318. doi: 10.1001/jama.2017.18391.
5
What This Computer Needs Is a Physician: Humanism and Artificial Intelligence.这台计算机需要的是一位医生:人文主义与人工智能。
JAMA. 2018 Jan 2;319(1):19-20. doi: 10.1001/jama.2017.19198.
6
Unintended Consequences of Machine Learning in Medicine.机器学习在医学领域的意外后果。
JAMA. 2017 Aug 8;318(6):517-518. doi: 10.1001/jama.2017.7797.
7
Conscientious Classification: A Data Scientist's Guide to Discrimination-Aware Classification.尽责分类:数据科学家的歧视感知分类指南。
Big Data. 2017 Jun;5(2):120-134. doi: 10.1089/big.2016.0048.
8
How Socioeconomic Status Affects Patient Perceptions of Health Care: A Qualitative Study.社会经济地位如何影响患者对医疗保健的认知:一项定性研究。
J Prim Care Community Health. 2017 Jul;8(3):169-175. doi: 10.1177/2150131917697439. Epub 2017 Mar 8.
9
Semantics derived automatically from language corpora contain human-like biases.从语言语料库中自动推导出来的语义包含类人偏见。
Science. 2017 Apr 14;356(6334):183-186. doi: 10.1126/science.aal4230.
10
Data On Race, Ethnicity, And Language Largely Incomplete For Managed Care Plan Members.针对管理式医疗计划成员的种族、族裔和语言数据大多不完整。
Health Aff (Millwood). 2017 Mar 1;36(3):548-552. doi: 10.1377/hlthaff.2016.1044.