• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估预测 COVID-19 结果的医疗 AI 模型中未被识别偏见的客观框架。

An objective framework for evaluating unrecognized bias in medical AI models predicting COVID-19 outcomes.

机构信息

Laboratory of Computer Science, Massachusetts General Hospital, Boston, Massachusetts, USA.

Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA.

出版信息

J Am Med Inform Assoc. 2022 Jul 12;29(8):1334-1341. doi: 10.1093/jamia/ocac070.

DOI:10.1093/jamia/ocac070
PMID:35511151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9277645/
Abstract

OBJECTIVE

The increasing translation of artificial intelligence (AI)/machine learning (ML) models into clinical practice brings an increased risk of direct harm from modeling bias; however, bias remains incompletely measured in many medical AI applications. This article aims to provide a framework for objective evaluation of medical AI from multiple aspects, focusing on binary classification models.

MATERIALS AND METHODS

Using data from over 56 000 Mass General Brigham (MGB) patients with confirmed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), we evaluate unrecognized bias in 4 AI models developed during the early months of the pandemic in Boston, Massachusetts that predict risks of hospital admission, ICU admission, mechanical ventilation, and death after a SARS-CoV-2 infection purely based on their pre-infection longitudinal medical records. Models were evaluated both retrospectively and prospectively using model-level metrics of discrimination, accuracy, and reliability, and a novel individual-level metric for error.

RESULTS

We found inconsistent instances of model-level bias in the prediction models. From an individual-level aspect, however, we found most all models performing with slightly higher error rates for older patients.

DISCUSSION

While a model can be biased against certain protected groups (ie, perform worse) in certain tasks, it can be at the same time biased towards another protected group (ie, perform better). As such, current bias evaluation studies may lack a full depiction of the variable effects of a model on its subpopulations.

CONCLUSION

Only a holistic evaluation, a diligent search for unrecognized bias, can provide enough information for an unbiased judgment of AI bias that can invigorate follow-up investigations on identifying the underlying roots of bias and ultimately make a change.

摘要

目的

人工智能(AI)/机器学习(ML)模型在临床实践中的应用越来越多,这带来了模型偏差直接造成伤害的风险增加;然而,在许多医疗 AI 应用中,偏差仍然没有得到完全衡量。本文旨在提供一个从多个方面客观评估医疗 AI 的框架,重点关注二元分类模型。

材料和方法

利用来自马萨诸塞州波士顿超过 56000 名 MGB 确诊严重急性呼吸综合征冠状病毒 2(SARS-CoV-2)患者的数据,我们评估了在疫情早期开发的 4 种 AI 模型中存在的未被识别的偏差,这些模型仅基于感染 SARS-CoV-2 前的纵向医疗记录,预测感染 SARS-CoV-2 后的住院、入住 ICU、机械通气和死亡风险。使用模型水平的区分度、准确性和可靠性指标以及一种新的个体水平的误差指标,对模型进行回顾性和前瞻性评估。

结果

我们发现预测模型中存在不一致的模型水平偏差实例。然而,从个体水平来看,我们发现大多数模型对老年患者的预测误差率略高。

讨论

虽然模型在某些任务中可能对某些受保护群体(即表现更差)存在偏差,但它同时也可能对另一个受保护群体(即表现更好)存在偏差。因此,目前的偏差评估研究可能缺乏对模型对其亚群的可变影响的全面描述。

结论

只有全面的评估,以及对未被识别的偏差的仔细搜索,才能提供足够的信息,对 AI 偏差进行公正的判断,从而激发对识别偏差根源的后续调查,并最终做出改变。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/1209fc06d6d9/ocac070f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/e2d7a9077f65/ocac070f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/cb9382346508/ocac070f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/27a496cc779b/ocac070f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/fa8bed9c3641/ocac070f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/1209fc06d6d9/ocac070f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/e2d7a9077f65/ocac070f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/cb9382346508/ocac070f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/27a496cc779b/ocac070f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/fa8bed9c3641/ocac070f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/921f/9277645/1209fc06d6d9/ocac070f5.jpg

相似文献

1
An objective framework for evaluating unrecognized bias in medical AI models predicting COVID-19 outcomes.评估预测 COVID-19 结果的医疗 AI 模型中未被识别偏见的客观框架。
J Am Med Inform Assoc. 2022 Jul 12;29(8):1334-1341. doi: 10.1093/jamia/ocac070.
2
Artificial intelligence for prediction of COVID-19 progression using CT imaging and clinical data.利用 CT 影像和临床数据的人工智能预测 COVID-19 进展。
Eur Radiol. 2022 Jan;32(1):205-212. doi: 10.1007/s00330-021-08049-8. Epub 2021 Jul 5.
3
Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation.利用人工智能预测和分析韩国 COVID-19 严重程度及特征重要性:模型建立与验证。
J Med Internet Res. 2021 Apr 19;23(4):e27060. doi: 10.2196/27060.
4
Patients' Preferences for Artificial Intelligence Applications Versus Clinicians in Disease Diagnosis During the SARS-CoV-2 Pandemic in China: Discrete Choice Experiment.中国 SARS-CoV-2 大流行期间,患者对人工智能应用与临床医生在疾病诊断中的偏好:离散选择实验。
J Med Internet Res. 2021 Feb 23;23(2):e22841. doi: 10.2196/22841.
5
Safety and Efficacy of Imatinib for Hospitalized Adults with COVID-19: A structured summary of a study protocol for a randomised controlled trial.COVID-19 住院成人患者使用伊马替尼的安全性和疗效:一项随机对照试验研究方案的结构化总结。
Trials. 2020 Oct 28;21(1):897. doi: 10.1186/s13063-020-04819-9.
6
Artificial intelligence approach fighting COVID-19 with repurposing drugs.人工智能方法通过药物再利用抗击 COVID-19。
Biomed J. 2020 Aug;43(4):355-362. doi: 10.1016/j.bj.2020.05.001. Epub 2020 May 15.
7
A COVID-19 Pandemic Artificial Intelligence-Based System With Deep Learning Forecasting and Automatic Statistical Data Acquisition: Development and Implementation Study.一种基于人工智能的新冠肺炎大流行深度学习预测与自动统计数据采集系统:开发与实施研究
J Med Internet Res. 2021 May 20;23(5):e27806. doi: 10.2196/27806.
8
"Shortcuts" Causing Bias in Radiology Artificial Intelligence: Causes, Evaluation, and Mitigation.“捷径”导致放射科人工智能产生偏见:原因、评估和缓解。
J Am Coll Radiol. 2023 Sep;20(9):842-851. doi: 10.1016/j.jacr.2023.06.025. Epub 2023 Jul 27.
9
AI-SCoRE (artificial intelligence-SARS CoV2 risk evaluation): a fast, objective and fully automated platform to predict the outcome in COVID-19 patients.AI-SCoRE(人工智能-SARS-CoV2 风险评估):一个快速、客观和完全自动化的平台,用于预测 COVID-19 患者的结局。
Radiol Med. 2022 Sep;127(9):960-972. doi: 10.1007/s11547-022-01518-0. Epub 2022 Aug 29.
10
Artificial intelligence-driven prediction of COVID-19-related hospitalization and death: a systematic review.人工智能驱动的 COVID-19 相关住院和死亡预测:系统评价。
Front Public Health. 2023 Jun 20;11:1183725. doi: 10.3389/fpubh.2023.1183725. eCollection 2023.

引用本文的文献

1
A landscape review with novel criteria to evaluate microbial drivers for cancer: priorities for innovative research targeting excessive cancer mortality in sub-Saharan Africa.一项采用新标准评估癌症微生物驱动因素的全景综述:针对撒哈拉以南非洲地区过高癌症死亡率的创新研究重点
Front Cell Infect Microbiol. 2025 Aug 20;15:1625818. doi: 10.3389/fcimb.2025.1625818. eCollection 2025.
2
Algorithmic bias in public health AI: a silent threat to equity in low-resource settings.公共卫生人工智能中的算法偏见:对资源匮乏地区公平性的潜在威胁。
Front Public Health. 2025 Jul 23;13:1643180. doi: 10.3389/fpubh.2025.1643180. eCollection 2025.
3

本文引用的文献

1
Individualized prediction of COVID-19 adverse outcomes with MLHO.用 MLHO 对 COVID-19 不良结局进行个体化预测。
Sci Rep. 2021 Mar 5;11(1):5322. doi: 10.1038/s41598-021-84781-x.
2
Predicting COVID-19 mortality with electronic medical records.利用电子病历预测新冠肺炎死亡率。
NPJ Digit Med. 2021 Feb 4;4(1):15. doi: 10.1038/s41746-021-00383-x.
3
Calibrating variant-scoring methods for clinical decision making.校准用于临床决策的变异评分方法。
Patient perspective on predictive models in healthcare: translation into practice, ethical implications and limitations?
患者对医疗保健中预测模型的看法:转化为实践、伦理影响及局限性?
BMJ Health Care Inform. 2025 Jan 16;32(1):e101153. doi: 10.1136/bmjhci-2024-101153.
4
Sequestration of imaging studies in MIDRC: stratified sampling to balance demographic characteristics of patients in a multi-institutional data commons.医学影像数据资源中心(MIDRC)中影像研究的隔离:在多机构数据共享库中采用分层抽样以平衡患者的人口统计学特征
J Med Imaging (Bellingham). 2023 Nov;10(6):064501. doi: 10.1117/1.JMI.10.6.064501. Epub 2023 Nov 16.
5
Challenges in and Opportunities for Electronic Health Record-Based Data Analysis and Interpretation.基于电子健康记录的数据分析和解释面临的挑战和机遇。
Gut Liver. 2024 Mar 15;18(2):201-208. doi: 10.5009/gnl230272. Epub 2023 Oct 31.
6
Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study.通过从电子健康记录数据中进行分布式表示学习来表征新冠后长期症状的时间亚表型:一项队列研究
EClinicalMedicine. 2023 Sep 14;64:102210. doi: 10.1016/j.eclinm.2023.102210. eCollection 2023 Oct.
7
A broadly applicable approach to enrich electronic-health-record cohorts by identifying patients with complete data: a multisite evaluation.一种通过识别具有完整数据的患者来丰富电子健康记录队列的广泛适用方法:多站点评估。
J Am Med Inform Assoc. 2023 Nov 17;30(12):1985-1994. doi: 10.1093/jamia/ocad166.
8
Longitudinal assessment of demographic representativeness in the Medical Imaging and Data Resource Center open data commons.医学影像与数据资源中心开放数据共享库中人口统计学代表性的纵向评估。
J Med Imaging (Bellingham). 2023 Nov;10(6):61105. doi: 10.1117/1.JMI.10.6.061105. Epub 2023 Jul 18.
9
Detecting shortcut learning for fair medical AI using shortcut testing.使用捷径测试检测公平医疗 AI 的捷径学习。
Nat Commun. 2023 Jul 18;14(1):4314. doi: 10.1038/s41467-023-39902-7.
10
Medical artificial intelligence ethics: A systematic review of empirical studies.医学人工智能伦理:实证研究的系统综述
Digit Health. 2023 Jul 6;9:20552076231186064. doi: 10.1177/20552076231186064. eCollection 2023 Jan-Dec.
Bioinformatics. 2021 Apr 5;36(24):5709-5711. doi: 10.1093/bioinformatics/btaa943.
4
Addressing bias: artificial intelligence in cardiovascular medicine.应对偏见:心血管医学中的人工智能
Lancet Digit Health. 2020 Dec;2(12):e635-e636. doi: 10.1016/S2589-7500(20)30249-1. Epub 2020 Nov 23.
5
Transitive Sequencing Medical Records for Mining Predictive and Interpretable Temporal Representations.用于挖掘预测性和可解释性时间表征的传递性序列医疗记录
Patterns (N Y). 2020 Jul 10;1(4):100051. doi: 10.1016/j.patter.2020.100051. Epub 2020 Jun 18.
6
Dexamethasone in Hospitalized Patients with Covid-19.地塞米松在 COVID-19 住院患者中的应用。
N Engl J Med. 2021 Feb 25;384(8):693-704. doi: 10.1056/NEJMoa2021436. Epub 2020 Jul 17.
7
Latent bias and the implementation of artificial intelligence in medicine.医学人工智能应用中的潜在偏见
J Am Med Inform Assoc. 2020 Dec 9;27(12):2020-2023. doi: 10.1093/jamia/ocaa094.
8
Remdesivir for the Treatment of Covid-19 - Final Report.瑞德西韦治疗 COVID-19 的疗效 - 最终报告。
N Engl J Med. 2020 Nov 5;383(19):1813-1826. doi: 10.1056/NEJMoa2007764. Epub 2020 Oct 8.
9
Artificial Intelligence in Cardiology: Present and Future.人工智能在心脏病学中的应用:现状与未来。
Mayo Clin Proc. 2020 May;95(5):1015-1039. doi: 10.1016/j.mayocp.2020.01.038.
10
A tutorial on calibration measurements and calibration models for clinical prediction models.临床预测模型的校准测量和校准模型教程。
J Am Med Inform Assoc. 2020 Apr 1;27(4):621-633. doi: 10.1093/jamia/ocz228.