• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

新兴的算法偏差:公平性漂移作为模型维护与可持续性的新维度

Emerging algorithmic bias: fairness drift as the next dimension of model maintenance and sustainability.

作者信息

Davis Sharon E, Dorn Chad, Park Daniel J, Matheny Michael E

机构信息

Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

出版信息

J Am Med Inform Assoc. 2025 May 1;32(5):845-854. doi: 10.1093/jamia/ocaf039.

DOI:10.1093/jamia/ocaf039
PMID:40079820
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12012346/
Abstract

OBJECTIVES

While performance drift of clinical prediction models is well-documented, the potential for algorithmic biases to emerge post-deployment has had limited characterization. A better understanding of how temporal model performance may shift across subpopulations is required to incorporate fairness drift into model maintenance strategies.

MATERIALS AND METHODS

We explore fairness drift in a national population over 11 years, with and without model maintenance aimed at sustaining population-level performance. We trained random forest models predicting 30-day post-surgical readmission, mortality, and pneumonia using 2013 data from US Department of Veterans Affairs facilities. We evaluated performance quarterly from 2014 to 2023 by self-reported race and sex. We estimated discrimination, calibration, and accuracy, and operationalized fairness using metric parity measured as the gap between disadvantaged and advantaged groups.

RESULTS

Our cohort included 1 739 666 surgical cases. We observed fairness drift in both the original and temporally updated models. Model updating had a larger impact on overall performance than fairness gaps. During periods of stable fairness, updating models at the population level increased, decreased, or did not impact fairness gaps. During periods of fairness drift, updating models restored fairness in some cases and exacerbated fairness gaps in others.

DISCUSSION

This exploratory study highlights that algorithmic fairness cannot be assured through one-time assessments during model development. Temporal changes in fairness may take multiple forms and interact with model updating strategies in unanticipated ways.

CONCLUSION

Equitable and sustainable clinical artificial intelligence deployments will require novel methods to monitor algorithmic fairness, detect emerging bias, and adopt model updates that promote fairness.

摘要

目的

虽然临床预测模型的性能漂移已有充分记录,但算法偏差在部署后出现的可能性却鲜有描述。为了将公平性漂移纳入模型维护策略,需要更好地理解时间模型性能在亚群体间可能如何变化。

材料与方法

我们在11年的全国人群中探索公平性漂移,有无旨在维持人群水平性能的模型维护措施。我们使用美国退伍军人事务部设施2013年的数据训练了随机森林模型,以预测术后30天再入院、死亡率和肺炎。从2014年到2023年,我们按自我报告的种族和性别每季度评估一次性能。我们估计了歧视、校准和准确性,并使用以弱势群体和优势群体之间差距衡量的指标均等化来实现公平性。

结果

我们的队列包括1739666例手术病例。我们在原始模型和时间更新模型中均观察到公平性漂移。模型更新对整体性能的影响大于公平性差距。在公平性稳定期间,在人群水平上更新模型会增加、减少或不影响公平性差距。在公平性漂移期间,更新模型在某些情况下恢复了公平性,而在其他情况下加剧了公平性差距。

讨论

这项探索性研究强调,算法公平性不能通过模型开发期间的一次性评估来确保。公平性的时间变化可能有多种形式,并以意想不到的方式与模型更新策略相互作用。

结论

公平和可持续的临床人工智能部署将需要新的方法来监测算法公平性、检测新出现的偏差,并采用促进公平性的模型更新。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/1bd33d8c948a/ocaf039f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/3b69a971e5a5/ocaf039f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/7af1beb6b89e/ocaf039f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/95c327768c53/ocaf039f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/645de2be5f2e/ocaf039f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/68a0e484f359/ocaf039f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/1bd33d8c948a/ocaf039f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/3b69a971e5a5/ocaf039f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/7af1beb6b89e/ocaf039f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/95c327768c53/ocaf039f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/645de2be5f2e/ocaf039f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/68a0e484f359/ocaf039f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e3c/12012346/1bd33d8c948a/ocaf039f6.jpg

相似文献

1
Emerging algorithmic bias: fairness drift as the next dimension of model maintenance and sustainability.新兴的算法偏差:公平性漂移作为模型维护与可持续性的新维度
J Am Med Inform Assoc. 2025 May 1;32(5):845-854. doi: 10.1093/jamia/ocaf039.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
5
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
6
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
7
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
9
Computer and mobile technology interventions for self-management in chronic obstructive pulmonary disease.用于慢性阻塞性肺疾病自我管理的计算机和移动技术干预措施。
Cochrane Database Syst Rev. 2017 May 23;5(5):CD011425. doi: 10.1002/14651858.CD011425.pub2.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

引用本文的文献

1
Gaps in Artificial Intelligence Research for Rural Health in the United States: A Scoping Review.美国农村卫生人工智能研究的差距:一项范围综述
medRxiv. 2025 Jun 27:2025.06.26.25330361. doi: 10.1101/2025.06.26.25330361.
2
The ethics of data mining in healthcare: challenges, frameworks, and future directions.医疗保健领域数据挖掘的伦理问题:挑战、框架及未来方向。
BioData Min. 2025 Jul 11;18(1):47. doi: 10.1186/s13040-025-00461-w.

本文引用的文献

1
Sustainable deployment of clinical prediction tools-a 360° approach to model maintenance.临床预测工具的可持续部署——模型维护的全方位方法。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1195-1198. doi: 10.1093/jamia/ocae036.
2
Evaluation of clinical prediction models (part 1): from development to external validation.临床预测模型的评估(第 1 部分):从建立到外部验证。
BMJ. 2024 Jan 8;384:e074819. doi: 10.1136/bmj-2023-074819.
3
American College of Surgeons NSQIP Risk Calculator Accuracy Using a Machine Learning Algorithm Compared with Regression.
与回归法相比,使用机器学习算法的美国外科医师学会NSQIP风险计算器的准确性
J Am Coll Surg. 2023 May 1;236(5):1024-1030. doi: 10.1097/XCS.0000000000000556. Epub 2023 Jan 12.
4
Performance drift in a mortality prediction algorithm among patients with cancer during the SARS-CoV-2 pandemic.在 SARS-CoV-2 大流行期间,癌症患者的死亡率预测算法中的性能漂移。
J Am Med Inform Assoc. 2023 Jan 18;30(2):348-354. doi: 10.1093/jamia/ocac221.
5
Machine Learning Model Drift: Predicting Diagnostic Imaging Follow-Up as a Case Example.机器学习模型漂移:以诊断成像随访为例进行预测。
J Am Coll Radiol. 2022 Oct;19(10):1162-1169. doi: 10.1016/j.jacr.2022.05.030. Epub 2022 Aug 16.
6
Maintaining a National Acute Kidney Injury Risk Prediction Model to Support Local Quality Benchmarking.维持国家急性肾损伤风险预测模型以支持本地质量基准测试。
Circ Cardiovasc Qual Outcomes. 2022 Aug;15(8):e008635. doi: 10.1161/CIRCOUTCOMES.121.008635. Epub 2022 Aug 12.
7
Addressing fairness in artificial intelligence for medical imaging.解决医学影像人工智能中的公平性问题。
Nat Commun. 2022 Aug 6;13(1):4581. doi: 10.1038/s41467-022-32186-3.
8
Quantification of Sepsis Model Alerts in 24 US Hospitals Before and During the COVID-19 Pandemic.24 家美国医院在 COVID-19 大流行前后对脓毒症模型警报的量化。
JAMA Netw Open. 2021 Nov 1;4(11):e2135286. doi: 10.1001/jamanetworkopen.2021.35286.
9
The Clinician and Dataset Shift in Artificial Intelligence.临床医生与人工智能中的数据集偏移
N Engl J Med. 2021 Jul 15;385(3):283-286. doi: 10.1056/NEJMc2104626.
10
Addressing bias in prediction models by improving subpopulation calibration.通过改进子群体校准来解决预测模型中的偏差。
J Am Med Inform Assoc. 2021 Mar 1;28(3):549-558. doi: 10.1093/jamia/ocaa283.