• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估并减少人工智能模型中的亚组差异:对儿童新冠病毒检测结果的分析

Evaluating and Reducing Subgroup Disparity in AI Models: An Analysis of Pediatric COVID-19 Test Outcomes.

作者信息

Libin Alexander, Treitler Jonah T, Vasaitis Tadas, Shao Yijun

机构信息

AIM AHEAD Consortium, Georgetown-Howard Universities Center for Clinical and Translational Science (GHUCCTS), Medstar Research Health Institute, Georgetown University, Washington, D.C., USA.

Thomas Jefferson High School for Science and Technology, Alexandria, Virginia, USA.

出版信息

medRxiv. 2024 Sep 19:2024.09.18.24313889. doi: 10.1101/2024.09.18.24313889.

DOI:10.1101/2024.09.18.24313889
PMID:39371141
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11451670/
Abstract

Artificial Intelligence (AI) fairness in healthcare settings has attracted significant attention due to the concerns to propagate existing health disparities. Despite ongoing research, the frequency and extent of subgroup fairness have not been sufficiently studied. In this study, we extracted a nationally representative pediatric dataset (ages 0-17, n=9,935) from the US National Health Interview Survey (NHIS) concerning COVID-19 test outcomes. For subgroup disparity assessment, we trained 50 models using five machine learning algorithms. We assessed the models' area under the curve (AUC) on 12 small (<15% of the total n) subgroups defined using social economic factors versus the on the overall population. Our results show that subgroup disparities were prevalent (50.7%) in the models. Subgroup AUCs were generally lower, with a mean difference of 0.01, ranging from -0.29 to +0.41. Notably, the disparities were not always statistically significant, with four out of 12 subgroups having statistically significant disparities across models. Additionally, we explored the efficacy of synthetic data in mitigating identified disparities. The introduction of synthetic data enhanced subgroup disparity in 57.7% of the models. The mean AUC disparities for models with synthetic data decreased on average by 0.03 via resampling and 0.04 via generative adverbial network methods.

摘要

由于担心加剧现有的健康差距,人工智能(AI)在医疗保健环境中的公平性受到了广泛关注。尽管研究仍在进行,但亚组公平性的频率和程度尚未得到充分研究。在本研究中,我们从美国国家健康访谈调查(NHIS)中提取了一个具有全国代表性的儿科数据集(0至17岁,n = 9,935),该数据集涉及新冠病毒检测结果。为了评估亚组差异,我们使用五种机器学习算法训练了50个模型。我们评估了这些模型在根据社会经济因素定义的12个小亚组(占总数的<15%)以及总体人群上的曲线下面积(AUC)。我们的结果表明,亚组差异在模型中普遍存在(50.7%)。亚组的AUC通常较低,平均差异为0.01,范围从-0.29到+0.41。值得注意的是,这些差异并不总是具有统计学意义,12个亚组中有4个在各个模型中具有统计学显著差异。此外,我们还探讨了合成数据在减轻已识别差异方面的效果。合成数据的引入在57.7%的模型中加剧了亚组差异。通过重采样,带有合成数据的模型的平均AUC差异平均降低了0.03,通过生成对抗网络方法降低了0.04。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44fd/11451670/fa2cea8edca4/nihpp-2024.09.18.24313889v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44fd/11451670/fa2cea8edca4/nihpp-2024.09.18.24313889v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44fd/11451670/fa2cea8edca4/nihpp-2024.09.18.24313889v1-f0001.jpg

相似文献

1
Evaluating and Reducing Subgroup Disparity in AI Models: An Analysis of Pediatric COVID-19 Test Outcomes.评估并减少人工智能模型中的亚组差异:对儿童新冠病毒检测结果的分析
medRxiv. 2024 Sep 19:2024.09.18.24313889. doi: 10.1101/2024.09.18.24313889.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Enhancing the fairness of AI prediction models by Quasi-Pareto improvement among heterogeneous thyroid nodule population.通过异质甲状腺结节人群中的拟 Pareto 改进提高 AI 预测模型的公平性。
Nat Commun. 2024 Mar 4;15(1):1958. doi: 10.1038/s41467-024-44906-y.
4
A hybrid artificial intelligence model leverages multi-centric clinical data to improve fetal heart rate pregnancy prediction across time-lapse systems.一种混合人工智能模型利用多中心临床数据,改善跨时间 lapse 系统的胎儿心率妊娠预测。
Hum Reprod. 2023 Apr 3;38(4):596-608. doi: 10.1093/humrep/dead023.
5
Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis.用于人工智能稳健、隐私保护训练的合成医学图像:在早产儿视网膜病变诊断中的应用
Ophthalmol Sci. 2022 Feb 11;2(2):100126. doi: 10.1016/j.xops.2022.100126. eCollection 2022 Jun.
6
The Problem of Fairness in Synthetic Healthcare Data.合成医疗数据中的公平性问题。
Entropy (Basel). 2021 Sep 4;23(9):1165. doi: 10.3390/e23091165.
7
Unmasking bias in artificial intelligence: a systematic review of bias detection and mitigation strategies in electronic health record-based models.揭开人工智能中的偏见:基于电子健康记录模型的偏见检测和缓解策略的系统评价。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1172-1183. doi: 10.1093/jamia/ocae060.
8
Machine learning models trained on synthetic datasets of multiple sample sizes for the use of predicting blood pressure from clinical data in a national dataset.使用来自国家数据集的临床数据,对多种样本量的合成数据集进行训练的机器学习模型,用于预测血压。
PLoS One. 2023 Mar 16;18(3):e0283094. doi: 10.1371/journal.pone.0283094. eCollection 2023.
9
Evaluating and mitigating bias in machine learning models for cardiovascular disease prediction.评估和减轻心血管疾病预测机器学习模型中的偏差。
J Biomed Inform. 2023 Feb;138:104294. doi: 10.1016/j.jbi.2023.104294. Epub 2023 Jan 24.
10
A scoping review of fair machine learning techniques when using real-world data.使用真实世界数据时公平机器学习技术的范围综述。
J Biomed Inform. 2024 Mar;151:104622. doi: 10.1016/j.jbi.2024.104622. Epub 2024 Mar 6.

本文引用的文献

1
The Impact of Generative Conversational Artificial Intelligence on the Lesbian, Gay, Bisexual, Transgender, and Queer Community: Scoping Review.生成式对话人工智能对女同性恋、男同性恋、双性恋、跨性别和酷儿群体的影响:范围综述。
J Med Internet Res. 2023 Dec 6;25:e52091. doi: 10.2196/52091.
2
Bias in artificial intelligence in vascular surgery.人工智能在血管外科学中的偏见。
Semin Vasc Surg. 2023 Sep;36(3):430-434. doi: 10.1053/j.semvascsurg.2023.07.003. Epub 2023 Aug 1.
3
Protecting patient safety and privacy in the era of artificial intelligence.
保护人工智能时代的患者安全和隐私。
Semin Vasc Surg. 2023 Sep;36(3):426-429. doi: 10.1053/j.semvascsurg.2023.06.002. Epub 2023 Jun 24.
4
ChatGPT-4: An assessment of an upgraded artificial intelligence chatbot in the United States Medical Licensing Examination.ChatGPT-4:美国医师执照考试中人工智能聊天机器人的升级评估。
Med Teach. 2024 Mar;46(3):366-372. doi: 10.1080/0142159X.2023.2249588. Epub 2023 Oct 15.
5
Comparing ChatGPT and GPT-4 performance in USMLE soft skill assessments.比较 ChatGPT 和 GPT-4 在 USMLE 软技能评估中的表现。
Sci Rep. 2023 Oct 1;13(1):16492. doi: 10.1038/s41598-023-43436-9.
6
COVID-19 Disease Characterization and Outcomes Comparison in Pediatrics.儿童新型冠状病毒肺炎的疾病特征及结局比较
HCA Healthc J Med. 2023 Aug 29;4(4):283-290. doi: 10.36518/2689-0216.1522. eCollection 2023.
7
Effect of physio-cognitive dual-task training on cognition in pre-ageing and older adults with neurocognitive disorders: A meta-analysis and meta-regression of randomized controlled trial.生理认知双任务训练对患有神经认知障碍的衰老前期和老年人认知的影响:一项随机对照试验的荟萃分析和荟萃回归
Arch Gerontol Geriatr. 2024 Jan;116:105161. doi: 10.1016/j.archger.2023.105161. Epub 2023 Aug 19.
8
Sociodemographic Differences in COVID-19 Pandemic Experiences Among Families in the United States.美国家庭在新冠疫情经历方面的社会人口统计学差异。
JAMA Netw Open. 2023 Aug 1;6(8):e2330495. doi: 10.1001/jamanetworkopen.2023.30495.
9
Rapid Triage of Children with Suspected COVID-19 Using Laboratory-Based Machine-Learning Algorithms.基于实验室的机器学习算法对疑似 COVID-19 患儿的快速分诊。
Viruses. 2023 Jul 8;15(7):1522. doi: 10.3390/v15071522.
10
Disparities in child and adolescent COVID-19 vaccination coverage and parental intent toward vaccinations for their children and adolescents.儿童和青少年 COVID-19 疫苗接种覆盖率的差异,以及父母对其子女接种疫苗的意愿。
Ann Med. 2023 Dec;55(1):2232818. doi: 10.1080/07853890.2023.2232818.