Suppr超能文献

机器学习算法中的偏差评估和修正:在自然语言处理算法中识别有不健康饮酒行为的住院患者的案例研究。

Bias Assessment and Correction in Machine Learning Algorithms: A Use-Case in a Natural Language Processing Algorithm to Identify Hospitalized Patients with Unhealthy Alcohol Use.

机构信息

Loyola University Chicago Stritch School of Medicine, Maywood, IL.

Loyola University Chicago, Chicago, IL.

出版信息

AMIA Annu Symp Proc. 2022 Feb 21;2021:247-254. eCollection 2021.

Abstract

Unhealthy alcohol use represents a major economic burden and cause of morbidity and mortality in the United States. Implementation of interventions for unhealthy alcohol use depends on the availability and accuracy of screening tools. Our group previously applied methods in natural language processing and machine learning to build a classifier for unhealthy alcohol use. In this study, we sought to evaluate and address bias through the use-case of our classifier. We demonstrated the presence of biased unhealthy alcohol use risk underestimation among Hispanic compared to Non-Hispanic White trauma inpatients, 18- to 44-year-old compared to 45 years and older medical/surgical inpatients, and Non-Hispanic Black compared to Non-Hispanic White medical/surgical inpatients. We further showed that intercept, slope, and concurrent intercept and slope recalibration resulted in minimal or no improvements in bias-indicating metrics within these subgroups. Our results exemplify the importance of integrating bias assessment early into the classifier development pipeline.

摘要

在美国,不健康的饮酒行为是一个主要的经济负担,也是发病率和死亡率的一个主要原因。实施针对不健康饮酒的干预措施取决于筛查工具的可用性和准确性。我们的团队之前应用自然语言处理和机器学习方法来构建一个用于不健康饮酒的分类器。在这项研究中,我们通过使用我们的分类器来评估和解决偏见问题。我们发现,与非西班牙裔白人创伤住院患者相比,西班牙裔患者的不健康饮酒风险被低估;与 45 岁及以上的内科/外科住院患者相比,18 至 44 岁的内科/外科住院患者的不健康饮酒风险被低估;与非西班牙裔白人内科/外科住院患者相比,非西班牙裔黑人患者的不健康饮酒风险被低估。我们还发现,在这些亚组中,截距、斜率以及同时对截距和斜率进行重新校准,对指示偏倚的指标几乎没有或没有任何改善。我们的研究结果说明了在分类器开发过程中尽早纳入偏差评估的重要性。

相似文献

5
Validation of an alcohol misuse classifier in hospitalized patients.住院患者酒精滥用分类器的验证。
Alcohol. 2020 May;84:49-55. doi: 10.1016/j.alcohol.2019.09.008. Epub 2019 Sep 28.
7
Investigation of bias in the automated assessment of school violence.学校暴力自动评估中的偏差研究。
J Biomed Inform. 2024 Sep;157:104709. doi: 10.1016/j.jbi.2024.104709. Epub 2024 Aug 15.

引用本文的文献

6
Medical artificial intelligence ethics: A systematic review of empirical studies.医学人工智能伦理:实证研究的系统综述
Digit Health. 2023 Jul 6;9:20552076231186064. doi: 10.1177/20552076231186064. eCollection 2023 Jan-Dec.

本文引用的文献

3
Racial disparities in automated speech recognition.种族差异与自动化语音识别。
Proc Natl Acad Sci U S A. 2020 Apr 7;117(14):7684-7689. doi: 10.1073/pnas.1915768117. Epub 2020 Mar 23.
8
Validation of an alcohol misuse classifier in hospitalized patients.住院患者酒精滥用分类器的验证。
Alcohol. 2020 May;84:49-55. doi: 10.1016/j.alcohol.2019.09.008. Epub 2019 Sep 28.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验