Suppr超能文献

机器学习算法在预测成年人未接受预防牙科护理方面的公平性。

Fairness of Machine Learning Algorithms for Predicting Foregone Preventive Dental Care for Adults.

机构信息

Harvard School of Dental Medicine, Harvard University, Boston, Massachusetts.

School of Public Health, University of Sao Paulo, Sao Paulo, Brazil.

出版信息

JAMA Netw Open. 2023 Nov 1;6(11):e2341625. doi: 10.1001/jamanetworkopen.2023.41625.

Abstract

IMPORTANCE

Access to routine dental care prevents advanced dental disease and improves oral and overall health. Identifying individuals at risk of foregoing preventive dental care can direct prevention efforts toward high-risk populations.

OBJECTIVE

To predict foregone preventive dental care among adults overall and in sociodemographic subgroups and to assess the algorithmic fairness.

DESIGN, SETTING, AND PARTICIPANTS: This prognostic study was a secondary analyses of longitudinal data from the US Medical Expenditure Panel Survey (MEPS) from 2016 to 2019, each with 2 years of follow-up. Participants included adults aged 18 years and older. Data analysis was performed from December 2022 to June 2023.

EXPOSURE

A total of 50 predictors, including demographic and socioeconomic characteristics, health conditions, behaviors, and health services use, were assessed.

MAIN OUTCOMES AND MEASURES

The outcome of interest was foregoing preventive dental care, defined as either cleaning, general examination, or an appointment with the dental hygienist, in the past year.

RESULTS

Among 32 234 participants, the mean (SD) age was 48.5 (18.2) years and 17 386 participants (53.9%) were female; 1935 participants (6.0%) were Asian, 5138 participants (15.9%) were Black, 7681 participants (23.8%) were Hispanic, 16 503 participants (51.2%) were White, and 977 participants (3.0%) identified as other (eg, American Indian and Alaska Native) or multiple racial or ethnic groups. There were 21 083 (65.4%) individuals who missed preventive dental care in the past year. The algorithms demonstrated high performance, achieving an area under the receiver operating characteristic curve (AUC) of 0.84 (95% CI, 0.84-0.85) in the overall population. While the full sample model performed similarly when applied to White individuals and older adults (AUC, 0.88; 95% CI, 0.87-0.90), there was a loss of performance for other subgroups. Removing the subgroup-sensitive predictors (ie, race and ethnicity, age, and income) did not impact model performance. Models stratified by race and ethnicity performed similarly or worse than the full model for all groups, with the lowest performance for individuals who identified as other or multiple racial groups (AUC, 0.76; 95% CI, 0.70-0.81). Previous pattern of dental visits, health care utilization, dental benefits, and sociodemographic characteristics were the highest contributing predictors to the models' performance.

CONCLUSIONS AND RELEVANCE

Findings of this prognostic study using cohort data suggest that tree-based ensemble machine learning models could accurately predict adults at risk of foregoing preventive dental care and demonstrated bias against underrepresented sociodemographic groups. These results highlight the importance of evaluating model fairness during development and testing to avoid exacerbating existing biases.

摘要

重要性

定期进行牙科护理可以预防严重的牙科疾病,提高口腔和整体健康水平。识别可能会放弃预防性牙科护理的个体,可以将预防措施针对高风险人群。

目的

预测总体和特定社会人口亚组中成年人放弃预防性牙科护理的情况,并评估算法的公平性。

设计、设置和参与者:这是一项对来自 2016 年至 2019 年美国医疗支出调查(MEPS)的纵向数据进行的二次分析,每个数据均有 2 年的随访期。参与者包括年龄在 18 岁及以上的成年人。数据分析于 2023 年 6 月从 2022 年 12 月开始进行。

暴露情况

共评估了 50 个预测因素,包括人口统计学和社会经济特征、健康状况、行为和卫生服务使用情况。

主要结果和测量

主要结局是过去一年中放弃预防性牙科护理,定义为清洁、一般检查或与牙科保健员预约。

结果

在 32234 名参与者中,平均(SD)年龄为 48.5(18.2)岁,17386 名参与者(53.9%)为女性;1935 名参与者(6.0%)为亚洲人,5138 名参与者(15.9%)为黑人,7681 名参与者(23.8%)为西班牙裔,16503 名参与者(51.2%)为白人,977 名参与者(3.0%)表示为其他(如美国印第安人和阿拉斯加原住民)或多种种族或族裔群体。过去一年中有 21083 名(65.4%)参与者错过了预防性牙科护理。该算法表现出了较高的性能,在总体人群中的接受者操作特征曲线(AUC)为 0.84(95%CI,0.84-0.85)。虽然全样本模型在应用于白人个体和老年人时表现相似(AUC,0.88;95%CI,0.87-0.90),但对于其他亚组,其性能有所下降。删除与亚组相关的预测因子(即种族和族裔、年龄和收入)不会影响模型性能。按种族和族裔分层的模型在所有组中表现相似或逊于全模型,其他种族或多种族群体的个体表现最差(AUC,0.76;95%CI,0.70-0.81)。之前的牙科就诊模式、卫生保健利用情况、牙科福利和社会人口统计学特征是模型性能的最高贡献预测因素。

结论和相关性

本队列数据预测性研究的结果表明,基于树的集成机器学习模型可以准确预测有放弃预防性牙科护理风险的成年人,并显示出对代表性不足的社会人口亚组的偏见。这些结果强调了在开发和测试过程中评估模型公平性的重要性,以避免加剧现有的偏见。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5120/10625037/bad254a687c7/jamanetwopen-e2341625-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验