Wastvedt Solvejg, Snoke Joshua, Agniel Denis, Lai Julie, Elliott Marc N, Martino Steven C
Department of Statistics & Data Science, NORC at the University of Chicago, Chicago, IL 60603, United States.
RAND Corporation, Pittsburgh, PA 15213, United States.
Biometrics. 2024 Oct 3;80(4). doi: 10.1093/biomtc/ujae155.
Health care decisions are increasingly informed by clinical decision support algorithms, but these algorithms may perpetuate or increase racial and ethnic disparities in access to and quality of health care. Further complicating the problem, clinical data often have missing or poor quality racial and ethnic information, which can lead to misleading assessments of algorithmic bias. We present novel statistical methods that allow for the use of probabilities of racial/ethnic group membership in assessments of algorithm performance and quantify the statistical bias that results from error in these imputed group probabilities. We propose a sensitivity analysis approach to estimating the statistical bias that allows practitioners to assess disparities in algorithm performance under a range of assumed levels of group probability error. We also prove theoretical bounds on the statistical bias for a set of commonly used fairness metrics and describe real-world scenarios where our theoretical results are likely to apply. We present a case study using imputed race and ethnicity from the modified Bayesian Improved First and Surname Geocoding algorithm for estimation of disparities in a clinical decision support algorithm used to inform osteoporosis treatment. Our novel methods allow policymakers to understand the range of potential disparities under a given algorithm even when race and ethnicity information is missing and to make informed decisions regarding the implementation of machine learning for clinical decision support.
医疗保健决策越来越多地受到临床决策支持算法的影响,但这些算法可能会使医疗保健获取和质量方面的种族和族裔差异长期存在或加剧。使问题更加复杂的是,临床数据往往缺少或质量不佳的种族和族裔信息,这可能导致对算法偏差的误导性评估。我们提出了新颖的统计方法,这些方法允许在评估算法性能时使用种族/族裔群体成员概率,并量化由这些估算的群体概率中的误差导致的统计偏差。我们提出一种敏感性分析方法来估计统计偏差,使从业者能够在一系列假定的群体概率误差水平下评估算法性能的差异。我们还证明了一组常用公平性指标的统计偏差的理论界限,并描述了我们的理论结果可能适用的现实世界场景。我们使用来自改进的贝叶斯改进姓氏地理编码算法的估算种族和族裔进行了一个案例研究,以估计用于指导骨质疏松症治疗的临床决策支持算法中的差异。我们的新方法使政策制定者能够在即使缺少种族和族裔信息的情况下,了解给定算法下潜在差异的范围,并就将机器学习用于临床决策支持的实施做出明智决策。