Department of Biostatistics and Bioinformatics, School of Medicine, Duke University, Durham, North Carolina; AI Health, School of Medicine, Duke University, Durham, North Carolina.
Institute for Health Equity Research, Department of Population Health, Icahn School of Medicine at Mount Sinai, New York, New York; Barbara T. Murphy Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York.
Am J Kidney Dis. 2024 Dec;84(6):780-786. doi: 10.1053/j.ajkd.2024.04.008. Epub 2024 Jun 6.
There has been a steady rise in the use of clinical decision support (CDS) tools to guide nephrology as well as general clinical care. Through guidance set by federal agencies and concerns raised by clinical investigators, there has been an equal rise in understanding whether such tools exhibit algorithmic bias leading to unfairness. This has spurred the more fundamental question of whether sensitive variables such as race should be included in CDS tools. In order to properly answer this question, it is necessary to understand how algorithmic bias arises. We break down 3 sources of bias encountered when using electronic health record data to develop CDS tools: (1) use of proxy variables, (2) observability concerns and (3) underlying heterogeneity. We discuss how answering the question of whether to include sensitive variables like race often hinges more on qualitative considerations than on quantitative analysis, dependent on the function that the sensitive variable serves. Based on our experience with our own institution's CDS governance group, we show how health system-based governance committees play a central role in guiding these difficult and important considerations. Ultimately, our goal is to foster a community practice of model development and governance teams that emphasizes consciousness about sensitive variables and prioritizes equity.
临床决策支持 (CDS) 工具在指导肾脏病学和一般临床护理方面的使用稳步增加。通过联邦机构的指导和临床研究人员提出的担忧,人们越来越了解这些工具是否存在导致不公平的算法偏差。这引发了一个更基本的问题,即敏感变量(如种族)是否应包含在 CDS 工具中。为了正确回答这个问题,有必要了解算法偏差是如何产生的。我们分解了在使用电子健康记录数据开发 CDS 工具时遇到的 3 个偏差源:(1)代理变量的使用,(2)可观察性问题和(3)潜在的异质性。我们讨论了回答是否包含敏感变量(如种族)的问题,这通常更多地取决于定性考虑因素,而不是定量分析,这取决于敏感变量所起的作用。基于我们在自己机构的 CDS 治理小组的经验,我们展示了基于健康系统的治理委员会如何在指导这些困难和重要的考虑因素方面发挥核心作用。最终,我们的目标是培养一种强调对敏感变量的意识并优先考虑公平的模型开发和治理团队的社区实践。