University of Toronto, Toronto, Ontario, Canada.
Vector Institute, Toronto, Ontario, Canada.
Nat Med. 2021 Dec;27(12):2176-2182. doi: 10.1038/s41591-021-01595-0. Epub 2021 Dec 10.
Artificial intelligence (AI) systems have increasingly achieved expert-level performance in medical imaging applications. However, there is growing concern that such AI systems may reflect and amplify human bias, and reduce the quality of their performance in historically under-served populations such as female patients, Black patients, or patients of low socioeconomic status. Such biases are especially troubling in the context of underdiagnosis, whereby the AI algorithm would inaccurately label an individual with a disease as healthy, potentially delaying access to care. Here, we examine algorithmic underdiagnosis in chest X-ray pathology classification across three large chest X-ray datasets, as well as one multi-source dataset. We find that classifiers produced using state-of-the-art computer vision techniques consistently and selectively underdiagnosed under-served patient populations and that the underdiagnosis rate was higher for intersectional under-served subpopulations, for example, Hispanic female patients. Deployment of AI systems using medical imaging for disease diagnosis with such biases risks exacerbation of existing care biases and can potentially lead to unequal access to medical treatment, thereby raising ethical concerns for the use of these models in the clinic.
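To make the abstract's central metric concrete, the following is a minimal sketch (not the authors' released code) of how a per-subgroup underdiagnosis rate can be computed: the rate at which patients who truly have at least one pathology are nonetheless classified as healthy ("No Finding"). The column names (`no_finding_true`, `no_finding_score`, `sex`, `ethnicity`) and the 0.5 decision threshold are illustrative assumptions, not details taken from the paper.

```python
import pandas as pd

def underdiagnosis_rate(df: pd.DataFrame, group_cols: list[str],
                        threshold: float = 0.5) -> pd.Series:
    """Rate at which diseased patients are labeled 'No Finding',
    broken out by the given subgroup columns (illustrative sketch)."""
    # Restrict to patients whose ground truth indicates disease,
    # i.e. the true 'No Finding' label is 0.
    sick = df[df["no_finding_true"] == 0]
    # An underdiagnosis occurs when the model's 'No Finding' score
    # crosses the decision threshold for a patient who is sick.
    underdiagnosed = sick["no_finding_score"] >= threshold
    # Mean of the boolean indicator per subgroup = underdiagnosis rate.
    return underdiagnosed.groupby([sick[c] for c in group_cols]).mean()

# Single-attribute and intersectional rates, mirroring the abstract's
# observation that intersectional subgroups (e.g. Hispanic female
# patients) can show the highest underdiagnosis rates:
# rates_by_sex = underdiagnosis_rate(preds, ["sex"])
# rates_intersect = underdiagnosis_rate(preds, ["ethnicity", "sex"])
```

Comparing such rates across subgroups, rather than reporting a single aggregate error rate, is what exposes the selective underdiagnosis the abstract describes.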