Jackson Nicholas J, Yan Chao, Malin Bradley A
Vanderbilt University, Nashville, TN.
Vanderbilt University Medical Center, Nashville, TN.
AMIA Annu Symp Proc. 2025 May 22;2024:551-560. eCollection 2024.
The use of artificial intelligence (AI) in medicine has shown promise to improve the quality of healthcare decisions. However, AI can be biased in a manner that produces unfair predictions for certain demographic subgroups. In MIMIC-CXR, a publicly available dataset of over 300,000 chest X-ray images, diagnostic AI has been shown to have a higher false negative rate for racial minorities. We evaluated the capacity of synthetic data augmentation, oversampling, and demographic-based corrections to enhance the fairness of AI predictions. We show that adjusting unfair predictions for demographic attributes, such as race, is ineffective at improving fairness or predictive performance. However, using oversampling and synthetic data augmentation to modify disease prevalence reduced such disparities by 74.7% and 10.6%, respectively. Moreover, such fairness gains were accomplished without reduction in performance (95% CI AUC: [0.816, 0.820] versus [0.810, 0.819] versus [0.817, 0.821] for baseline, oversampling, and augmentation, respectively).
人工智能(AI)在医学中的应用已显示出有望提高医疗决策的质量。然而,人工智能可能存在偏差,对某些人口亚组产生不公平的预测。在MIMIC-CXR(一个包含超过30万张胸部X光图像的公开可用数据集)中,诊断性人工智能对少数族裔的假阴性率更高。我们评估了合成数据增强、过采样和基于人口统计学的校正方法增强人工智能预测公平性的能力。我们发现,针对种族等人口属性调整不公平预测在提高公平性或预测性能方面是无效的。然而,使用过采样和合成数据增强来改变疾病流行率分别将这种差异降低了74.7%和10.6%。此外,在不降低性能的情况下实现了这种公平性提升(基线、过采样和增强的95%置信区间AUC分别为[0.816, 0.820]、[0.810, 0.819]和[0.817, 0.821])。