School of Computing And Augmented Intelligence Arizona State University, Phoenix, Arziona, United States of America.
Mayo Clinic Arizona Department of Radiology, Phoenix, Arziona, United States of America.
PLoS One. 2024 Nov 22;19(11):e0303415. doi: 10.1371/journal.pone.0303415. eCollection 2024.
Advancement of AI has opened new possibility for accurate diagnosis and prognosis using digital histopathology slides which not only saves hours of expert effort but also makes the estimation more standardized and accurate. However, preserving the AI model performance on the external sites is an extremely challenging problem in the histopathology domain which is primarily due to the difference in data acquisition and/or sampling bias. Although, AI models can also learn spurious correlation, they provide unequal performance across validation population. While it is crucial to detect and remove the bias from the AI model before the clinical application, the cause of the bias is often unknown. We proposed a Causal Survival model that can reduce the effect of unknown bias by leveraging the causal reasoning framework. We use the model to predict recurrence-free survival for the colorectal cancer patients using quantitative histopathology features from seven geographically distributed sites and achieve equalized performance compared to the baseline traditional Cox Proportional Hazards and DeepSurvival model. Through ablation study, we demonstrated benefit of novel addition of latent probability adjustment and auxiliary losses. Although detection of cause of unknown bias is unsolved, we proposed a causal debiasing solution to reduce the bias and improve the AI model generalizibility on the histopathology domain across sites. Open-source codebase for the model training can be accessed from https://github.com/ramon349/fair_survival.git.
人工智能的进步为使用数字组织病理学幻灯片进行准确诊断和预后开辟了新的可能性,这不仅节省了数小时的专家工作,而且使评估更加标准化和准确。然而,在组织病理学领域,在外部站点上保持人工智能模型性能是一个极其具有挑战性的问题,主要是由于数据采集和/或采样偏差的差异。尽管人工智能模型也可以学习虚假相关性,但它们在验证人群中的表现并不均等。虽然在临床应用之前检测和消除人工智能模型中的偏差至关重要,但偏差的原因通常是未知的。我们提出了一种因果生存模型,该模型可以通过利用因果推理框架来减少未知偏差的影响。我们使用该模型使用来自七个地理位置分布的站点的定量组织病理学特征来预测结直肠癌患者的无复发生存率,并与基线传统 Cox 比例风险和 DeepSurvival 模型相比实现了均等性能。通过消融研究,我们证明了潜在概率调整和辅助损失的新添加的益处。尽管未知偏差原因的检测尚未解决,但我们提出了一种因果去偏解决方案,以减少偏差并提高组织病理学领域跨站点的人工智能模型通用性。模型训练的开源代码库可从 https://github.com/ramon349/fair_survival.git 访问。