Department of Diagnostic and Interventional Radiology, University Hospital RWTH Aachen, 52074 Aachen, Germany.
Else Kroener Fresenius Center for Digital Health, Technical University Dresden, 01307 Dresden, Germany.
Cell Rep Med. 2024 Sep 17;5(9):101713. doi: 10.1016/j.xcrm.2024.101713. Epub 2024 Sep 5.
Reliably detecting potentially misleading patterns in automated diagnostic assistance systems, such as those powered by artificial intelligence (AI), is crucial for instilling user trust and ensuring reliability. Current techniques fall short in visualizing such confounding factors. We propose DiffChest, a self-conditioned diffusion model trained on 515,704 chest radiographs from 194,956 patients across the US and Europe. DiffChest provides patient-specific explanations and visualizes confounding factors that might mislead the model. The high inter-reader agreement, with Fleiss' kappa values of 0.8 or higher, validates its capability to identify treatment-related confounders. Confounders are accurately detected with 10%-100% prevalence rates. The pretraining process optimizes the model for relevant imaging information, resulting in excellent diagnostic accuracy for 11 chest conditions, including pleural effusion and heart insufficiency. Our findings highlight the potential of diffusion models in medical image classification, providing insights into confounding factors and enhancing model robustness and reliability.
可靠地检测自动化诊断辅助系统(如人工智能(AI)驱动的系统)中潜在的误导性模式对于建立用户信任和确保可靠性至关重要。当前的技术在可视化这些混杂因素方面存在不足。我们提出了 DiffChest,这是一个基于来自美国和欧洲的 194956 名患者的 515704 张胸部 X 光片进行自我条件化扩散模型训练的模型。DiffChest 提供了针对患者的解释,并可视化了可能误导模型的混杂因素。高的读者间一致性,Fleiss' kappa 值为 0.8 或更高,验证了它识别与治疗相关的混杂因素的能力。混杂因素的检出率为 10%-100%。预训练过程优化了模型对相关成像信息的处理,从而对 11 种胸部疾病(包括胸腔积液和心脏功能不全)的诊断准确性达到了卓越水平。我们的研究结果强调了扩散模型在医学图像分类中的潜力,为混杂因素提供了深入的了解,并增强了模型的稳健性和可靠性。