Shen Shiwen, Han Simon X, Aberle Denise R, Bui Alex A, Hsu William
Department of Bioengineering, University of California, Los Angeles, CA, USA.
Medical & Imaging Informatics Group, Department of Radiological Sciences, University of California, Los Angeles, CA, USA.
Expert Syst Appl. 2019 Aug 15;128:84-95. doi: 10.1016/j.eswa.2019.01.048. Epub 2019 Jan 18.
While deep learning methods have demonstrated performance comparable to human readers in tasks such as computer-aided diagnosis, these models are difficult to interpret, do not incorporate prior domain knowledge, and are often considered as a "black-box." The lack of model interpretability hinders them from being fully understood by end users such as radiologists. In this paper, we present a novel interpretable deep hierarchical semantic convolutional neural network (HSCNN) to predict whether a given pulmonary nodule observed on a computed tomography (CT) scan is malignant. Our network provides two levels of output: 1) low-level semantic features; and 2) a high-level prediction of nodule malignancy. The low-level outputs reflect diagnostic features often reported by radiologists and serve to explain how the model interprets the images in an expert-interpretable manner. The information from these low-level outputs, along with the representations learned by the convolutional layers, are then combined and used to infer the high-level output. This unified architecture is trained by optimizing a global loss function including both low- and high-level tasks, thereby learning all the parameters within a joint framework. Our experimental results using the Lung Image Database Consortium (LIDC) show that the proposed method not only produces interpretable lung cancer predictions but also achieves significantly better results compared to using a 3D CNN alone.
虽然深度学习方法在计算机辅助诊断等任务中已展现出与人类读者相当的性能,但这些模型难以解释,未纳入先验领域知识,且常被视为“黑匣子”。缺乏模型可解释性阻碍了放射科医生等终端用户对它们的全面理解。在本文中,我们提出了一种新颖的可解释深度分层语义卷积神经网络(HSCNN),用于预测在计算机断层扫描(CT)上观察到的给定肺结节是否为恶性。我们的网络提供两级输出:1)低级语义特征;2)结节恶性程度的高级预测。低级输出反映了放射科医生经常报告的诊断特征,并以专家可解释的方式说明模型如何解释图像。然后,将这些低级输出的信息与卷积层学习到的表示相结合,用于推断高级输出。通过优化包括低级和高级任务的全局损失函数来训练这种统一架构,从而在联合框架内学习所有参数。我们使用肺部影像数据库协会(LIDC)的实验结果表明,所提出的方法不仅能生成可解释的肺癌预测结果,而且与单独使用三维卷积神经网络(3D CNN)相比,还取得了显著更好的结果。