IEEE Trans Med Imaging. 2021 Sep;40(9):2439-2451. doi: 10.1109/TMI.2021.3078370. Epub 2021 Aug 31.
In recent years, deep learning has been widely used in breast cancer diagnosis, and many high-performance models have emerged. However, most of the existing deep learning models are mainly based on static breast ultrasound (US) images. In actual diagnostic process, contrast-enhanced ultrasound (CEUS) is a commonly used technique by radiologists. Compared with static breast US images, CEUS videos can provide more detailed blood supply information of tumors, and therefore can help radiologists make a more accurate diagnosis. In this paper, we propose a novel diagnosis model based on CEUS videos. The backbone of the model is a 3D convolutional neural network. More specifically, we notice that radiologists generally follow two specific patterns when browsing CEUS videos. One pattern is that they focus on specific time slots, and the other is that they pay attention to the differences between the CEUS frames and the corresponding US images. To incorporate these two patterns into our deep learning model, we design a domain-knowledge-guided temporal attention module and a channel attention module. We validate our model on our Breast-CEUS dataset composed of 221 cases. The result shows that our model can achieve a sensitivity of 97.2% and an accuracy of 86.3%. In particular, the incorporation of domain knowledge leads to a 3.5% improvement in sensitivity and a 6.0% improvement in specificity. Finally, we also prove the validity of two domain knowledge modules in the 3D convolutional neural network (C3D) and the 3D ResNet (R3D).
近年来,深度学习在乳腺癌诊断中得到了广泛应用,涌现出了许多高性能模型。然而,现有的大多数深度学习模型主要基于静态乳腺超声(US)图像。在实际诊断过程中,超声造影(CEUS)是放射科医生常用的技术。与静态乳腺 US 图像相比,CEUS 视频可以提供更详细的肿瘤血供信息,因此可以帮助放射科医生做出更准确的诊断。在本文中,我们提出了一种基于 CEUS 视频的新型诊断模型。该模型的骨干是一个 3D 卷积神经网络。更具体地说,我们注意到放射科医生在浏览 CEUS 视频时通常遵循两种特定模式。一种模式是他们关注特定的时间槽,另一种模式是他们关注 CEUS 帧与相应的 US 图像之间的差异。为了将这两种模式纳入我们的深度学习模型,我们设计了一个基于领域知识的时间注意力模块和一个通道注意力模块。我们在由 221 例组成的 Breast-CEUS 数据集上验证了我们的模型。结果表明,我们的模型可以达到 97.2%的灵敏度和 86.3%的准确率。特别是,领域知识的引入使灵敏度提高了 3.5%,特异性提高了 6.0%。最后,我们还证明了两个领域知识模块在 3D 卷积神经网络(C3D)和 3D ResNet(R3D)中的有效性。