He Qiqi, Yang Qiuju, Xie Minghao
School of Physics and Information Technology, Shaanxi Normal University, Xi'an, China.
School of Physics and Information Technology, Shaanxi Normal University, Xi'an, China.
Comput Biol Med. 2023 Mar;155:106629. doi: 10.1016/j.compbiomed.2023.106629. Epub 2023 Feb 9.
Automatic breast ultrasound image segmentation helps radiologists to improve the accuracy of breast cancer diagnosis. In recent years, the convolutional neural networks (CNNs) have achieved great success in medical image analysis. However, it exhibits limitations in modeling long-range relations, which is unfavorable for ultrasound images with speckle noise and shadows, resulting in decreased accuracy of breast lesion segmentation. Transformer can obtain sufficient global information, but it is deficient in acquiring local details and needs to be pre-trained on large-scale datasets. In this paper, we propose a Hybrid CNN-Transformer network (HCTNet) for boosting the breast lesion segmentation in ultrasound images. In the encoder of HCTNet, Transformer Encoder Blocks (TEBlocks) are designed to learn the global contextual information, which are combined with CNNs to extract features. In the decoder of HCTNet, a Spatial-wise Cross Attention (SCA) module is developed based on the spatial attention mechanism, which reduces the semantic discrepancy with the encoder. Moreover, residual connection is used between decoder blocks to make the generated features more discriminative by aggregating contextual feature maps at different semantic scales. Extensive experiments on three public breast ultrasound datasets demonstrate that HCTNet outperforms other medical image segmentation methods and the recent semantic segmentation methods on breast ultrasound lesion segmentation.
自动乳腺超声图像分割有助于放射科医生提高乳腺癌诊断的准确性。近年来,卷积神经网络(CNN)在医学图像分析中取得了巨大成功。然而,它在对长距离关系进行建模时存在局限性,这对于带有斑点噪声和阴影的超声图像不利,导致乳腺病变分割的准确性下降。Transformer可以获得足够的全局信息,但在获取局部细节方面存在不足,并且需要在大规模数据集上进行预训练。在本文中,我们提出了一种混合CNN-Transformer网络(HCTNet),用于提升超声图像中的乳腺病变分割。在HCTNet的编码器中,Transformer编码器块(TEBlocks)被设计用于学习全局上下文信息,这些信息与CNN相结合以提取特征。在HCTNet的解码器中,基于空间注意力机制开发了一种空间交叉注意力(SCA)模块,该模块减少了与编码器的语义差异。此外,在解码器块之间使用了残差连接,通过聚合不同语义尺度的上下文特征图,使生成的特征更具判别力。在三个公开的乳腺超声数据集上进行的大量实验表明,HCTNet在乳腺超声病变分割方面优于其他医学图像分割方法和最近的语义分割方法。