Tian Ye, Zhu Jingqiang, Zhang Lei, Mou Lichao, Zhu Xiaoxiang, Shi Yilei, Ma Buyun, Zhao Wanjun
Department of Ultrasonography, West China Hospital of Sichuan University.
Department of Thyroid Surgery, West China Hospital of Sichuan University.
J Vis Exp. 2023 Apr 21;(194). doi: 10.3791/64480.
In recent years, the incidence of thyroid cancer has been increasing. Thyroid nodule detection is critical for both the diagnosis and treatment of thyroid cancer. Convolutional neural networks (CNNs) have achieved good results in thyroid ultrasound image analysis tasks. However, due to the limited effective receptive field of convolutional layers, CNNs fail to capture long-range contextual dependencies, which are important for identifying thyroid nodules in ultrasound images. Transformer networks are effective in capturing long-range contextual information. Inspired by this, we propose a novel thyroid nodule detection method that combines a Swin Transformer backbone with Faster R-CNN. Specifically, an ultrasound image is first projected into a 1D sequence of patch embeddings, which are then fed into a hierarchical Swin Transformer. The Swin Transformer backbone extracts features at five different scales by utilizing shifted windows for the computation of self-attention. Subsequently, a feature pyramid network (FPN) is used to fuse the features from the different scales. Finally, a detection head predicts bounding boxes and the corresponding confidence scores. Data collected from 2,680 patients were used to conduct the experiments, and the results showed that this method achieved the best mAP score of 44.8%, outperforming CNN-based baselines. In addition, the method achieved higher sensitivity (90.5%) than the competing approaches. This indicates that the context modeling in this model is effective for thyroid nodule detection.
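To make the shifted-window idea concrete, the following is a minimal NumPy sketch (not the authors' code) of how a feature map is partitioned into non-overlapping windows for self-attention, and how the alternate layers cyclically shift the map by half a window so that information can flow across window boundaries. The function names and the toy feature-map sizes are illustrative assumptions.

```python
import numpy as np

def window_partition(x, window_size):
    # Split an (H, W, C) feature map into non-overlapping
    # (window_size x window_size) windows; self-attention is then
    # computed independently within each window.
    H, W, C = x.shape
    x = x.reshape(H // window_size, window_size,
                  W // window_size, window_size, C)
    # -> (num_windows, tokens_per_window, C)
    return x.transpose(0, 2, 1, 3, 4).reshape(-1, window_size * window_size, C)

def shifted_window_partition(x, window_size):
    # Cyclically shift the map by half a window before partitioning,
    # as in Swin's alternating SW-MSA layers, so each new window
    # spans pixels from several windows of the previous layer.
    shift = window_size // 2
    shifted = np.roll(x, shift=(-shift, -shift), axis=(0, 1))
    return window_partition(shifted, window_size)

# Toy example: an 8x8 feature map with 4 channels and 4x4 windows.
feat = np.arange(8 * 8 * 4, dtype=np.float32).reshape(8, 8, 4)
regular = window_partition(feat, 4)          # 4 windows of 16 tokens each
shifted = shifted_window_partition(feat, 4)  # same count, different grouping
print(regular.shape, shifted.shape)          # (4, 16, 4) (4, 16, 4)
```

In the full model, a windowed multi-head self-attention layer operates on the token axis of each window, and the cyclic shift is undone afterwards; this sketch only shows the regrouping step that gives the backbone its long-range context at low cost.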
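The FPN fusion step can likewise be sketched in a few lines. The snippet below is a simplified, untrained illustration (assumed names, random 1x1 projections, nearest-neighbour upsampling) of the top-down pathway that merges coarse, semantically strong features into the finer scales before the detection head runs.

```python
import numpy as np

def upsample2x(x):
    # Nearest-neighbour 2x upsampling of an (H, W, C) map.
    return x.repeat(2, axis=0).repeat(2, axis=1)

def fpn_fuse(features, out_channels, rng):
    # Top-down FPN fusion: project each backbone scale to a common
    # channel width with a 1x1 projection (random here, learned in
    # practice), then add each upsampled coarser map into the next
    # finer lateral map. `features` is ordered fine -> coarse.
    laterals = []
    for f in features:
        w = rng.standard_normal((f.shape[-1], out_channels)).astype(np.float32)
        laterals.append(f @ w)  # a 1x1 conv is a per-pixel matmul
    fused = [laterals[-1]]      # start from the coarsest level
    for lat in reversed(laterals[:-1]):
        fused.insert(0, lat + upsample2x(fused[0]))
    return fused

# Toy pyramid: three scales with doubling channel width.
rng = np.random.default_rng(0)
feats = [rng.standard_normal((16, 16, 32)).astype(np.float32),
         rng.standard_normal((8, 8, 64)).astype(np.float32),
         rng.standard_normal((4, 4, 128)).astype(np.float32)]
fused = fpn_fuse(feats, 256, rng)
print([f.shape for f in fused])  # [(16, 16, 256), (8, 8, 256), (4, 4, 256)]
```

Each fused map keeps its spatial resolution but carries a uniform channel width, so a single detection head can slide over all scales to predict bounding boxes and confidence scores for nodules of different sizes.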