Object Detection in Medical Images Based on Hierarchical Transformer and Mask Mechanism.

Affiliations

School of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha 410082, Hunan, China.

College of Information Engineering, Changsha Medical University, Changsha 410219, Hunan, China.

Publication information

Comput Intell Neurosci. 2022 Aug 4;2022:5863782. doi: 10.1155/2022/5863782. eCollection 2022.

DOI:10.1155/2022/5863782

PMID:35965770

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9371842/

Abstract

Object detection in medical images is challenging in both classification and regression. Because of its importance to computer-aided diagnosis and computer-aided detection, a growing number of researchers are transferring object detection techniques to the medical field. However, existing object detection work does not account for the low resolution of medical images, their high noise levels, or the small size of the objects to be detected. To address this, this paper proposes a new model called the MS Transformer. It uses a self-supervised learning approach that randomly masks the input image and reconstructs the input features, so that the model learns richer feature vectors and filters out excess noise. To focus the model on small objects, the paper introduces a hierarchical transformer in which a sliding window with a local self-attention mechanism assigns higher attention scores to the small objects to be detected. Finally, a single-stage object detection framework predicts, as a set, the bounding-box locations and the classes of the objects to be detected. On the DeepLesion and BCDD benchmark datasets, the proposed model improves performance on multiple evaluation metrics.
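The two mechanisms named in the abstract, random masking of the input and window-local attention, can be illustrated at the level of patch indices. The following is a minimal sketch and not the authors' implementation: the abstract does not give a patch size, mask ratio, or window size, so the values below are assumptions, and the function names `random_patch_mask` and `window_partition` are hypothetical. The window helper uses plain non-overlapping windows, which is a simplification of the sliding-window scheme the abstract describes.

```python
import random


def random_patch_mask(num_patches, mask_ratio, seed=None):
    """Split patch indices into (visible, masked) lists for one image.

    Hypothetical helper: the mask ratio is an assumed hyperparameter,
    not a value taken from the paper.
    """
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")
    rng = random.Random(seed)
    order = list(range(num_patches))
    rng.shuffle(order)
    num_masked = int(num_patches * mask_ratio)
    masked = sorted(order[:num_masked])
    visible = sorted(order[num_masked:])
    return visible, masked


def window_partition(grid_h, grid_w, window):
    """Group the patch indices of a grid_h x grid_w grid into
    non-overlapping window x window blocks (row-major indexing).

    Self-attention would then be computed only among the patches
    that share a block.
    """
    if grid_h % window or grid_w % window:
        raise ValueError("grid size must be divisible by window size")
    windows = []
    for top in range(0, grid_h, window):
        for left in range(0, grid_w, window):
            windows.append([
                (top + r) * grid_w + (left + c)
                for r in range(window)
                for c in range(window)
            ])
    return windows


if __name__ == "__main__":
    # Assumed setting: an 8 x 8 patch grid, 75% of patches masked.
    visible, masked = random_patch_mask(num_patches=64, mask_ratio=0.75, seed=0)
    print(len(visible), len(masked))

    # Assumed setting: 4 x 4 windows over the same grid.
    windows = window_partition(grid_h=8, grid_w=8, window=4)
    print(len(windows), windows[0][:4])
```

In a reconstruction setup of this kind, only the visible patches would be encoded and the model would be trained to recover the masked ones. Restricting attention to a window keeps the cost per patch fixed as the image grows, which is one common motivation for window-local attention.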

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5340/9371842/8253a5715a02/CIN2022-5863782.001.jpg

Similar articles

Automatic creation of annotations for chest radiographs based on the positional information extracted from radiographic image reports.

Comput Methods Programs Biomed. 2021 Sep;209:106331. doi: 10.1016/j.cmpb.2021.106331. Epub 2021 Aug 4.

Object Detection Based on Swin Deformable Transformer-BiPAFPN-YOLOX.

Comput Intell Neurosci. 2023 Mar 9;2023:4228610. doi: 10.1155/2023/4228610. eCollection 2023.

TransEffiDet: Aircraft Detection and Classification in Aerial Images Based on EfficientDet and Transformer.

Comput Intell Neurosci. 2022 Apr 21;2022:2262549. doi: 10.1155/2022/2262549. eCollection 2022.

Focal DETR: Target-Aware Token Design for Transformer-Based Object Detection.

Sensors (Basel). 2022 Nov 10;22(22):8686. doi: 10.3390/s22228686.

A Deep-Learning Model with Task-Specific Bounding Box Regressors and Conditional Back-Propagation for Moving Object Detection in ADAS Applications.

Sensors (Basel). 2020 Sep 15;20(18):5269. doi: 10.3390/s20185269.

FEA-Swin: Foreground Enhancement Attention Swin Transformer Network for Accurate UAV-Based Dense Object Detection.

Sensors (Basel). 2022 Sep 15;22(18):6993. doi: 10.3390/s22186993.

DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection.

Sensors (Basel). 2022 Jun 26;22(13):4833. doi: 10.3390/s22134833.

Ghostformer: A GhostNet-Based Two-Stage Transformer for Small Object Detection.

Sensors (Basel). 2022 Sep 14;22(18):6939. doi: 10.3390/s22186939.

SFOD-Trans: semi-supervised fine-grained object detection framework with transformer module.

Med Biol Eng Comput. 2022 Dec;60(12):3555-3566. doi: 10.1007/s11517-022-02682-1. Epub 2022 Oct 17.

Cited by

Optimizing RetinaNet anchors using differential evolution for improved object detection.

Sci Rep. 2025 Jun 20;15(1):20101. doi: 10.1038/s41598-025-02888-x.

Enhanced nuclear information fusion and visual transformer for pathological breast cancer image classification.

Sci Rep. 2025 Jun 3;15(1):19490. doi: 10.1038/s41598-025-04344-2.

From Binary to Multi-Class Classification: A Two-Step Hybrid CNN-ViT Model for Chest Disease Classification Based on X-Ray Images.

Diagnostics (Basel). 2024 Dec 6;14(23):2754. doi: 10.3390/diagnostics14232754.

Stratum corneum nanotexture feature detection using deep learning and spatial analysis: a noninvasive tool for skin barrier assessment.

Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae095.

Intraoperative detection of parathyroid glands using artificial intelligence: optimizing medical image training with data augmentation methods.

Surg Endosc. 2024 Oct;38(10):5732-5745. doi: 10.1007/s00464-024-11115-z. Epub 2024 Aug 13.

Hybrid Ensemble Deep Learning Model for Advancing Ischemic Brain Stroke Detection and Classification in Clinical Application.

J Imaging. 2024 Jul 2;10(7):160. doi: 10.3390/jimaging10070160.

Multi-branch CNN and grouping cascade attention for medical image classification.

Sci Rep. 2024 Jul 1;14(1):15013. doi: 10.1038/s41598-024-64982-w.

AVD-YOLOv5: a new lightweight network architecture for high-speed aortic valve detection from a new and large echocardiography dataset.

Med Biol Eng Comput. 2024 Aug;62(8):2511-2528. doi: 10.1007/s11517-024-03090-3. Epub 2024 Apr 18.

A survey of the impact of self-supervised pretraining for diagnostic tasks in medical X-ray, CT, MRI, and ultrasound.

BMC Med Imaging. 2024 Apr 6;24(1):79. doi: 10.1186/s12880-024-01253-0.

Recognition of rare antinuclear antibody patterns based on a novel attention-based enhancement framework.

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbad531.

References

The research of aptamer biosensor technologies for detection of microorganism.

Appl Microbiol Biotechnol. 2020 Dec;104(23):9877-9890. doi: 10.1007/s00253-020-10940-1. Epub 2020 Oct 13.

FUNMarker: Fusion Network-Based Method to Identify Prognostic and Heterogeneous Breast Cancer Biomarkers.

IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2483-2491. doi: 10.1109/TCBB.2020.2973148. Epub 2021 Dec 8.

An experimental study on breast lesion detection and classification from ultrasound images using deep learning architectures.

BMC Med Imaging. 2019 Jul 1;19(1):51. doi: 10.1186/s12880-019-0349-x.

Evaluate the Malignancy of Pulmonary Nodules Using the 3-D Deep Leaky Noisy-OR Network.

IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3484-3495. doi: 10.1109/TNNLS.2019.2892409. Epub 2019 Feb 14.

Automatic tumor segmentation in breast ultrasound images using a dilated fully convolutional network combined with an active contour model.

Med Phys. 2019 Jan;46(1):215-228. doi: 10.1002/mp.13268. Epub 2018 Nov 28.

DeepLesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning.

J Med Imaging (Bellingham). 2018 Jul;5(3):036501. doi: 10.1117/1.JMI.5.3.036501. Epub 2018 Jul 20.

Microaneurysm Detection Using Principal Component Analysis and Machine Learning Methods.

IEEE Trans Nanobioscience. 2018 Jul;17(3):191-198. doi: 10.1109/TNB.2018.2840084. Epub 2018 May 24.

Classification of Alzheimer's Disease Based on Eight-Layer Convolutional Neural Network with Leaky Rectified Linear Unit and Max Pooling.

J Med Syst. 2018 Mar 26;42(5):85. doi: 10.1007/s10916-018-0932-7.

On Efficient Feature Ranking Methods for High-Throughput Data Analysis.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Nov-Dec;12(6):1374-84. doi: 10.1109/TCBB.2015.2415790.
