• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在SegFormer架构中采用改进的混合注意力机制增强超声心动图中的左心室分割。

Enhancing left ventricular segmentation in echocardiography with a modified mixed attention mechanism in SegFormer architecture.

作者信息

Wu Hanqiong, Qu Gangrong, Xiao Zhifeng, Chunyu Fan

机构信息

Internal Medicine, The First Hospital of Jinzhou Medical University, Jinzhou, 121001, China.

Cardiovascular Medicine, Chongqing General Hospital of the Armed Police Force, Chongqing, 400061, China.

出版信息

Heliyon. 2024 Jul 17;10(15):e34845. doi: 10.1016/j.heliyon.2024.e34845. eCollection 2024 Aug 15.

DOI:10.1016/j.heliyon.2024.e34845
PMID:39170227
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11336270/
Abstract

Echocardiography is a key tool for the diagnosis of cardiac diseases, and accurate left ventricular (LV) segmentation in echocardiographic videos is crucial for the assessment of cardiac function. However, since semantic segmentation of video needs to take into account the temporal correlation between frames, this makes the task very challenging. This article introduces an innovative method that incorporates a modified mixed attention mechanism into the SegFormer architecture, enabling it to effectively grasp the temporal correlation present in video data. The proposed method processes each time series by encoding the image input into the encoder to obtain the current time feature map. This map, along with the historical time feature map, is then fed into a time-sensitive mixed attention mechanism type of convolution block attention module (TCBAM). Its output can serve as the historical time feature map for the subsequent sequence, and a combination of the current time feature map and historical time feature map for the current sequence. The processed feature map is then input into the Multilayer Perceptron (MLP) and subsequent networks to generate the final segmented image. Through extensive experiments conducted on two different datasets: Hamad Medical Corporation, Tampere University, and Qatar University (HMC-QU), Cardiac Acquisitions for Multi-structure Ultrasound Segmentation (CAMUS) and Sunnybrook Cardiac Data (SCD), achieving a Dice coefficient of 97.92 % on the SCD dataset and an F1 score of 0.9263 on the CAMUS dataset, outperforming all other models. This research provides a promising solution to the temporal modeling challenge in video semantic segmentation tasks using transformer-based models and points out a promising direction for future research in this field.

摘要

超声心动图是诊断心脏疾病的关键工具,而在超声心动图视频中准确进行左心室(LV)分割对于评估心脏功能至关重要。然而,由于视频的语义分割需要考虑帧之间的时间相关性,这使得该任务极具挑战性。本文介绍了一种创新方法,该方法将改进的混合注意力机制融入SegFormer架构,使其能够有效把握视频数据中存在的时间相关性。所提出的方法通过将图像输入编码到编码器中来处理每个时间序列,以获得当前时间特征图。然后,将此图与历史时间特征图一起输入到一种对时间敏感的混合注意力机制类型的卷积块注意力模块(TCBAM)中。其输出可作为后续序列的历史时间特征图,以及当前序列的当前时间特征图与历史时间特征图的组合。然后将处理后的特征图输入到多层感知器(MLP)和后续网络中,以生成最终的分割图像。通过在两个不同数据集上进行的广泛实验:哈马德医疗公司、坦佩雷大学和卡塔尔大学(HMC - QU)、用于多结构超声分割的心脏采集(CAMUS)以及桑尼布鲁克心脏数据(SCD),在SCD数据集上实现了97.92%的Dice系数,在CAMUS数据集上实现了0.9263的F1分数,优于所有其他模型。本研究为使用基于Transformer的模型解决视频语义分割任务中的时间建模挑战提供了一个有前景的解决方案,并为该领域未来的研究指出了一个有前景的方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/67ca7e27ff87/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/c28adcf21154/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/28ac044cdb16/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/c95a3113c866/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/a700989160be/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/4452f82d753a/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/bb60afd52ee1/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/fb08c5a78524/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/1b26328fc501/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/17998983ee57/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/bb6cbdb55295/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/67ca7e27ff87/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/c28adcf21154/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/28ac044cdb16/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/c95a3113c866/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/a700989160be/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/4452f82d753a/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/bb60afd52ee1/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/fb08c5a78524/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/1b26328fc501/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/17998983ee57/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/bb6cbdb55295/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2aa/11336270/67ca7e27ff87/gr11.jpg

相似文献

1
Enhancing left ventricular segmentation in echocardiography with a modified mixed attention mechanism in SegFormer architecture.在SegFormer架构中采用改进的混合注意力机制增强超声心动图中的左心室分割。
Heliyon. 2024 Jul 17;10(15):e34845. doi: 10.1016/j.heliyon.2024.e34845. eCollection 2024 Aug 15.
2
Boundary attention with multi-task consistency constraints for semi-supervised 2D echocardiography segmentation.基于多任务一致性约束的边界注意力的半监督二维超声心动图分割。
Comput Biol Med. 2024 Mar;171:108100. doi: 10.1016/j.compbiomed.2024.108100. Epub 2024 Feb 5.
3
Feature-guided attention network for medical image segmentation.基于特征引导的注意力网络的医学图像分割。
Med Phys. 2023 Aug;50(8):4871-4886. doi: 10.1002/mp.16253. Epub 2023 Feb 16.
4
Semi-supervised segmentation of echocardiography videos via noise-resilient spatiotemporal semantic calibration and fusion.基于抗噪时空语义校准和融合的超声心动图视频半监督分割。
Med Image Anal. 2022 May;78:102397. doi: 10.1016/j.media.2022.102397. Epub 2022 Feb 25.
5
Left Ventricle Segmentation in Echocardiography with Transformer.基于Transformer的超声心动图左心室分割
Diagnostics (Basel). 2023 Jul 13;13(14):2365. doi: 10.3390/diagnostics13142365.
6
D-SAT: dual semantic aggregation transformer with dual attention for medical image segmentation.D-SAT:用于医学图像分割的具有双重注意力的双重语义聚合转换器。
Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/acf2e5.
7
MAEF-Net: Multi-attention efficient feature fusion network for left ventricular segmentation and quantitative analysis in two-dimensional echocardiography.MAEF-Net:用于二维超声心动图中左心室分割和定量分析的多注意有效特征融合网络。
Ultrasonics. 2023 Jan;127:106855. doi: 10.1016/j.ultras.2022.106855. Epub 2022 Oct 1.
8
[An attention-guided network for bilateral ventricular segmentation in pediatric echocardiography].[一种用于儿科超声心动图双侧心室分割的注意力引导网络]
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Oct 25;40(5):928-937. doi: 10.7507/1001-5515.202304038.
9
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
10
[Lung parenchyma segmentation based on double scale parallel attention network].基于双尺度并行注意力网络的肺实质分割
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2022 Aug 25;39(4):721-729. doi: 10.7507/1001-5515.202108005.

本文引用的文献

1
Left Ventricle Segmentation in Echocardiography with Transformer.基于Transformer的超声心动图左心室分割
Diagnostics (Basel). 2023 Jul 13;13(14):2365. doi: 10.3390/diagnostics13142365.
2
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment.用于全参考和无参考视听质量评估的注意力引导神经网络。
IEEE Trans Image Process. 2023;32:1882-1896. doi: 10.1109/TIP.2023.3251695. Epub 2023 Mar 21.
3
MITEA: A dataset for machine learning segmentation of the left ventricle in 3D echocardiography using subject-specific labels from cardiac magnetic resonance imaging.
MITEA:一个用于三维超声心动图中左心室机器学习分割的数据集,使用来自心脏磁共振成像的特定受试者标签。
Front Cardiovasc Med. 2023 Jan 10;9:1016703. doi: 10.3389/fcvm.2022.1016703. eCollection 2022.
4
Improved Multi-Stream Convolutional Block Attention Module for sEMG-Based Gesture Recognition.用于基于表面肌电信号的手势识别的改进型多流卷积块注意力模块
Front Bioeng Biotechnol. 2022 Jun 7;10:909023. doi: 10.3389/fbioe.2022.909023. eCollection 2022.
5
Motion-Driven Visual Tempo Learning for Video-Based Action Recognition.基于运动驱动的视觉节奏学习的视频动作识别。
IEEE Trans Image Process. 2022;31:4104-4116. doi: 10.1109/TIP.2022.3180585. Epub 2022 Jun 20.
6
UNet++: A Nested U-Net Architecture for Medical Image Segmentation.U-Net++:一种用于医学图像分割的嵌套U-Net架构。
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.
7
Deep Learning for Segmentation Using an Open Large-Scale Dataset in 2D Echocardiography.基于二维超声心动图大型公开数据集的深度学习分割方法
IEEE Trans Med Imaging. 2019 Sep;38(9):2198-2210. doi: 10.1109/TMI.2019.2900516. Epub 2019 Feb 22.