• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于混合Transformer-CNN和表格Transformer的新型多模态肺栓塞计算机辅助诊断模型。

A novel multimodal computer-aided diagnostic model for pulmonary embolism based on hybrid transformer-CNN and tabular transformer.

作者信息

Zhang Wei, Gu Yu, Ma Hao, Yang Lidong, Zhang Baohua, Wang Jing, Chen Meng, Lu Xiaoqi, Li Jianjun, Liu Xin, Yu Dahua, Zhao Ying, Tang Siyuan, He Qun

机构信息

School of Digital and Intelligent Industry, Inner Mongolia University of Science and Technology, Baotou, 014010, China.

School of Automation and Electrical Engineering, Inner Mongolia University of Science and Technology, Baotou, 014010, China.

出版信息

Phys Eng Sci Med. 2025 May 24. doi: 10.1007/s13246-025-01568-4.

DOI:10.1007/s13246-025-01568-4
PMID:40411540
Abstract

Pulmonary embolism (PE) is a life-threatening clinical problem where early diagnosis and prompt treatment are essential to reducing morbidity and mortality. While the combination of CT images and electronic health records (EHR) can help improve computer-aided diagnosis, there are many challenges that need to be addressed. The primary objective of this study is to leverage both 3D CT images and EHR data to improve PE diagnosis. First, for 3D CT images, we propose a network combining Swin Transformers with 3D CNNs, enhanced by a Multi-Scale Feature Fusion (MSFF) module to address fusion challenges between different encoders. Secondly, we introduce a Polarized Self-Attention (PSA) module to enhance the attention mechanism within the 3D CNN. And then, for EHR data, we design the Tabular Transformer for effective feature extraction. Finally, we design and evaluate three multimodal attention fusion modules to integrate CT and EHR features, selecting the most effective one for final fusion. Experimental results on the RadFusion dataset demonstrate that our model significantly outperforms existing state-of-the-art methods, achieving an AUROC of 0.971, an F1 score of 0.926, and an accuracy of 0.920. These results underscore the effectiveness and innovation of our multimodal approach in advancing PE diagnosis.

摘要

肺栓塞(PE)是一个危及生命的临床问题,早期诊断和及时治疗对于降低发病率和死亡率至关重要。虽然CT图像和电子健康记录(EHR)的结合有助于改善计算机辅助诊断,但仍有许多挑战需要解决。本研究的主要目标是利用3D CT图像和EHR数据来改善PE诊断。首先,对于3D CT图像,我们提出了一种将Swin Transformer与3D CNN相结合的网络,并通过多尺度特征融合(MSFF)模块进行增强,以解决不同编码器之间的融合挑战。其次,我们引入了极化自注意力(PSA)模块来增强3D CNN中的注意力机制。然后,对于EHR数据,我们设计了表格Transformer进行有效的特征提取。最后,我们设计并评估了三个多模态注意力融合模块,以整合CT和EHR特征,选择最有效的一个进行最终融合。在RadFusion数据集上的实验结果表明,我们的模型显著优于现有的最先进方法,AUROC为0.971,F1分数为0.926,准确率为0.920。这些结果强调了我们的多模态方法在推进PE诊断方面的有效性和创新性。

相似文献

1
A novel multimodal computer-aided diagnostic model for pulmonary embolism based on hybrid transformer-CNN and tabular transformer.一种基于混合Transformer-CNN和表格Transformer的新型多模态肺栓塞计算机辅助诊断模型。
Phys Eng Sci Med. 2025 May 24. doi: 10.1007/s13246-025-01568-4.
2
Seeking an optimal approach for Computer-aided Diagnosis of Pulmonary Embolism.寻求肺栓塞计算机辅助诊断的最佳方法。
Med Image Anal. 2024 Jan;91:102988. doi: 10.1016/j.media.2023.102988. Epub 2023 Oct 13.
3
Enhanced Pneumonia Detection in Chest X-Rays Using Hybrid Convolutional and Vision Transformer Networks.使用混合卷积和视觉Transformer网络增强胸部X光片中的肺炎检测
Curr Med Imaging. 2025;21:e15734056326685. doi: 10.2174/0115734056326685250101113959.
4
LumVertCancNet: A novel 3D lumbar vertebral body cancellous bone location and segmentation method based on hybrid Swin-transformer.LumVertCancNet:一种基于混合 Swin-Transformer 的新型 3D 腰椎松质骨定位与分割方法。
Comput Biol Med. 2024 Mar;171:108237. doi: 10.1016/j.compbiomed.2024.108237. Epub 2024 Feb 28.
5
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
6
Joint fusion of EHR and ECG data using attention-based CNN and ViT for predicting adverse clinical endpoints in percutaneous coronary intervention patients.使用基于注意力的卷积神经网络(CNN)和视觉Transformer(ViT)对电子健康记录(EHR)和心电图(ECG)数据进行联合融合,以预测经皮冠状动脉介入治疗患者的不良临床终点。
Comput Biol Med. 2025 May;189:109966. doi: 10.1016/j.compbiomed.2025.109966. Epub 2025 Mar 5.
7
Dual encoder network with transformer-CNN for multi-organ segmentation.基于 Transformer-CNN 的双编码器网络的多器官分割。
Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.
8
BAF-Net: Bidirectional attention fusion network CNN and transformers for the pepper leaf segmentation.BAF-Net:用于辣椒叶片分割的双向注意力融合网络(结合卷积神经网络和变换器)
Front Plant Sci. 2023 Mar 27;14:1123410. doi: 10.3389/fpls.2023.1123410. eCollection 2023.
9
Multi-task approach based on combined CNN-transformer for efficient segmentation and classification of breast tumors in ultrasound images.基于卷积神经网络(CNN)与变换器(Transformer)相结合的多任务方法用于超声图像中乳腺肿瘤的高效分割与分类
Vis Comput Ind Biomed Art. 2024 Jan 26;7(1):2. doi: 10.1186/s42492-024-00155-w.
10
Swin-GA-RF: genetic algorithm-based Swin Transformer and random forest for enhancing cervical cancer classification.Swin-GA-RF:基于遗传算法的Swin Transformer和随机森林用于增强宫颈癌分类
Front Oncol. 2024 Jul 19;14:1392301. doi: 10.3389/fonc.2024.1392301. eCollection 2024.

本文引用的文献

1
A Deep Learning-Based Automatic Segmentation and 3D Visualization Technique for Intracranial Hemorrhage Detection Using Computed Tomography Images.一种基于深度学习的自动分割与三维可视化技术,用于利用计算机断层扫描图像检测颅内出血。
Diagnostics (Basel). 2023 Jul 31;13(15):2537. doi: 10.3390/diagnostics13152537.
2
A transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics.一种基于变压器的表示学习模型,可统一处理临床诊断的多模态输入。
Nat Biomed Eng. 2023 Jun;7(6):743-755. doi: 10.1038/s41551-023-01045-x. Epub 2023 Jun 12.
3
Deep Learning-Based Algorithm for Automatic Detection of Pulmonary Embolism in Chest CT Angiograms.
基于深度学习的胸部CT血管造影中肺栓塞自动检测算法
Diagnostics (Basel). 2023 Apr 3;13(7):1324. doi: 10.3390/diagnostics13071324.
4
A multitask deep learning approach for pulmonary embolism detection and identification.一种用于肺栓塞检测和识别的多任务深度学习方法。
Sci Rep. 2022 Jul 29;12(1):13087. doi: 10.1038/s41598-022-16976-9.
5
MATR: Multimodal Medical Image Fusion via Multiscale Adaptive Transformer.MATR:基于多尺度自适应变换的多模态医学图像融合。
IEEE Trans Image Process. 2022;31:5134-5149. doi: 10.1109/TIP.2022.3193288. Epub 2022 Aug 2.
6
Physics-constrained deep active learning for spatiotemporal modeling of cardiac electrodynamics.物理约束的深度主动学习在心脏电动力学时空建模中的应用。
Comput Biol Med. 2022 Jul;146:105586. doi: 10.1016/j.compbiomed.2022.105586. Epub 2022 May 10.
7
Medical image augmentation for lesion detection using a texture-constrained multichannel progressive GAN.基于纹理约束多通道渐进式 GAN 的医学图像病灶检测中的图像增强方法。
Comput Biol Med. 2022 Jun;145:105444. doi: 10.1016/j.compbiomed.2022.105444. Epub 2022 Mar 30.
8
Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening.利用心电图信号开发机器学习模型以改善急性肺栓塞筛查
Eur Heart J Digit Health. 2021 Nov 25;3(1):56-66. doi: 10.1093/ehjdh/ztab101. eCollection 2022 Mar.
9
Automated detection of pulmonary embolism from CT-angiograms using deep learning.基于深度学习的 CT 血管造影肺动脉栓塞自动检测。
BMC Med Imaging. 2022 Mar 14;22(1):43. doi: 10.1186/s12880-022-00763-z.
10
A Survey on Vision Transformer.视觉Transformer综述
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):87-110. doi: 10.1109/TPAMI.2022.3152247. Epub 2022 Dec 5.