• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于功能和结构神经成像数据可解释融合的多模态视觉变换器。

A multimodal vision transformer for interpretable fusion of functional and structural neuroimaging data.

作者信息

Bi Yuda, Abrol Anees, Fu Zening, Calhoun Vince D

机构信息

Tri-institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Georgia Tech, Emory, Atlanta, Georgia, USA.

出版信息

Hum Brain Mapp. 2024 Dec 1;45(17):e26783. doi: 10.1002/hbm.26783.

DOI:10.1002/hbm.26783
PMID:39600159
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11599617/
Abstract

Multimodal neuroimaging is an emerging field that leverages multiple sources of information to diagnose specific brain disorders, especially when deep learning-based AI algorithms are applied. The successful combination of different brain imaging modalities using deep learning remains a challenging yet crucial research topic. The integration of structural and functional modalities is particularly important for the diagnosis of various brain disorders, where structural information plays a crucial role in diseases such as Alzheimer's, while functional imaging is more critical for disorders such as schizophrenia. However, the combination of functional and structural imaging modalities can provide a more comprehensive diagnosis. In this work, we present MultiViT, a novel diagnostic deep learning model that utilizes vision transformers and cross-attention mechanisms to effectively fuse information from 3D gray matter maps derived from structural MRI with functional network connectivity matrices obtained from functional MRI using the ICA algorithm. MultiViT achieves an AUC of 0.833, outperforming both our unimodal and multimodal baselines, enabling more accurate classification and diagnosis of schizophrenia. In addition, using vision transformer's unique attentional maps in combination with cross-attentional mechanisms and brain function information, we identify critical brain regions in 3D gray matter space associated with the characteristics of schizophrenia. Our research not only significantly improves the accuracy of AI-based automated imaging diagnostics for schizophrenia, but also pioneers a rational and advanced data fusion approach by replacing complex, high-dimensional fMRI information with functional network connectivity, integrating it with representative structural data from 3D gray matter images, and further providing interpretative biomarker localization in a 3D structural space.

摘要

多模态神经影像学是一个新兴领域,它利用多种信息来源来诊断特定的脑部疾病,特别是在应用基于深度学习的人工智能算法时。使用深度学习成功结合不同的脑成像模态仍然是一个具有挑战性但至关重要的研究课题。结构和功能模态的整合对于各种脑部疾病的诊断尤为重要,其中结构信息在阿尔茨海默病等疾病中起着关键作用,而功能成像对于精神分裂症等疾病更为关键。然而,功能和结构成像模态的结合可以提供更全面的诊断。在这项工作中,我们提出了MultiViT,这是一种新颖的诊断深度学习模型,它利用视觉变换器和交叉注意力机制,有效地将来自结构MRI的3D灰质图信息与使用ICA算法从功能MRI获得的功能网络连接矩阵信息融合在一起。MultiViT的AUC达到0.833,优于我们的单模态和多模态基线,能够更准确地对精神分裂症进行分类和诊断。此外,通过将视觉变换器独特的注意力图与交叉注意力机制和脑功能信息相结合,我们在3D灰质空间中识别出与精神分裂症特征相关的关键脑区。我们的研究不仅显著提高了基于人工智能的精神分裂症自动成像诊断的准确性,而且开创了一种合理且先进的数据融合方法,即通过用功能网络连接取代复杂的高维功能磁共振成像信息,并将其与来自3D灰质图像的代表性结构数据整合,进而在3D结构空间中提供可解释的生物标志物定位。

相似文献

1
A multimodal vision transformer for interpretable fusion of functional and structural neuroimaging data.一种用于功能和结构神经成像数据可解释融合的多模态视觉变换器。
Hum Brain Mapp. 2024 Dec 1;45(17):e26783. doi: 10.1002/hbm.26783.
2
Gray matters: ViT-GAN framework for identifying schizophrenia biomarkers linking structural MRI and functional network connectivity.灰质问题:用于识别连接结构磁共振成像和功能网络连通性的精神分裂症生物标志物的ViT-GAN框架。
Neuroimage. 2024 Aug 15;297:120674. doi: 10.1016/j.neuroimage.2024.120674. Epub 2024 Jun 7.
3
Imaging-genomic spatial-modality attentive fusion for studying neuropsychiatric disorders.影像-基因组空间模态注意力融合用于研究神经精神障碍
Hum Brain Mapp. 2024 Dec 1;45(17):e26799. doi: 10.1002/hbm.26799.
4
Integrating machining learning and multimodal neuroimaging to detect schizophrenia at the level of the individual.将机器学习和多模态神经影像学相结合,以个体水平检测精神分裂症。
Hum Brain Mapp. 2020 Apr 1;41(5):1119-1135. doi: 10.1002/hbm.24863. Epub 2019 Nov 18.
5
Reading the (functional) writing on the (structural) wall: Multimodal fusion of brain structure and function via a deep neural network based translation approach reveals novel impairments in schizophrenia.从(结构)墙上的(功能)文字中读取信息:通过基于深度神经网络的翻译方法对大脑结构和功能进行多模态融合,揭示了精神分裂症的新的损伤。
Neuroimage. 2018 Nov 1;181:734-747. doi: 10.1016/j.neuroimage.2018.07.047. Epub 2018 Jul 25.
6
Parallel group ICA+ICA: Joint estimation of linked functional network variability and structural covariation with application to schizophrenia.平行组独立成分分析(ICA)+ICA:联合估计关联功能网络变异性和结构协变,应用于精神分裂症。
Hum Brain Mapp. 2019 Sep;40(13):3795-3809. doi: 10.1002/hbm.24632. Epub 2019 May 16.
7
Alterations in Gray Matter Structure Linked to Frequency-Specific Cortico-Subcortical Connectivity in Schizophrenia via Multimodal Data Fusion.通过多模态数据融合,灰质结构改变与精神分裂症中特定频率的皮质-皮质下连接性相关。
Neuroinformatics. 2025 Apr 26;23(2):31. doi: 10.1007/s12021-025-09728-3.
8
Three-way parallel group independent component analysis: Fusion of spatial and spatiotemporal magnetic resonance imaging data.三向平行组独立成分分析:空间和时空磁共振成像数据的融合。
Hum Brain Mapp. 2022 Mar;43(4):1280-1294. doi: 10.1002/hbm.25720. Epub 2021 Nov 22.
9
Multimodal fusion of multiple rest fMRI networks and MRI gray matter via parallel multilink joint ICA reveals highly significant function/structure coupling in Alzheimer's disease.多模态融合多个静息态 fMRI 网络和 MRI 灰质通过并行多链路联合 ICA 揭示阿尔茨海默病中具有高度显著的功能/结构耦合。
Hum Brain Mapp. 2023 Oct 15;44(15):5167-5179. doi: 10.1002/hbm.26456. Epub 2023 Aug 22.
10
Multi-scale multimodal deep learning framework for Alzheimer's disease diagnosis.用于阿尔茨海默病诊断的多尺度多模态深度学习框架
Comput Biol Med. 2025 Jan;184:109438. doi: 10.1016/j.compbiomed.2024.109438. Epub 2024 Nov 22.

引用本文的文献

1
TransUNET-DDPM: A transformer-enhanced diffusion model for subject-specific brain network generation and classification.TransUNET-DDPM:一种用于特定个体脑网络生成与分类的基于Transformer增强的扩散模型。
Comput Biol Med. 2025 Aug 28;197(Pt A):110996. doi: 10.1016/j.compbiomed.2025.110996.
2
A Multi-Modal Deep Learning Approach for Predicting Eligibility for Adaptive Radiation Therapy in Nasopharyngeal Carcinoma Patients.一种用于预测鼻咽癌患者适应性放射治疗适宜性的多模态深度学习方法。
Cancers (Basel). 2025 Jul 15;17(14):2350. doi: 10.3390/cancers17142350.
3
A CNN-Transformer Fusion Model for Proactive Detection of Schizophrenia Relapse from EEG Signals.
一种用于从脑电图信号中主动检测精神分裂症复发的卷积神经网络-Transformer融合模型。
Bioengineering (Basel). 2025 Jun 12;12(6):641. doi: 10.3390/bioengineering12060641.
4
AI-powered integration of multimodal imaging in precision medicine for neuropsychiatric disorders.人工智能驱动的多模态成像在神经精神疾病精准医学中的整合
Cell Rep Med. 2025 May 20;6(5):102132. doi: 10.1016/j.xcrm.2025.102132.