Dual-Guided Brain Diffusion Model: Natural Image Reconstruction from Human Visual Stimulus fMRI.

Author information

Meng Lu, Yang Chuanhao

Affiliation

College of Information Science and Engineering, Northeastern University, Shenyang 110819, China.

Publication information

Bioengineering (Basel). 2023 Sep 24;10(10):1117. doi: 10.3390/bioengineering10101117.

DOI:10.3390/bioengineering10101117

PMID:37892847

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10604156/

Abstract

The reconstruction of visual stimuli from fMRI signals, which record brain activity, is a challenging task with crucial research value in the fields of neuroscience and machine learning. Previous studies tend to emphasize reconstructing pixel-level features (contours, colors, etc.) or semantic features (object category) of the stimulus image, but typically, these properties are not reconstructed together. In this context, we introduce a novel three-stage visual reconstruction approach called the Dual-guided Brain Diffusion Model (DBDM). Initially, we employ the Very Deep Variational Autoencoder (VDVAE) to reconstruct a coarse image from fMRI data, capturing the underlying details of the original image. Subsequently, the Bootstrapping Language-Image Pre-training (BLIP) model is utilized to provide a semantic annotation for each image. Finally, the image-to-image generation pipeline of the Versatile Diffusion (VD) model is utilized to recover natural images from the fMRI patterns guided by both visual and semantic information. The experimental results demonstrate that DBDM surpasses previous approaches in both qualitative and quantitative comparisons. In particular, the best performance is achieved by DBDM in reconstructing the semantic details of the original image; the Inception, CLIP and SwAV distances are 0.611, 0.225 and 0.405, respectively. This confirms the efficacy of our model and its potential to advance visual decoding research.
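The three-stage data flow described in the abstract can be sketched as follows. This is only an illustrative wiring under stated assumptions, not the authors' implementation: the class `DualGuidedSketch`, the helper `make_toy_pipeline`, and all three stage functions are hypothetical stand-ins (random linear maps and a brightness shift) for the VDVAE, BLIP, and Versatile Diffusion components. The abstract does not say how semantic information is obtained from fMRI at test time, so the sketch simply assumes a decoder from fMRI to a semantic embedding.

```python
from dataclasses import dataclass
from typing import Callable

import numpy as np


@dataclass
class DualGuidedSketch:
    """Hypothetical wiring of the three stages; every callable is a stand-in."""

    coarse_decoder: Callable[[np.ndarray], np.ndarray]       # stage 1: fMRI -> coarse image
    semantic_decoder: Callable[[np.ndarray], np.ndarray]     # stage 2: fMRI -> semantic embedding
    refiner: Callable[[np.ndarray, np.ndarray], np.ndarray]  # stage 3: (coarse, semantic) -> image

    def reconstruct(self, fmri: np.ndarray) -> np.ndarray:
        coarse = self.coarse_decoder(fmri)      # visual guidance
        semantic = self.semantic_decoder(fmri)  # semantic guidance
        return self.refiner(coarse, semantic)   # generation guided by both


def make_toy_pipeline(n_voxels: int, side: int = 8, embed_dim: int = 16,
                      seed: int = 0) -> DualGuidedSketch:
    """Build a toy pipeline from random linear maps (assumption, not the paper's models)."""
    rng = np.random.default_rng(seed)
    w_img = rng.normal(scale=0.1, size=(n_voxels, side * side * 3))
    w_sem = rng.normal(scale=0.1, size=(n_voxels, embed_dim))

    def coarse_decoder(fmri: np.ndarray) -> np.ndarray:
        # A sigmoid keeps the toy pixel values strictly inside (0, 1).
        pixels = 1.0 / (1.0 + np.exp(-(fmri @ w_img)))
        return pixels.reshape(side, side, 3)

    def semantic_decoder(fmri: np.ndarray) -> np.ndarray:
        return fmri @ w_sem

    def refiner(coarse: np.ndarray, semantic: np.ndarray) -> np.ndarray:
        # Toy "guidance": shift brightness by a bounded function of the embedding.
        shift = 0.1 * np.tanh(semantic.mean())
        return np.clip(coarse + shift, 0.0, 1.0)

    return DualGuidedSketch(coarse_decoder, semantic_decoder, refiner)


if __name__ == "__main__":
    fmri = np.random.default_rng(1).normal(size=100)
    image = make_toy_pipeline(n_voxels=100).reconstruct(fmri)
    print(image.shape)
```

Passing the stages in as callables keeps the dual-guidance structure visible: the final stage receives both the coarse image and the semantic signal, and each stand-in could be swapped for a real model without changing `reconstruct`.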

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a16/10604156/3e420274842c/bioengineering-10-01117-g001.jpg
