Natural scene reconstruction from fMRI signals using generative latent diffusion.

Affiliations

CerCo, CNRS UMR5549, Toulouse, France.

Universite de Toulouse, Toulouse, France.

Publication information

Sci Rep. 2023 Sep 20;13(1):15666. doi: 10.1038/s41598-023-42891-8.

DOI: 10.1038/s41598-023-42891-8
PMID: 37731047
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10511448/
Abstract

In neural decoding research, one of the most intriguing topics is the reconstruction of perceived natural images based on fMRI signals. Previous studies have succeeded in re-creating different aspects of the visuals, such as low-level properties (shape, texture, layout) or high-level features (category of objects, descriptive semantics of scenes) but have typically failed to reconstruct these properties together for complex scene images. Generative AI has recently made a leap forward with latent diffusion models capable of generating high-complexity images. Here, we investigate how to take advantage of this innovative technology for brain decoding. We present a two-stage scene reconstruction framework called "Brain-Diffuser". In the first stage, starting from fMRI signals, we reconstruct images that capture low-level properties and overall layout using a VDVAE (Very Deep Variational Autoencoder) model. In the second stage, we use the image-to-image framework of a latent diffusion model (Versatile Diffusion) conditioned on predicted multimodal (text and visual) features, to generate final reconstructed images. On the publicly available Natural Scenes Dataset benchmark, our method outperforms previous models both qualitatively and quantitatively. When applied to synthetic fMRI patterns generated from individual ROI (region-of-interest) masks, our trained model creates compelling "ROI-optimal" scenes consistent with neuroscientific knowledge. Thus, the proposed methodology can have an impact on both applied (e.g. brain-computer interface) and fundamental neuroscience.
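The two-stage data flow described in the abstract can be sketched on synthetic data. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how features are predicted from fMRI, so closed-form ridge regression is assumed here, and the feature names, dimensions, and the `reconstruct` helper are hypothetical. The VDVAE decoder and the Versatile Diffusion image-to-image step are represented only by comments.

```python
import numpy as np

def fit_ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression: W = (X^T X + alpha I)^-1 X^T Y."""
    n_features = X.shape[1]
    gram = X.T @ X + alpha * np.eye(n_features)
    return np.linalg.solve(gram, X.T @ Y)

def make_synthetic_split(rng, n_samples, n_voxels, true_maps, noise=0.1):
    """Simulate fMRI patterns and the feature targets they linearly encode."""
    fmri = rng.standard_normal((n_samples, n_voxels))
    targets = {
        name: fmri @ W + noise * rng.standard_normal((n_samples, W.shape[1]))
        for name, W in true_maps.items()
    }
    return fmri, targets

rng = np.random.default_rng(0)
n_voxels = 40
# Hypothetical feature sizes, far smaller than any real model's.
feature_dims = {"vdvae_latents": 16, "visual_features": 8, "text_features": 8}
true_maps = {k: rng.standard_normal((n_voxels, d)) for k, d in feature_dims.items()}

train_fmri, train_targets = make_synthetic_split(rng, 300, n_voxels, true_maps)
test_fmri, test_targets = make_synthetic_split(rng, 50, n_voxels, true_maps)

# One regressor per feature space (an assumption), all reading the same fMRI input.
weights = {k: fit_ridge(train_fmri, train_targets[k]) for k in feature_dims}
predicted = {k: test_fmri @ W for k, W in weights.items()}

def reconstruct(predicted_features, index):
    """Data flow only: the generative models are not implemented here."""
    # Stage 1: a VDVAE decoder would turn these latents into a coarse image
    # that captures low-level properties and overall layout.
    initial_guess = predicted_features["vdvae_latents"][index]
    # Stage 2: image-to-image latent diffusion would refine that image,
    # conditioned on the predicted visual and text features.
    conditioning = np.concatenate([
        predicted_features["visual_features"][index],
        predicted_features["text_features"][index],
    ])
    return initial_guess, conditioning

initial_guess, conditioning = reconstruct(predicted, 0)
```

The point of the sketch is the separation of concerns: the brain-to-feature mapping is fit once per feature space, while the pretrained generative models only consume the predicted features.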
