基于层次多视图表示的脑活动重建可控人脸。

Reconstructing controllable faces from brain activity with hierarchical multiview representations.

机构信息

School of Electronic Engineering, Xidian University, Xi'an 710071, China.

Group 42 (G42), Abu Dhabi, United Arab Emirates.

出版信息

Neural Netw. 2023 Sep;166:487-500. doi: 10.1016/j.neunet.2023.07.016. Epub 2023 Jul 28.

DOI:10.1016/j.neunet.2023.07.016

PMID:37574622

Abstract

Reconstructing visual experience from brain responses measured by functional magnetic resonance imaging (fMRI) is a challenging yet important research topic in brain decoding, especially it has proved more difficult to decode visually similar stimuli, such as faces. Although face attributes are known as the key to face recognition, most existing methods generally ignore how to decode facial attributes more precisely in perceived face reconstruction, which often leads to indistinguishable reconstructed faces. To solve this problem, we propose a novel neural decoding framework called VSPnet (voxel2style2pixel) by establishing hierarchical encoding and decoding networks with disentangled latent representations as media, so that to recover visual stimuli more elaborately. And we design a hierarchical visual encoder (named HVE) to pre-extract features containing both high-level semantic knowledge and low-level visual details from stimuli. The proposed VSPnet consists of two networks: Multi-branch cognitive encoder and style-based image generator. The encoder network is constructed by multiple linear regression branches to map brain signals to the latent space provided by the pre-extracted visual features and obtain representations containing hierarchical information consistent to the corresponding stimuli. We make the generator network inspired by StyleGAN to untangle the complexity of fMRI representations and generate images. And the HVE network is composed of a standard feature pyramid over a ResNet backbone. Extensive experimental results on the latest public datasets have demonstrated the reconstruction accuracy of our proposed method outperforms the state-of-the-art approaches and the identifiability of different reconstructed faces has been greatly improved. In particular, we achieve feature editing for several facial attributes in fMRI domain based on the multiview (i.e., visual stimuli and evoked fMRI) latent representations.

摘要

从功能磁共振成像 (fMRI) 测量的大脑反应中重建视觉体验是脑解码中的一个具有挑战性但很重要的研究课题，特别是在解码视觉上相似的刺激（如人脸）时，这已经被证明更加困难。尽管面部属性是人脸识别的关键，但大多数现有的方法通常忽略了如何在感知的人脸重建中更精确地解码面部属性，这往往导致重建的人脸无法区分。为了解决这个问题，我们提出了一种名为 VSPnet（voxel2style2pixel）的新型神经解码框架，通过建立具有分离潜在表示的分层编码和解码网络作为媒介，从而更精细地恢复视觉刺激。我们设计了一个分层视觉编码器（命名为 HVE），从刺激中预先提取包含高层语义知识和底层视觉细节的特征。所提出的 VSPnet 由两个网络组成：多分支认知编码器和基于风格的图像生成器。编码器网络由多个线性回归分支构建，将大脑信号映射到由预先提取的视觉特征提供的潜在空间，并获得包含与相应刺激一致的分层信息的表示。我们受 StyleGAN 的启发构建了生成器网络，以解开 fMRI 表示的复杂性并生成图像。并且 HVE 网络由一个基于 ResNet 骨干的标准特征金字塔组成。在最新的公共数据集上进行的广泛实验结果表明，我们提出的方法的重建准确性优于最先进的方法，并且不同重建人脸的可识别性得到了大大提高。特别是，我们基于多视图（即视觉刺激和诱发 fMRI）潜在表示，在 fMRI 域中实现了几个面部属性的特征编辑。

相似文献

Reconstructing controllable faces from brain activity with hierarchical multiview representations.基于层次多视图表示的脑活动重建可控人脸。

Neural Netw. 2023 Sep;166:487-500. doi: 10.1016/j.neunet.2023.07.016. Epub 2023 Jul 28.

Hyperrealistic neural decoding for reconstructing faces from fMRI activations via the GAN latent space.基于 GAN 潜在空间从 fMRI 激活中重建人脸的超真实神经解码。

Sci Rep. 2022 Jan 7;12(1):141. doi: 10.1038/s41598-021-03938-w.

Retrieving and reconstructing conceptually similar images from fMRI with latent diffusion models and a neuro-inspired brain decoding model.使用潜在扩散模型和神经启发式脑解码模型从功能磁共振成像中检索和重建概念上相似的图像。

J Neural Eng. 2024 Jun 28;21(4). doi: 10.1088/1741-2552/ad593c.

Finding Distributed Needles in Neural Haystacks.在神经干草堆中寻找分布式的针。

J Neurosci. 2021 Feb 3;41(5):1019-1032. doi: 10.1523/JNEUROSCI.0904-20.2020. Epub 2020 Dec 17.

Transfer learning of deep neural network representations for fMRI decoding.基于深度神经网络表示的 fMRI 解码的迁移学习。

J Neurosci Methods. 2019 Dec 1;328:108319. doi: 10.1016/j.jneumeth.2019.108319. Epub 2019 Oct 1.

Reconstructing seen image from brain activity by visually-guided cognitive representation and adversarial learning.通过视觉引导的认知表示和对抗学习从大脑活动中重建可见图像。

Neuroimage. 2021 Mar;228:117602. doi: 10.1016/j.neuroimage.2020.117602. Epub 2021 Jan 1.

Reconstructing Perceived and Retrieved Faces from Activity Patterns in Lateral Parietal Cortex.从顶叶外侧皮质的活动模式重建感知和检索到的面孔。

J Neurosci. 2016 Jun 1;36(22):6069-82. doi: 10.1523/JNEUROSCI.4286-15.2016.

Brain2GAN: Feature-disentangled neural encoding and decoding of visual perception in the primate brain.脑到生成对抗网络：灵长类动物大脑中视觉感知的特征解缠神经编码和解码。

PLoS Comput Biol. 2024 May 6;20(5):e1012058. doi: 10.1371/journal.pcbi.1012058. eCollection 2024 May.

Functional Alignment-Auxiliary Generative Adversarial Network-Based Visual Stimuli Reconstruction via Multi-Subject fMRI.基于功能对齐-辅助生成对抗网络的多体 fMRI 视觉刺激重构。

IEEE Trans Neural Syst Rehabil Eng. 2023;31:2715-2725. doi: 10.1109/TNSRE.2023.3283405. Epub 2023 Jun 20.

Reconstructing Perceived Images From Human Brain Activities With Bayesian Deep Multiview Learning.基于贝叶斯深度学习多视图重建人类大脑活动的感知图像。

IEEE Trans Neural Netw Learn Syst. 2019 Aug;30(8):2310-2323. doi: 10.1109/TNNLS.2018.2882456. Epub 2018 Dec 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于层次多视图表示的脑活动重建可控人脸。

Reconstructing controllable faces from brain activity with hierarchical multiview representations.

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献