National Digital Switching System Engineering and Technological Research Center, Zhengzhou, 450000 China.
Center for Magnetic Resonance Imaging, Department of Neuroscience, University of Minnesota at Twin Cities, 55108 MN, USA.
J Neurosci Methods. 2019 Sep 1;325:108318. doi: 10.1016/j.jneumeth.2019.108318. Epub 2019 Jun 27.
Building visual encoding models to accurately predict visual responses is a central challenge for current vision-based brain-machine interface techniques. To achieve high prediction accuracy on neural signals, visual encoding models should include precise visual features and appropriate prediction algorithms. Most existing visual encoding models employ hand-crafted visual features (e.g., Gabor wavelets or semantic labels) or data-driven features (e.g., features extracted from deep neural networks (DNN)). They also assume a linear mapping from feature representations to brain activity. However, it remains unknown whether such a linear mapping is sufficient for maximizing prediction accuracy.
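The conventional linear encoding model mentioned above can be sketched as ridge regression from stimulus features to voxel responses. The sketch below is illustrative only, not the paper's implementation: the feature matrix and voxel responses are synthetic stand-ins, and the regularization strength `alpha = 1.0` is an arbitrary assumption.

```python
import numpy as np

rng = np.random.default_rng(0)
n_stim, n_feat, n_vox = 200, 64, 10
X = rng.standard_normal((n_stim, n_feat))          # stimulus features (e.g., Gabor or DNN features)
W_true = rng.standard_normal((n_feat, n_vox)) * 0.3  # hypothetical ground-truth weights
Y = X @ W_true + 0.1 * rng.standard_normal((n_stim, n_vox))  # simulated voxel responses

Xtr, Ytr, Xte, Yte = X[:150], Y[:150], X[150:], Y[150:]
alpha = 1.0
# Closed-form ridge solution: W = (X'X + alpha*I)^-1 X'Y
W_hat = np.linalg.solve(Xtr.T @ Xtr + alpha * np.eye(n_feat), Xtr.T @ Ytr)
pred = Xte @ W_hat
# Held-out R^2 across all voxels as a simple accuracy summary
r2 = 1 - ((pred - Yte) ** 2).sum() / ((Yte - Yte.mean(0)) ** 2).sum()
```

In practice, encoding studies typically fit one regularized regression per voxel and evaluate prediction accuracy per voxel on held-out stimuli; the pooled R² here is only a compact summary for the sketch.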
We construct a new visual encoding framework to predict cortical responses in a benchmark functional magnetic resonance imaging (fMRI) dataset. In this framework, we employ transfer learning to incorporate a pre-trained DNN (i.e., AlexNet) and train a nonlinear mapping from visual features to brain activity. This nonlinear mapping replaces the conventional linear mapping and is expected to improve prediction accuracy on measured activity in the human visual cortex.
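The transfer-learning idea described above (frozen pre-trained features feeding a trainable nonlinear head) can be illustrated with a minimal NumPy sketch. This is not the authors' architecture: the "DNN features" are random stand-ins for frozen AlexNet activations, and the nonlinear mapping is a hand-rolled one-hidden-layer MLP trained by full-batch gradient descent on a mean-squared-error loss; sizes and the learning rate are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n_stim, n_feat, n_vox = 400, 20, 5
X = rng.standard_normal((n_stim, n_feat))            # stand-in for frozen DNN features
H = np.tanh(X @ (rng.standard_normal((n_feat, 8)) * 0.5))
Y = H @ rng.standard_normal((8, n_vox))              # simulated nonlinear voxel responses

# Trainable nonlinear head: features -> hidden tanh layer -> voxel responses
W1 = rng.standard_normal((n_feat, 16)) * 0.1
b1 = np.zeros(16)
W2 = rng.standard_normal((16, n_vox)) * 0.1
b2 = np.zeros(n_vox)
lr = 0.01
Xtr, Ytr = X[:300], Y[:300]
for _ in range(2000):
    A = np.tanh(Xtr @ W1 + b1)                       # hidden activations
    P = A @ W2 + b2                                  # predicted responses
    G = 2 * (P - Ytr) / len(Xtr)                     # gradient of MSE w.r.t. P
    gW2 = A.T @ G
    gA = (G @ W2.T) * (1 - A ** 2)                   # backprop through tanh
    gW1 = Xtr.T @ gA
    W2 -= lr * gW2; b2 -= lr * G.sum(0)
    W1 -= lr * gW1; b1 -= lr * gA.sum(0)

# Held-out prediction accuracy of the nonlinear mapping
Pte = np.tanh(X[300:] @ W1 + b1) @ W2 + b2
r2 = 1 - ((Pte - Y[300:]) ** 2).sum() / ((Y[300:] - Y[300:].mean(0)) ** 2).sum()
```

The design point is that only the mapping head is trained; in the actual framework the feature extractor would be a pre-trained AlexNet whose weights are reused rather than random projections.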
The proposed framework significantly predicts the responses of over 20% of voxels in early visual areas (i.e., V1 through the lateral occipital region, LO) and achieves unprecedented prediction accuracy.
Compared with two conventional visual encoding models, the proposed encoding model shows consistently higher prediction accuracy across all early visual areas, especially in relatively anterior visual areas (i.e., V4 and LO).
Our work proposes a new framework that utilizes pre-trained visual features and trains nonlinear mappings from visual features to brain activity.