• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于大脑活动的虚假重建。

Spurious reconstruction from brain activity.

作者信息

Shirakawa Ken, Nagano Yoshihiro, Tanaka Misato, Aoki Shuntaro C, Muraki Yusuke, Majima Kei, Kamitani Yukiyasu

机构信息

Graduate School of Informatics, Kyoto University, Sakyo-ku, Kyoto, 606-8501, Japan; Computational Neuroscience Laboratories, Advanced Telecommunications Research Institute International, Seika, Soraku, Kyoto, 619-0288, Japan.

Graduate School of Informatics, Kyoto University, Sakyo-ku, Kyoto, 606-8501, Japan; Computational Neuroscience Laboratories, Advanced Telecommunications Research Institute International, Seika, Soraku, Kyoto, 619-0288, Japan.

出版信息

Neural Netw. 2025 Oct;190:107515. doi: 10.1016/j.neunet.2025.107515. Epub 2025 May 27.

DOI:10.1016/j.neunet.2025.107515
PMID:40499302
Abstract

Advances in brain decoding, particularly in visual image reconstruction, have sparked discussions about the societal implications and ethical considerations of neurotechnology. As reconstruction methods aim to recover visual experiences from brain activity and achieve prediction beyond training samples (zero-shot prediction), it is crucial to assess their capabilities and limitations to inform public expectations and regulations. Our case study of recent text-guided reconstruction methods, which leverage a large-scale dataset (Natural Scenes Dataset, NSD) and text-to-image diffusion models, reveals critical limitations in their generalizability, demonstrated by poor reconstructions on a different dataset. UMAP visualization of the text features from NSD images shows limited diversity with overlapping semantic and visual clusters between training and test sets. We identify that clustered training samples can lead to "output dimension collapse," restricting predictable output feature dimensions. While diverse training data improves generalization over the entire feature space without requiring exponential scaling, text features alone prove insufficient for mapping to the visual space. Our findings suggest that the apparent realism in current text-guided reconstructions stems from a combination of classification into trained categories and inauthentic image generation (hallucination) through diffusion models, rather than genuine visual reconstruction. We argue that careful selection of datasets and target features, coupled with rigorous evaluation methods, is essential for achieving authentic visual image reconstruction. These insights underscore the importance of grounding interdisciplinary discussions in a thorough understanding of the technology's current capabilities and limitations to ensure responsible development.

摘要

大脑解码技术的进展,尤其是视觉图像重建方面的进展,引发了关于神经技术的社会影响和伦理考量的讨论。由于重建方法旨在从大脑活动中恢复视觉体验并实现超越训练样本的预测(零样本预测),评估其能力和局限性对于引导公众期望和制定相关规定至关重要。我们对近期文本引导的重建方法进行的案例研究,这些方法利用了大规模数据集(自然场景数据集,NSD)和文本到图像的扩散模型,揭示了它们在泛化能力方面的关键局限性,这在不同数据集上的糟糕重建结果中得到了体现。对NSD图像的文本特征进行UMAP可视化显示,训练集和测试集之间语义和视觉聚类存在重叠,多样性有限。我们发现聚类的训练样本会导致“输出维度坍缩”,限制了可预测的输出特征维度。虽然多样化的训练数据无需指数级扩展就能在整个特征空间上提高泛化能力,但仅靠文本特征不足以映射到视觉空间。我们的研究结果表明,当前文本引导的重建中看似逼真的效果源于对训练类别进行分类以及通过扩散模型生成虚假图像(幻觉)的结合,而非真正的视觉重建。我们认为,精心选择数据集和目标特征,再加上严格的评估方法,对于实现真实的视觉图像重建至关重要。这些见解强调了在深入理解该技术当前的能力和局限性的基础上进行跨学科讨论的重要性,以确保其负责任的发展。

相似文献

1
Spurious reconstruction from brain activity.基于大脑活动的虚假重建。
Neural Netw. 2025 Oct;190:107515. doi: 10.1016/j.neunet.2025.107515. Epub 2025 May 27.
2
Short-Term Memory Impairment短期记忆障碍
3
Retrieving and reconstructing conceptually similar images from fMRI with latent diffusion models and a neuro-inspired brain decoding model.使用潜在扩散模型和神经启发式脑解码模型从功能磁共振成像中检索和重建概念上相似的图像。
J Neural Eng. 2024 Jun 28;21(4). doi: 10.1088/1741-2552/ad593c.
4
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
5
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
8
Perceptions and experiences of the prevention, detection, and management of postpartum haemorrhage: a qualitative evidence synthesis.预防、检测和管理产后出血的认知和经验:定性证据综合。
Cochrane Database Syst Rev. 2023 Nov 27;11(11):CD013795. doi: 10.1002/14651858.CD013795.pub2.
9
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
10
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

引用本文的文献

1
Natural sounds can be reconstructed from human neuroimaging data using deep neural network representation.利用深度神经网络表示,可以从人类神经成像数据中重建自然声音。
PLoS Biol. 2025 Jul 23;23(7):e3003293. doi: 10.1371/journal.pbio.3003293. eCollection 2025 Jul.
2
Inter-individual and inter-site neural code conversion without shared stimuli.无共享刺激下的个体间和位点间神经编码转换
Nat Comput Sci. 2025 Jul;5(7):534-546. doi: 10.1038/s43588-025-00826-5. Epub 2025 Jul 11.
3
Dynamic representation of multidimensional object properties in the human brain.
人类大脑中多维物体属性的动态表征。
bioRxiv. 2025 Feb 28:2023.09.08.556679. doi: 10.1101/2023.09.08.556679.