Department of Mathematics, Western University, London, Ontario, Canada; Fields Laboratory for Network Science, Fields Institute, Toronto, Ontario, Canada.
Department of Philosophy, University of California at San Diego, San Diego, CA, USA.
Trends Neurosci. 2024 Oct;47(10):788-802. doi: 10.1016/j.tins.2024.08.006. Epub 2024 Sep 27.
The capabilities of transformer networks such as ChatGPT and other large language models (LLMs) have captured the world's attention. The crucial computational mechanism underlying their performance relies on transforming a complete input sequence - for example, all the words in a sentence - into a long 'encoding vector' that allows transformers to learn long-range temporal dependencies in naturalistic sequences. Specifically, 'self-attention' applied to this encoding vector enhances temporal context in transformers by computing associations between pairs of words in the input sequence. We suggest that waves of neural activity traveling across single cortical areas, or across multiple regions at the whole-brain scale, could implement a similar encoding principle. By encapsulating recent input history into a single spatial pattern at each moment in time, cortical waves may enable a temporal context to be extracted from sequences of sensory inputs, using the same computational principle as transformers.
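To make the pairwise-association idea concrete, the following is a minimal NumPy sketch of scaled dot-product self-attention applied to a toy sequence. It is an illustrative assumption, not code from the article or from any particular LLM: the function name self_attention, the projection matrices Wq, Wk, Wv, and the toy dimensions are all hypothetical, chosen only to show how each output position mixes information from every other position in the sequence.

```python
# Minimal sketch (assumed, illustrative) of scaled dot-product self-attention.
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """X: (seq_len, d_model) token embeddings; Wq/Wk/Wv: learned projection matrices."""
    Q = X @ Wq                       # queries
    K = X @ Wk                       # keys
    V = X @ Wv                       # values
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # associations between every pair of positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over positions
    return weights @ V               # each output vector mixes context from the whole sequence

# Toy usage with random embeddings standing in for a 5-word "sentence".
rng = np.random.default_rng(0)
seq_len, d_model, d_k = 5, 16, 8
X = rng.normal(size=(seq_len, d_model))
Wq, Wk, Wv = (rng.normal(size=(d_model, d_k)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)  # (5, 8): one context-enriched vector per input word
```

The attention weights here are the pairwise associations the abstract refers to: each row of the softmaxed score matrix specifies how much temporal context every other position contributes to the current one.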