
Attention heads of large language models.

Authors

Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Yang Mingchuan, Tang Bo, Xiong Feiyu, Li Zhiyu

Affiliations

Institute for Advanced Algorithms Research (IAAR), Shanghai, China.

Institute for AI Industry Research (AIR), Tsinghua University, Beijing, China.

Publication information

Patterns (N Y). 2025 Feb 6;6(2):101176. doi: 10.1016/j.patter.2025.101176. eCollection 2025 Feb 14.

DOI:10.1016/j.patter.2025.101176
PMID:40041856
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11873009/
Abstract

Large language models (LLMs) have demonstrated performance approaching human levels in tasks such as long-text comprehension and mathematical reasoning, but they remain black-box systems. Understanding the reasoning bottlenecks of LLMs remains a critical challenge, as these limitations are deeply tied to their internal architecture. Attention heads play a pivotal role in reasoning and are thought to share similarities with human brain functions. In this review, we explore the roles and mechanisms of attention heads to help demystify the internal reasoning processes of LLMs. We first introduce a four-stage framework inspired by the human thought process. Using this framework, we review existing research to identify and categorize the functions of specific attention heads. Additionally, we analyze the experimental methodologies used to discover these special heads and further summarize relevant evaluation methods and benchmarks. Finally, we discuss the limitations of current research and propose several potential future directions.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/bd3236b8019d/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/f6df3c27e72c/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/8bbc0793c128/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/3c5aba332d01/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/9384fc3478a7/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/cb317058f1b3/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/2eff70de9334/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/0bd4e71202f2/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/507cbb8b5167/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/74a5a3367586/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/bac4c8ce6769/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b538/11873009/d3145d4ef766/gr12.jpg

Cited by

1
Development and validation of large language model rating scales for automatically transcribed psychological therapy sessions.
Sci Rep. 2025 Aug 12;15(1):29541. doi: 10.1038/s41598-025-14923-y.
2
Artificial Intelligence in Orthopedic Surgery: Current Applications, Challenges, and Future Directions.
MedComm (2020). 2025 Jun 25;6(7):e70260. doi: 10.1002/mco2.70260. eCollection 2025 Jul.
3
Deep learning for MRI-based acute and subacute ischaemic stroke lesion segmentation-a systematic review, meta-analysis, and pilot evaluation of key results.
Front Med Technol. 2025 Jun 10;7:1491197. doi: 10.3389/fmedt.2025.1491197. eCollection 2025.
