• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

注意力机制及其在复杂系统中的应用。

Attention Mechanisms and Their Applications to Complex Systems.

作者信息

Hernández Adrián, Amigó José M

机构信息

Centro de Investigación Operativa, Universidad Miguel Hernández, Av. de la Universidad s/n, 03202 Elche, Spain.

出版信息

Entropy (Basel). 2021 Feb 26;23(3):283. doi: 10.3390/e23030283.

DOI:10.3390/e23030283
PMID:33652728
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7996841/
Abstract

Deep learning models and graphics processing units have completely transformed the field of machine learning. Recurrent neural networks and long short-term memories have been successfully used to model and predict complex systems. However, these classic models do not perform sequential reasoning, a process that guides a task based on perception and memory. In recent years, attention mechanisms have emerged as a promising solution to these problems. In this review, we describe the key aspects of attention mechanisms and some relevant attention techniques and point out why they are a remarkable advance in machine learning. Then, we illustrate some important applications of these techniques in the modeling of complex systems.

摘要

深度学习模型和图形处理单元彻底改变了机器学习领域。循环神经网络和长短期记忆已成功用于对复杂系统进行建模和预测。然而,这些经典模型并不执行序列推理,序列推理是一个基于感知和记忆来指导任务的过程。近年来,注意力机制已成为解决这些问题的一个有前景的方案。在本综述中,我们描述了注意力机制的关键方面以及一些相关的注意力技术,并指出它们为何是机器学习中的一项显著进展。然后,我们阐述了这些技术在复杂系统建模中的一些重要应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/5abab03a12c0/entropy-23-00283-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/e963f601247f/entropy-23-00283-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/3e3d13df0ffe/entropy-23-00283-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/c2d7a3c077bb/entropy-23-00283-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/488eab09e4ca/entropy-23-00283-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/40b54117124c/entropy-23-00283-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/4990f11aef24/entropy-23-00283-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/ad68cd2e0cd3/entropy-23-00283-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/86330fa4dda5/entropy-23-00283-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/7c4e2c85fd26/entropy-23-00283-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/2e67bed83959/entropy-23-00283-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/5abab03a12c0/entropy-23-00283-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/e963f601247f/entropy-23-00283-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/3e3d13df0ffe/entropy-23-00283-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/c2d7a3c077bb/entropy-23-00283-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/488eab09e4ca/entropy-23-00283-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/40b54117124c/entropy-23-00283-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/4990f11aef24/entropy-23-00283-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/ad68cd2e0cd3/entropy-23-00283-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/86330fa4dda5/entropy-23-00283-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/7c4e2c85fd26/entropy-23-00283-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/2e67bed83959/entropy-23-00283-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f93/7996841/5abab03a12c0/entropy-23-00283-g011.jpg

相似文献

1
Attention Mechanisms and Their Applications to Complex Systems.注意力机制及其在复杂系统中的应用。
Entropy (Basel). 2021 Feb 26;23(3):283. doi: 10.3390/e23030283.
2
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
3
Performance of a Computational Model of the Mammalian Olfactory System哺乳动物嗅觉系统计算模型的性能
4
[Influence of attention on an auditory-verbal learning test in schizophrenic patients].[注意力对精神分裂症患者听觉言语学习测试的影响]
Encephale. 2002 Jul-Aug;28(4):291-7.
5
Engineering neural systems for high-level problem solving.工程化神经系统以解决高级别问题。
Neural Netw. 2016 Jul;79:37-52. doi: 10.1016/j.neunet.2016.03.006. Epub 2016 Mar 31.
6
Deep Reinforcement Learning for Sequence-to-Sequence Models.深度强化学习在序列到序列模型中的应用。
IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2469-2489. doi: 10.1109/TNNLS.2019.2929141. Epub 2019 Aug 15.
7
Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals.验证深度神经网络用于从 EEG 信号中在线解码运动想象运动。
Sensors (Basel). 2019 Jan 8;19(1):210. doi: 10.3390/s19010210.
8
Distinct patterns of structural damage underlie working memory and reasoning deficits after traumatic brain injury.创伤性脑损伤后工作记忆和推理缺陷的结构损伤基础不同。
Brain. 2020 Apr 1;143(4):1158-1176. doi: 10.1093/brain/awaa067.
9
Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks.基于双向长短期记忆和注意力机制的神经网络的生物医学词义消歧。
BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):502. doi: 10.1186/s12859-019-3079-8.
10
Real-time computing without stable states: a new framework for neural computation based on perturbations.无稳定状态的实时计算:基于扰动的神经计算新框架。
Neural Comput. 2002 Nov;14(11):2531-60. doi: 10.1162/089976602760407955.

引用本文的文献

1
Enhancing english oral translation through cross-modal learning and synchronous optimization.通过跨模态学习与同步优化提升英语口译水平
PLoS One. 2025 Aug 18;20(8):e0329381. doi: 10.1371/journal.pone.0329381. eCollection 2025.
2
Enhanced gastrointestinal disease classification using a convvit hybrid model on endoscopic images.使用卷积视觉变换器(ConvVit)混合模型对内镜图像进行增强的胃肠道疾病分类
Phys Eng Sci Med. 2025 Jul 21. doi: 10.1007/s13246-025-01600-7.
3
Uncovering subtype-specific metabolic signatures in breast cancer through multimodal integration, attention-based deep learning, and self-organizing maps.

本文引用的文献

1
Attention in Psychology, Neuroscience, and Machine Learning.心理学、神经科学和机器学习中的注意力
Front Comput Neurosci. 2020 Apr 16;14:29. doi: 10.3389/fncom.2020.00029. eCollection 2020.
2
Mastering the game of Go without human knowledge.无需人类知识即可掌握围棋游戏。
Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.
3
Hybrid computing using a neural network with dynamic external memory.使用具有动态外部存储器的神经网络进行混合计算。
通过多模态整合、基于注意力的深度学习和自组织映射揭示乳腺癌亚型特异性代谢特征。
Sci Rep. 2025 Jul 1;15(1):21775. doi: 10.1038/s41598-025-06459-y.
4
Tiny Language Models for Automation and Control: Overview, Potential Applications, and Future Research Directions.用于自动化与控制的微型语言模型:概述、潜在应用及未来研究方向
Sensors (Basel). 2025 Feb 21;25(5):1318. doi: 10.3390/s25051318.
5
Natural Language Processing Methods for the Study of Protein-Ligand Interactions.用于蛋白质-配体相互作用研究的自然语言处理方法
J Chem Inf Model. 2025 Mar 10;65(5):2191-2213. doi: 10.1021/acs.jcim.4c01907. Epub 2025 Feb 24.
6
Feasibility study of opportunistic osteoporosis screening on chest CT using a multi-feature fusion DCNN model.利用多特征融合 DCNN 模型对胸部 CT 进行机会性骨质疏松筛查的可行性研究。
Arch Osteoporos. 2024 Oct 17;19(1):98. doi: 10.1007/s11657-024-01455-7.
7
Enhancing early Parkinson's disease detection through multimodal deep learning and explainable AI: insights from the PPMI database.通过多模态深度学习和可解释人工智能提高早期帕金森病检测:来自 PPMI 数据库的见解。
Sci Rep. 2024 Sep 9;14(1):20941. doi: 10.1038/s41598-024-70165-4.
8
Application of an end-to-end model with self-attention mechanism in cardiac disease prediction.具有自注意力机制的端到端模型在心脏病预测中的应用。
Front Physiol. 2024 Jan 12;14:1308774. doi: 10.3389/fphys.2023.1308774. eCollection 2023.
9
Recognition of rare antinuclear antibody patterns based on a novel attention-based enhancement framework.基于新型注意力增强框架识别罕见抗核抗体模式。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbad531.
10
Deep learning tools to accelerate antibiotic discovery.深度学习工具加速抗生素发现。
Expert Opin Drug Discov. 2023 Jul-Dec;18(11):1245-1257. doi: 10.1080/17460441.2023.2250721. Epub 2023 Oct 18.
Nature. 2016 Oct 27;538(7626):471-476. doi: 10.1038/nature20101. Epub 2016 Oct 12.
4
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
5
Inhibition in multiclass classification.多类分类中的抑制。
Neural Comput. 2012 Sep;24(9):2473-507. doi: 10.1162/NECO_a_00321. Epub 2012 May 17.
6
Identification and control of dynamical systems using neural networks.利用神经网络对动态系统进行识别与控制。
IEEE Trans Neural Netw. 1990;1(1):4-27. doi: 10.1109/72.80202.
7
Neurodynamics of biased competition and cooperation for attention: a model with spiking neurons.注意力偏向竞争与合作的神经动力学:一个基于脉冲神经元的模型。
J Neurophysiol. 2005 Jul;94(1):295-313. doi: 10.1152/jn.01095.2004. Epub 2005 Feb 9.
8
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.