• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

学习遗忘:使用长短期记忆网络进行持续预测。

Learning to forget: continual prediction with LSTM.

作者信息

Gers F A, Schmidhuber J, Cummins F

机构信息

IDSIA, Lugano, Switzerland.

出版信息

Neural Comput. 2000 Oct;12(10):2451-71. doi: 10.1162/089976600300015015.

DOI:10.1162/089976600300015015
PMID:11032042
Abstract

Long short-term memory (LSTM; Hochreiter & Schmidhuber, 1997) can solve numerous tasks not solvable by previous learning algorithms for recurrent neural networks (RNNs). We identify a weakness of LSTM networks processing continual input streams that are not a priori segmented into subsequences with explicitly marked ends at which the network's internal state could be reset. Without resets, the state may grow indefinitely and eventually cause the network to break down. Our remedy is a novel, adaptive "forget gate" that enables an LSTM cell to learn to reset itself at appropriate times, thus releasing internal resources. We review illustrative benchmark problems on which standard LSTM outperforms other RNN algorithms. All algorithms (including LSTM) fail to solve continual versions of these problems. LSTM with forget gates, however, easily solves them, and in an elegant way.

摘要

长短期记忆网络(LSTM;霍赫雷特和施密德胡伯,1997)能够解决许多传统递归神经网络(RNN)学习算法无法解决的任务。我们发现LSTM网络在处理连续输入流时存在一个弱点,这些输入流并非先验地被分割成带有明确标记结尾的子序列,而在这些结尾处网络的内部状态可以被重置。如果不进行重置,状态可能会无限增长并最终导致网络崩溃。我们的解决方法是一种新颖的自适应“遗忘门”,它使LSTM单元能够学会在适当的时候重置自身,从而释放内部资源。我们回顾了一些具有代表性的基准问题,在这些问题上标准LSTM优于其他RNN算法。所有算法(包括LSTM)都无法解决这些问题的连续版本。然而,带有遗忘门的LSTM能够轻松地解决它们,而且方式很巧妙。

相似文献

1
Learning to forget: continual prediction with LSTM.学习遗忘:使用长短期记忆网络进行持续预测。
Neural Comput. 2000 Oct;12(10):2451-71. doi: 10.1162/089976600300015015.
2
Subtraction Gates: Another Way to Learn Long-Term Dependencies in Recurrent Neural Networks.减法门控:循环神经网络中学习长期依赖关系的另一种方法。
IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1740-1751. doi: 10.1109/TNNLS.2020.3043752. Epub 2022 Apr 4.
3
A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures.递归神经网络综述:长短期记忆细胞和网络架构。
Neural Comput. 2019 Jul;31(7):1235-1270. doi: 10.1162/neco_a_01199. Epub 2019 May 21.
4
Working Memory Connections for LSTM.长短期记忆网络的工作记忆连接。
Neural Netw. 2021 Dec;144:334-341. doi: 10.1016/j.neunet.2021.08.030. Epub 2021 Sep 4.
5
Gated Orthogonal Recurrent Units: On Learning to Forget.门控正交循环单元:关于学习遗忘
Neural Comput. 2019 Apr;31(4):765-783. doi: 10.1162/neco_a_01174. Epub 2019 Feb 14.
6
Explicit Duration Recurrent Networks.显式持续时间递归网络。
IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):3120-3130. doi: 10.1109/TNNLS.2021.3051019. Epub 2022 Jul 6.
7
Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks.使用长短期记忆(LSTM)循环神经网络进行短话语中的语言识别
PLoS One. 2016 Jan 29;11(1):e0146917. doi: 10.1371/journal.pone.0146917. eCollection 2016.
8
Prediction for Origin-Destination Distribution of Dockless Shared Bicycles: A Case Study in Nanjing City.无桩共享单车的 OD 分布预测:以南京市为例。
Front Public Health. 2022 Apr 8;10:849766. doi: 10.3389/fpubh.2022.849766. eCollection 2022.
9
Performance of recurrent neural networks with Monte Carlo dropout for predicting pharmacokinetic parameters from dynamic contrast-enhanced magnetic resonance imaging data.用于从动态对比增强磁共振成像数据预测药代动力学参数的蒙特卡洛随机失活循环神经网络的性能
J Appl Clin Med Phys. 2025 Feb;26(2):e14586. doi: 10.1002/acm2.14586. Epub 2024 Dec 23.
10
Extended-Range Prediction Model Using NSGA-III Optimized RNN-GRU-LSTM for Driver Stress and Drowsiness.基于 NSGA-III 优化 RNN-GRU-LSTM 的驾驶员应激和困倦的扩展范围预测模型。
Sensors (Basel). 2021 Sep 25;21(19):6412. doi: 10.3390/s21196412.

引用本文的文献

1
Time series forecasting of chlorophyll-a concentrations in the Chesapeake Bay.切萨皮克湾叶绿素a浓度的时间序列预测
Sci Rep. 2025 Aug 22;15(1):30877. doi: 10.1038/s41598-025-16352-3.
2
Fatigue Life Prediction of CFRP-FBG Sensor-Reinforced RC Beams Enabled by LSTM-Based Deep Learning.基于长短期记忆网络的深度学习实现的碳纤维增强塑料-光纤布拉格光栅传感器增强钢筋混凝土梁疲劳寿命预测
Polymers (Basel). 2025 Jul 31;17(15):2112. doi: 10.3390/polym17152112.
3
Revolutionizing clinical decision making through deep learning and topic modeling for pathway optimization.
通过深度学习和主题建模实现临床决策的变革,以优化诊疗路径。
Sci Rep. 2025 Aug 6;15(1):28787. doi: 10.1038/s41598-025-12679-z.
4
Pupil Detection Algorithm Based on ViM.基于ViM的瞳孔检测算法
Sensors (Basel). 2025 Jun 26;25(13):3978. doi: 10.3390/s25133978.
5
Forecasting tuberculosis in Ethiopia using deep learning: progress toward sustainable development goal evidence from global burden of disease 1990-2021.利用深度学习预测埃塞俄比亚的结核病:1990 - 2021年全球疾病负担研究中可持续发展目标的进展证据
BMC Infect Dis. 2025 Jul 1;25(1):870. doi: 10.1186/s12879-025-11228-3.
6
An Enhanced Hybrid Model Combining CNN, BiLSTM, and Attention Mechanism for ECG Segment Classification.一种结合卷积神经网络(CNN)、双向长短期记忆网络(BiLSTM)和注意力机制的增强混合模型用于心电图段分类
Biomed Eng Comput Biol. 2025 Jun 17;16:11795972251341051. doi: 10.1177/11795972251341051. eCollection 2025.
7
Artificial intelligence for streamflow prediction in river basins: a use case in Mar Menor.用于流域径流预测的人工智能:以马略尔卡岛为例
Sci Rep. 2025 Jun 3;15(1):19481. doi: 10.1038/s41598-025-04524-0.
8
Startup Drift Compensation of MEMS INS Based on PSO-GRNN Network.基于粒子群优化广义回归神经网络的微机电系统惯性导航系统初始对准漂移补偿
Micromachines (Basel). 2025 Apr 29;16(5):524. doi: 10.3390/mi16050524.
9
Use of Technologies for the Acquisition and Processing Strategies for Motion Data Analysis.用于运动数据分析的采集与处理策略的技术应用
Biomimetics (Basel). 2025 May 20;10(5):339. doi: 10.3390/biomimetics10050339.
10
Inverse design of metal-organic frameworks using deep dreaming approaches.使用深度梦境方法进行金属有机框架的逆向设计。
Nat Commun. 2025 May 23;16(1):4806. doi: 10.1038/s41467-025-59952-3.