• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于训练损失的自适应学习率的有效神经网络训练。

Effective neural network training with adaptive learning rate based on training loss.

机构信息

Graduate School of Information Science and Technology, Hokkaido University, Kita 14 Nishi 9 Kita-ku, Sapporo, Japan.

出版信息

Neural Netw. 2018 May;101:68-78. doi: 10.1016/j.neunet.2018.01.016. Epub 2018 Feb 13.

DOI:10.1016/j.neunet.2018.01.016
PMID:29494873
Abstract

A method that uses an adaptive learning rate is presented for training neural networks. Unlike most conventional updating methods in which the learning rate gradually decreases during training, the proposed method increases or decreases the learning rate adaptively so that the training loss (the sum of cross-entropy losses for all training samples) decreases as much as possible. It thus provides a wider search range for solutions and thus a lower test error rate. The experiments with some well-known datasets to train a multilayer perceptron show that the proposed method is effective for obtaining a better test accuracy under certain conditions.

摘要

本文提出了一种使用自适应学习率的方法来训练神经网络。与大多数传统的更新方法不同,这些方法在训练过程中逐渐降低学习率,本文提出的方法则自适应地增加或减少学习率,以使训练损失(所有训练样本的交叉熵损失之和)尽可能减小。因此,它为解决方案提供了更广泛的搜索范围,从而降低了测试错误率。通过使用一些著名的数据集训练多层感知机的实验表明,在某些条件下,该方法可以有效地获得更好的测试准确性。

相似文献

1
Effective neural network training with adaptive learning rate based on training loss.基于训练损失的自适应学习率的有效神经网络训练。
Neural Netw. 2018 May;101:68-78. doi: 10.1016/j.neunet.2018.01.016. Epub 2018 Feb 13.
2
Research on a learning rate with energy index in deep learning.深度学习中能量指数学习率的研究。
Neural Netw. 2019 Feb;110:225-231. doi: 10.1016/j.neunet.2018.12.009. Epub 2018 Dec 19.
3
Novel maximum-margin training algorithms for supervised neural networks.用于监督神经网络的新型最大间隔训练算法。
IEEE Trans Neural Netw. 2010 Jun;21(6):972-84. doi: 10.1109/TNN.2010.2046423. Epub 2010 Apr 19.
4
Complexity issues in natural gradient descent method for training multilayer perceptrons.用于训练多层感知器的自然梯度下降法中的复杂性问题。
Neural Comput. 1998 Nov 15;10(8):2137-57. doi: 10.1162/089976698300017007.
5
Maximum entropy methods for extracting the learned features of deep neural networks.用于提取深度神经网络学习特征的最大熵方法。
PLoS Comput Biol. 2017 Oct 30;13(10):e1005836. doi: 10.1371/journal.pcbi.1005836. eCollection 2017 Oct.
6
Deep learning-based detection and classification of geographic atrophy using a deep convolutional neural network classifier.使用深度卷积神经网络分类器基于深度学习的地理性萎缩检测与分类
Graefes Arch Clin Exp Ophthalmol. 2018 Nov;256(11):2053-2060. doi: 10.1007/s00417-018-4098-2. Epub 2018 Aug 8.
7
DMP3: a dynamic multilayer perceptron construction algorithm.DMP3:一种动态多层感知器构建算法。
Int J Neural Syst. 2001 Apr;11(2):145-65. doi: 10.1142/S0129065701000576.
8
Deep neural network for traffic sign recognition systems: An analysis of spatial transformers and stochastic optimisation methods.基于深度神经网络的交通标志识别系统:空间转换器和随机优化方法分析。
Neural Netw. 2018 Mar;99:158-165. doi: 10.1016/j.neunet.2018.01.005. Epub 2018 Jan 31.
9
One pass learning for generalized classifier neural network.广义分类器神经网络的单通道学习
Neural Netw. 2016 Jan;73:70-6. doi: 10.1016/j.neunet.2015.10.008. Epub 2015 Oct 26.
10
Application of Deep Learning Architectures for Accurate and Rapid Detection of Internal Mechanical Damage of Blueberry Using Hyperspectral Transmittance Data.深度学习架构在利用高光谱透过率数据准确快速检测蓝莓内部机械损伤中的应用。
Sensors (Basel). 2018 Apr 7;18(4):1126. doi: 10.3390/s18041126.

引用本文的文献

1
Prediction of Cerebrospinal Fluid (CSF) Pressure with Generative Adversarial Network Synthetic Plasma-CSF Biomarker Pairing.利用生成对抗网络合成血浆-脑脊液生物标志物配对预测脑脊液(CSF)压力
Neuroinformatics. 2025 Jul 10;23(3):38. doi: 10.1007/s12021-025-09729-2.
2
Hyperparameter Optimization for Tomato Leaf Disease Recognition Based on YOLOv11m.基于YOLOv11m的番茄叶部病害识别超参数优化
Plants (Basel). 2025 Feb 21;14(5):653. doi: 10.3390/plants14050653.
3
Artificial neural network controlled DSTATCOM for mitigating power quality concerns in solar PV and wind system.
用于缓解太阳能光伏和风能系统电能质量问题的人工神经网络控制静止同步补偿器
Sci Rep. 2025 Feb 11;15(1):5016. doi: 10.1038/s41598-025-88540-0.
4
Efficient adaptive learning rate for convolutional neural network based on quadratic interpolation egret swarm optimization algorithm.基于二次插值白鹭群优化算法的卷积神经网络高效自适应学习率
Heliyon. 2024 Sep 13;10(18):e37814. doi: 10.1016/j.heliyon.2024.e37814. eCollection 2024 Sep 30.
5
An analytical approach for unsupervised learning rate estimation using rectified linear units.一种使用整流线性单元进行无监督学习率估计的分析方法。
Front Neurosci. 2024 Apr 8;18:1362510. doi: 10.3389/fnins.2024.1362510. eCollection 2024.
6
Fully automated whole brain segmentation from rat MRI scans with a convolutional neural network.使用卷积神经网络从大鼠磁共振成像扫描中进行全自动全脑分割。
J Neurosci Methods. 2024 May;405:110078. doi: 10.1016/j.jneumeth.2024.110078. Epub 2024 Feb 8.
7
Inference of network connectivity from temporally binned spike trains.从时间分箱的尖峰火车推断网络连通性。
J Neurosci Methods. 2024 Apr;404:110073. doi: 10.1016/j.jneumeth.2024.110073. Epub 2024 Feb 2.
8
A Learning-Rate Modulable and Reliable TiO Memristor Array for Robust, Fast, and Accurate Neuromorphic Computing.用于稳健、快速和准确神经形态计算的学习率可调且可靠的 TiO 忆阻器阵列。
Adv Sci (Weinh). 2022 Aug;9(22):e2201117. doi: 10.1002/advs.202201117. Epub 2022 Jun 5.
9
Off-State Performance Characterization of an AlGaN/GaN Device via Artificial Neural Networks.基于人工神经网络的AlGaN/GaN器件关态性能表征
Micromachines (Basel). 2022 May 5;13(5):737. doi: 10.3390/mi13050737.
10
Neural network prediction of 30-day mortality following primary total hip arthroplasty.初次全髋关节置换术后30天死亡率的神经网络预测
J Orthop. 2021 Nov 25;28:91-95. doi: 10.1016/j.jor.2021.11.013. eCollection 2021 Nov-Dec.