• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AdaCN:一种用于非凸随机优化的自适应三次牛顿方法。

AdaCN: An Adaptive Cubic Newton Method for Nonconvex Stochastic Optimization.

机构信息

School of Systems Engineering, National University of Defense Technology, Changsha 410073, China.

出版信息

Comput Intell Neurosci. 2021 Nov 10;2021:5790608. doi: 10.1155/2021/5790608. eCollection 2021.

DOI:10.1155/2021/5790608
PMID:34804146
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8598341/
Abstract

In this work, we introduce AdaCN, a novel adaptive cubic Newton method for nonconvex stochastic optimization. AdaCN dynamically captures the curvature of the loss landscape by diagonally approximated Hessian plus the norm of difference between previous two estimates. It only requires at most first order gradients and updates with linear complexity for both time and memory. In order to reduce the variance introduced by the stochastic nature of the problem, AdaCN hires the first and second moment to implement and exponential moving average on iteratively updated stochastic gradients and approximated stochastic Hessians, respectively. We validate AdaCN in extensive experiments, showing that it outperforms other stochastic first order methods (including SGD, Adam, and AdaBound) and stochastic quasi-Newton method (i.e., Apollo), in terms of both convergence speed and generalization performance.

摘要

在这项工作中,我们引入了 AdaCN,这是一种用于非凸随机优化的新颖自适应立方牛顿方法。AdaCN 通过对角近似的海森矩阵和前两次估计之间的差的范数来动态捕捉损失曲面的曲率。它只需要最多一阶梯度,并以线性复杂度进行时间和内存更新。为了减少问题的随机性质引入的方差,AdaCN 使用一阶和二阶矩分别对迭代更新的随机梯度和近似随机海森矩阵进行实现和指数移动平均。我们在广泛的实验中验证了 AdaCN,表明它在收敛速度和泛化性能方面都优于其他随机一阶方法(包括 SGD、Adam 和 AdaBound)和随机拟牛顿方法(即 Apollo)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/c7ed4ebcaf4c/CIN2021-5790608.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/5a56c09e0806/CIN2021-5790608.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/d57929f83158/CIN2021-5790608.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/1d7e74301db0/CIN2021-5790608.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/2582ecb48e5c/CIN2021-5790608.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/5be3de2b5434/CIN2021-5790608.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/d833e9512c21/CIN2021-5790608.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/b868b0eb7232/CIN2021-5790608.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/c7ed4ebcaf4c/CIN2021-5790608.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/5a56c09e0806/CIN2021-5790608.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/d57929f83158/CIN2021-5790608.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/1d7e74301db0/CIN2021-5790608.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/2582ecb48e5c/CIN2021-5790608.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/5be3de2b5434/CIN2021-5790608.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/d833e9512c21/CIN2021-5790608.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/b868b0eb7232/CIN2021-5790608.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e33/8598341/c7ed4ebcaf4c/CIN2021-5790608.alg.001.jpg

相似文献

1
AdaCN: An Adaptive Cubic Newton Method for Nonconvex Stochastic Optimization.AdaCN:一种用于非凸随机优化的自适应三次牛顿方法。
Comput Intell Neurosci. 2021 Nov 10;2021:5790608. doi: 10.1155/2021/5790608. eCollection 2021.
2
A novel adaptive cubic quasi-Newton optimizer for deep learning based medical image analysis tasks, validated on detection of COVID-19 and segmentation for COVID-19 lung infection, liver tumor, and optic disc/cup.一种用于深度学习的新型自适应三次拟牛顿优化器,在 COVID-19 检测和 COVID-19 肺部感染、肝脏肿瘤以及视盘/杯分割等医学图像分析任务中得到验证。
Med Phys. 2023 Mar;50(3):1528-1538. doi: 10.1002/mp.15969. Epub 2022 Oct 6.
3
Faster Stochastic Quasi-Newton Methods.更快的随机拟牛顿法
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4388-4397. doi: 10.1109/TNNLS.2021.3056947. Epub 2022 Aug 31.
4
A Stochastic Quasi-Newton Method for Large-Scale Nonconvex Optimization With Applications.一种用于大规模非凸优化的随机拟牛顿法及其应用
IEEE Trans Neural Netw Learn Syst. 2020 Nov;31(11):4776-4790. doi: 10.1109/TNNLS.2019.2957843. Epub 2020 Oct 29.
5
Communication-efficient distributed cubic Newton with compressed lazy Hessian.带压缩惰性海森的通信高效分布式三次牛顿法
Neural Netw. 2024 Jun;174:106212. doi: 10.1016/j.neunet.2024.106212. Epub 2024 Feb 27.
6
UAdam: Unified Adam-Type Algorithmic Framework for Nonconvex Optimization.UAdam:用于非凸优化的统一Adam型算法框架。
Neural Comput. 2024 Aug 19;36(9):1912-1938. doi: 10.1162/neco_a_01692.
7
Stochastic Optimization for Nonconvex Problem With Inexact Hessian Matrix, Gradient, and Function.具有不精确海森矩阵、梯度和函数的非凸问题的随机优化
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1651-1663. doi: 10.1109/TNNLS.2023.3326177. Epub 2025 Jan 7.
8
Asynchronous Parallel Stochastic Quasi-Newton Methods.异步并行随机拟牛顿法
Parallel Comput. 2021 Apr;101. doi: 10.1016/j.parco.2020.102721. Epub 2020 Nov 4.
9
Stochastic quasi-gradient methods: variance reduction via Jacobian sketching.随机拟梯度方法:通过雅可比矩阵草图实现方差缩减。
Math Program. 2021;188(1):135-192. doi: 10.1007/s10107-020-01506-0. Epub 2020 May 12.
10
Preconditioned Stochastic Gradient Descent.预处理随机梯度下降。
IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1454-1466. doi: 10.1109/TNNLS.2017.2672978. Epub 2017 Mar 9.

本文引用的文献

1
Gradient regularization of Newton method with Bregman distances.基于布雷格曼距离的牛顿法梯度正则化
Math Program. 2024;204(1-2):1-25. doi: 10.1007/s10107-023-01943-7. Epub 2023 Mar 24.