School of Systems Engineering, National University of Defense Technology, Changsha 410073, China.
Comput Intell Neurosci. 2021 Nov 10;2021:5790608. doi: 10.1155/2021/5790608. eCollection 2021.
In this work, we introduce AdaCN, a novel adaptive cubic Newton method for nonconvex stochastic optimization. AdaCN dynamically captures the curvature of the loss landscape through a diagonally approximated Hessian plus the norm of the difference between the previous two estimates. It requires at most first-order gradients and updates with linear complexity in both time and memory. To reduce the variance introduced by the stochastic nature of the problem, AdaCN uses the first and second moments to implement exponential moving averages over the iteratively updated stochastic gradients and the approximated stochastic Hessians, respectively. We validate AdaCN in extensive experiments, showing that it outperforms stochastic first-order methods (including SGD, Adam, and AdaBound) and a stochastic quasi-Newton method (Apollo) in both convergence speed and generalization performance.
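To make the ingredients named in the abstract concrete, the following is an illustrative sketch of an AdaCN-flavoured update, not the authors' exact algorithm: it combines a secant-style diagonal curvature estimate, exponential moving averages for the first and second moments (with Adam-style bias correction, an assumption on our part), and damping by the norm of the last step in place of the full cubic-regularization subproblem. All names (`m`, `B`, `step`, the hyperparameters) are illustrative.

```python
import numpy as np

def make_state(theta0):
    # State for the illustrative optimizer: first moment m, diagonal
    # curvature estimate B, and the previous iterate/gradient.
    return {"m": np.zeros_like(theta0), "B": np.zeros_like(theta0),
            "theta_prev": theta0.copy(), "g_prev": np.zeros_like(theta0)}

def step(theta, g, state, t, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    # First moment: exponential moving average of stochastic gradients.
    state["m"] = beta1 * state["m"] + (1 - beta1) * g
    # Secant-style diagonal curvature estimate: |Δg / Δθ| elementwise,
    # falling back to 1 where the previous step is too small to divide by.
    delta = theta - state["theta_prev"]
    safe = np.where(np.abs(delta) > eps, delta, 1.0)
    h = np.where(np.abs(delta) > eps, np.abs((g - state["g_prev"]) / safe), 1.0)
    # Second moment: exponential moving average of the diagonal curvature.
    state["B"] = beta2 * state["B"] + (1 - beta2) * h
    # Bias correction, as in Adam (an assumption, not stated in the abstract).
    m_hat = state["m"] / (1 - beta1 ** t)
    b_hat = state["B"] / (1 - beta2 ** t)
    # Cubic-regularization-flavoured damping via the norm of the last step.
    damping = np.linalg.norm(delta)
    state["theta_prev"], state["g_prev"] = theta.copy(), g.copy()
    return theta - lr * m_hat / (b_hat + damping + eps)

# Usage: minimize f(x) = 0.5 * x^2, whose gradient is x itself.
theta = np.array([5.0])
state = make_state(theta)
for t in range(1, 301):
    theta = step(theta, theta.copy(), state, t)
```

Note that everything here uses only first-order gradients and elementwise vector operations, which is what gives the claimed linear time and memory cost per update.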