• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

马尔可夫链随机 DCA 及其在 PDEs 正则化深度学习中的应用。

Markov chain stochastic DCA and applications in deep learning with PDEs regularization.

机构信息

Université de Lorraine, LGIPM, Metz, 57000, France.

Université de Lorraine, LGIPM, Metz, 57000, France; Institut Universitaire de France (IUF), Paris, France.

出版信息

Neural Netw. 2024 Feb;170:149-166. doi: 10.1016/j.neunet.2023.11.032. Epub 2023 Nov 13.

DOI:10.1016/j.neunet.2023.11.032
PMID:37984042
Abstract

This paper addresses a large class of nonsmooth nonconvex stochastic DC (difference-of-convex functions) programs where endogenous uncertainty is involved and i.i.d. (independent and identically distributed) samples are not available. Instead, we assume that it is only possible to access Markov chains whose sequences of distributions converge to the target distributions. This setting is legitimate as Markovian noise arises in many contexts including Bayesian inference, reinforcement learning, and stochastic optimization in high-dimensional or combinatorial spaces. We then design a stochastic algorithm named Markov chain stochastic DCA (MCSDCA) based on DCA (DC algorithm) - a well-known method for nonconvex optimization. We establish the convergence analysis in both asymptotic and nonasymptotic senses. The MCSDCA is then applied to deep learning via PDEs (partial differential equations) regularization, where two realizations of MCSDCA are constructed, namely MCSDCA-odLD and MCSDCA-udLD, based on overdamped and underdamped Langevin dynamics, respectively. Numerical experiments on time series prediction and image classification problems with a variety of neural network topologies show the merits of the proposed methods.

摘要

本文针对一大类非光滑非凸随机 DC(凸差函数)程序,其中涉及内源性不确定性且无法获得独立同分布 (i.i.d.) 样本。相反,我们假设只能访问其分布序列收敛到目标分布的马尔可夫链。这种设置是合理的,因为马尔可夫噪声出现在许多上下文包括贝叶斯推断、强化学习和高维或组合空间中的随机优化中。然后,我们基于 DCA(凸差算法)设计了一种名为 Markov chain stochastic DCA(MCSDCA)的随机算法,这是一种用于非凸优化的知名方法。我们在渐进和非渐进意义上建立了收敛分析。然后,通过偏微分方程 (PDE) 正则化将 MCSDCA 应用于深度学习,基于过阻尼和欠阻尼朗之万动力学分别构建了两种 MCSDCA 的实现,即 MCSDCA-odLD 和 MCSDCA-udLD。具有各种神经网络拓扑结构的时间序列预测和图像分类问题的数值实验表明了所提出方法的优点。

相似文献

1
Markov chain stochastic DCA and applications in deep learning with PDEs regularization.马尔可夫链随机 DCA 及其在 PDEs 正则化深度学习中的应用。
Neural Netw. 2024 Feb;170:149-166. doi: 10.1016/j.neunet.2023.11.032. Epub 2023 Nov 13.
2
Online Stochastic DCA With Applications to Principal Component Analysis.应用于主成分分析的在线随机判别式坐标下降法
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):7035-7047. doi: 10.1109/TNNLS.2022.3213558. Epub 2024 May 2.
3
Stochastic DCA for minimizing a large sum of DC functions with application to multi-class logistic regression.随机动态规划算法用于最小化大量 DC 函数之和,应用于多类逻辑回归。
Neural Netw. 2020 Dec;132:220-231. doi: 10.1016/j.neunet.2020.08.024. Epub 2020 Sep 2.
4
Stochastic proximal gradient methods for nonconvex problems in Hilbert spaces.希尔伯特空间中非凸问题的随机近端梯度方法。
Comput Optim Appl. 2021;78(3):705-740. doi: 10.1007/s10589-020-00259-y. Epub 2021 Jan 12.
5
A subgradient-based neurodynamic algorithm to constrained nonsmooth nonconvex interval-valued optimization.基于次梯度的神经动力学算法求解约束非光滑非凸区间值优化问题。
Neural Netw. 2023 Mar;160:259-273. doi: 10.1016/j.neunet.2023.01.012. Epub 2023 Jan 20.
6
Stochastic gradient Langevin dynamics with adaptive drifts.具有自适应漂移的随机梯度朗之万动力学
J Stat Comput Simul. 2022;92(2):318-336. doi: 10.1080/00949655.2021.1958812. Epub 2021 Jul 27.
7
Markov chain Monte Carlo inference for Markov jump processes via the linear noise approximation.通过线性噪声逼近对马尔可夫跳跃过程进行马尔可夫链蒙特卡罗推断。
Philos Trans A Math Phys Eng Sci. 2012 Dec 31;371(1984):20110541. doi: 10.1098/rsta.2011.0541. Print 2013 Feb 13.
8
Bayesian polynomial neural networks and polynomial neural ordinary differential equations.贝叶斯多项式神经网络和多项式神经常微分方程。
PLoS Comput Biol. 2024 Oct 10;20(10):e1012414. doi: 10.1371/journal.pcbi.1012414. eCollection 2024 Oct.
9
Deep network embedding with dimension selection.深度网络嵌入与维度选择。
Neural Netw. 2024 Nov;179:106512. doi: 10.1016/j.neunet.2024.106512. Epub 2024 Jul 11.
10
Bayesian restoration of a hidden Markov chain with applications to DNA sequencing.应用于DNA测序的隐马尔可夫链的贝叶斯恢复
J Comput Biol. 1999 Summer;6(2):261-77. doi: 10.1089/cmb.1999.6.261.