• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习:作为统计数据同化问题的深度学习。

Machine Learning: Deepest Learning as Statistical Data Assimilation Problems.

作者信息

Abarbanel Henry D I, Rozdeba Paul J, Shirman Sasha

机构信息

Marine Physical Laboratory, Scripps Institution of Oceanography, and Department of Physics, University of California, San Diego, La Jolla, CA 92093-0374, U.S.A.

Department of Physics, University of California, San Diego, La Jolla, CA 92093-0374, U.S.A.

出版信息

Neural Comput. 2018 Aug;30(8):2025-2055. doi: 10.1162/neco_a_01094. Epub 2018 Jun 12.

DOI:10.1162/neco_a_01094
PMID:29894650
Abstract

We formulate an equivalence between machine learning and the formulation of statistical data assimilation as used widely in physical and biological sciences. The correspondence is that layer number in a feedforward artificial network setting is the analog of time in the data assimilation setting. This connection has been noted in the machine learning literature. We add a perspective that expands on how methods from statistical physics and aspects of Lagrangian and Hamiltonian dynamics play a role in how networks can be trained and designed. Within the discussion of this equivalence, we show that adding more layers (making the network deeper) is analogous to adding temporal resolution in a data assimilation framework. Extending this equivalence to recurrent networks is also discussed. We explore how one can find a candidate for the global minimum of the cost functions in the machine learning context using a method from data assimilation. Calculations on simple models from both sides of the equivalence are reported. Also discussed is a framework in which the time or layer label is taken to be continuous, providing a differential equation, the Euler-Lagrange equation and its boundary conditions, as a necessary condition for a minimum of the cost function. This shows that the problem being solved is a two-point boundary value problem familiar in the discussion of variational methods. The use of continuous layers is denoted "deepest learning." These problems respect a symplectic symmetry in continuous layer phase space. Both Lagrangian versions and Hamiltonian versions of these problems are presented. Their well-studied implementation in a discrete time/layer, while respecting the symplectic structure, is addressed. The Hamiltonian version provides a direct rationale for backpropagation as a solution method for a certain two-point boundary value problem.

摘要

我们阐述了机器学习与物理和生物科学中广泛使用的统计数据同化公式之间的等价关系。对应关系是,前馈人工网络设置中的层数类似于数据同化设置中的时间。这种联系在机器学习文献中已有提及。我们补充了一个观点,即统计物理方法以及拉格朗日和哈密顿动力学的各个方面如何在网络的训练和设计中发挥作用。在讨论这种等价关系时,我们表明增加更多层(使网络更深)类似于在数据同化框架中增加时间分辨率。还讨论了将这种等价关系扩展到递归网络的情况。我们探索如何使用数据同化方法在机器学习背景下找到成本函数全局最小值的候选值。报告了对等价关系两侧简单模型的计算。还讨论了一个框架,其中时间或层标签被视为连续的,从而提供一个微分方程、欧拉 - 拉格朗日方程及其边界条件,作为成本函数最小值的必要条件。这表明正在解决的问题是变分方法讨论中熟悉的两点边值问题。连续层的使用被称为“深度最深学习”。这些问题在连续层相空间中遵循辛对称性。给出了这些问题的拉格朗日版本和哈密顿版本。讨论了它们在离散时间/层中经过充分研究的实现方式,同时尊重辛结构。哈密顿版本为反向传播作为解决特定两点边值问题的方法提供了直接的理论依据。

相似文献

1
Machine Learning: Deepest Learning as Statistical Data Assimilation Problems.机器学习:作为统计数据同化问题的深度学习。
Neural Comput. 2018 Aug;30(8):2025-2055. doi: 10.1162/neco_a_01094. Epub 2018 Jun 12.
2
Systematic variational method for statistical nonlinear state and parameter estimation.用于统计非线性状态和参数估计的系统变分方法。
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Nov;92(5):052901. doi: 10.1103/PhysRevE.92.052901. Epub 2015 Nov 2.
3
Ensemble-based kernel learning for a class of data assimilation problems with imperfect forward simulators.基于集成的核学习方法在一类具有不完全正演模拟的同化问题中的应用。
PLoS One. 2019 Jul 11;14(7):e0219247. doi: 10.1371/journal.pone.0219247. eCollection 2019.
4
Combining data assimilation and machine learning to build data-driven models for unknown long time dynamics-Applications in cardiovascular modeling.结合数据同化和机器学习,构建用于未知长时间动力学的基于数据的模型——在心血管建模中的应用。
Int J Numer Method Biomed Eng. 2021 Jul;37(7):e3471. doi: 10.1002/cnm.3471. Epub 2021 Jun 6.
5
Where do features come from?特征从何而来?
Cogn Sci. 2014 Aug;38(6):1078-101. doi: 10.1111/cogs.12049. Epub 2013 Jun 25.
6
The Principle of Covariance and the Hamiltonian Formulation of General Relativity.协变原理与广义相对论的哈密顿表述
Entropy (Basel). 2021 Feb 10;23(2):215. doi: 10.3390/e23020215.
7
Learning earth system models from observations: machine learning or data assimilation?从观测中学习地球系统模型:机器学习还是数据同化?
Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200089. doi: 10.1098/rsta.2020.0089. Epub 2021 Feb 15.
8
Bayesian machine learning for quantum molecular dynamics.用于量子分子动力学的贝叶斯机器学习
Phys Chem Chem Phys. 2019 Jun 26;21(25):13392-13410. doi: 10.1039/c9cp01883b.
9
Feedforward Approximations to Dynamic Recurrent Network Architectures.动态递归网络架构的前馈近似
Neural Comput. 2018 Feb;30(2):546-567. doi: 10.1162/neco_a_01042. Epub 2017 Nov 21.
10
Hamiltonian neural networks for solving equations of motion.用于求解运动方程的哈密顿神经网络。
Phys Rev E. 2022 Jun;105(6-2):065305. doi: 10.1103/PhysRevE.105.065305.

引用本文的文献

1
Reconstructing computational system dynamics from neural data with recurrent neural networks.基于递归神经网络从神经数据中重建计算系统动态。
Nat Rev Neurosci. 2023 Nov;24(11):693-710. doi: 10.1038/s41583-023-00740-7. Epub 2023 Oct 4.
2
Identifying nonlinear dynamical systems via generative recurrent neural networks with applications to fMRI.通过生成式递归神经网络识别非线性动力系统及其在 fMRI 中的应用。
PLoS Comput Biol. 2019 Aug 21;15(8):e1007263. doi: 10.1371/journal.pcbi.1007263. eCollection 2019 Aug.