使用少量样本进行贝叶斯回归和稀有事件统计的输出加权最优采样。

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples.

作者信息

Sapsis Themistoklis P

机构信息

Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA.

出版信息

Proc Math Phys Eng Sci. 2020 Feb;476(2234):20190834. doi: 10.1098/rspa.2019.0834. Epub 2020 Feb 19.

DOI:10.1098/rspa.2019.0834

PMID:32201483

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7069488/

Abstract

For many important problems the quantity of interest is an unknown function of the parameters, which is a random vector with known statistics. Since the dependence of the output on this random vector is unknown, the challenge is to identify its statistics, using the minimum number of function evaluations. This problem can be seen in the context of active learning or optimal experimental design. We employ Bayesian regression to represent the derived model uncertainty due to finite and small number of input-output pairs. In this context we evaluate existing methods for optimal sample selection, such as model error minimization and mutual information maximization. We show that for the case of known output variance, the commonly employed criteria in the literature do not take into account the output values of the existing input-output pairs, while for the case of unknown output variance this dependence can be very weak. We introduce a criterion that takes into account the values of the output for the existing samples and adaptively selects inputs from regions of the parameter space which have an important contribution to the output. The new method allows for application to high-dimensional inputs, paving the way for optimal experimental design in high dimensions.

摘要

对于许多重要问题，感兴趣的量是参数的未知函数，该参数是具有已知统计量的随机向量。由于输出对该随机向量的依赖性未知，挑战在于使用最少数量的函数评估来识别其统计量。这个问题可以在主动学习或最优实验设计的背景下看到。我们采用贝叶斯回归来表示由于有限且少量的输入 - 输出对而产生的模型不确定性。在此背景下，我们评估现有的最优样本选择方法，例如模型误差最小化和互信息最大化。我们表明，对于已知输出方差的情况，文献中常用的标准没有考虑现有输入 - 输出对的输出值，而对于未知输出方差的情况，这种依赖性可能非常弱。我们引入一种考虑现有样本输出值的标准，并从对输出有重要贡献的参数空间区域中自适应地选择输入。新方法允许应用于高维输入，为高维最优实验设计铺平了道路。

相似文献

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples.使用少量样本进行贝叶斯回归和稀有事件统计的输出加权最优采样。

Proc Math Phys Eng Sci. 2020 Feb;476(2234):20190834. doi: 10.1098/rspa.2019.0834. Epub 2020 Feb 19.

Sequential sampling strategy for extreme event statistics in nonlinear dynamical systems.非线性动力系统中极端事件统计的序贯抽样策略。

Proc Natl Acad Sci U S A. 2018 Oct 30;115(44):11138-11143. doi: 10.1073/pnas.1813263115. Epub 2018 Oct 16.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Optimal criteria and their asymptotic form for data selection in data-driven reduced-order modelling with Gaussian process regression.高斯过程回归数据驱动降阶建模中数据选择的最优准则及其渐近形式。

Philos Trans A Math Phys Eng Sci. 2022 Aug 8;380(2229):20210197. doi: 10.1098/rsta.2021.0197. Epub 2022 Jun 20.

Applications of Monte Carlo Simulation in Modelling of Biochemical Processes蒙特卡罗模拟在生化过程建模中的应用

Active Learning of Bayesian Linear Models with High-Dimensional Binary Features by Parameter Confidence-Region Estimation.基于参数置信区域估计的高维二值特征贝叶斯线性模型的主动学习。

Neural Comput. 2020 Oct;32(10):1998-2031. doi: 10.1162/neco_a_01310. Epub 2020 Aug 14.

Case studies in Bayesian microbial risk assessments.贝叶斯微生物风险评估案例研究。

Environ Health. 2009 Dec 21;8 Suppl 1(Suppl 1):S19. doi: 10.1186/1476-069X-8-S1-S19.

On the Complexity of Logistic Regression Models.Logistic 回归模型的复杂性。

Neural Comput. 2019 Aug;31(8):1592-1623. doi: 10.1162/neco_a_01207. Epub 2019 Jul 1.

Active learning for adaptive surrogate model improvement in high-dimensional problems.用于高维问题中自适应代理模型改进的主动学习。

Struct Multidiscipl Optim. 2024;67(7):122. doi: 10.1007/s00158-024-03816-9. Epub 2024 Jul 10.

Bayesian Input Design for Linear Dynamical Model Discrimination.用于线性动态模型辨别的贝叶斯输入设计

Entropy (Basel). 2019 Mar 30;21(4):351. doi: 10.3390/e21040351.

引用本文的文献

Information FOMO: The Unhealthy Fear of Missing Out on Information-A Method for Removing Misleading Data for Healthier Models.信息错失恐惧症：对错过信息的不健康恐惧——一种去除误导性数据以建立更健康模型的方法。

Entropy (Basel). 2024 Sep 30;26(10):835. doi: 10.3390/e26100835.

Explaining People's Worry Levels During the Covid-19 Pandemic: An Analysis of Socio-Economic and Cultural Dimensions.解释新冠疫情期间人们的担忧程度：社会经济和文化维度分析

Front Psychol. 2021 Oct 6;12:737917. doi: 10.3389/fpsyg.2021.737917. eCollection 2021.

本文引用的文献

A robotic Intelligent Towing Tank for learning complex fluid-structure dynamics.一种用于学习复杂流固动力学的机器人智能拖曳水池。

Sci Robot. 2019 Nov 27;4(36). doi: 10.1126/scirobotics.aay5063.

Statistical dynamical model to predict extreme events and anomalous features in shallow water waves with abrupt depth change.具有突变水深的浅水波中极端事件和异常特征的统计动力模型预测。

Proc Natl Acad Sci U S A. 2019 Mar 5;116(10):3982-3987. doi: 10.1073/pnas.1820467116. Epub 2019 Feb 13.

Sequential sampling strategy for extreme event statistics in nonlinear dynamical systems.非线性动力系统中极端事件统计的序贯抽样策略。

Proc Natl Acad Sci U S A. 2018 Oct 30;115(44):11138-11143. doi: 10.1073/pnas.1813263115. Epub 2018 Oct 16.

New perspectives for the prediction and statistical quantification of extreme events in high-dimensional dynamical systems.高维动力系统中极端事件预测与统计量化的新视角。

Philos Trans A Math Phys Eng Sci. 2018 Aug 28;376(2127). doi: 10.1098/rsta.2017.0133.

A variational approach to probing extreme events in turbulent dynamical systems.一种探测湍流动力系统中极端事件的变分方法。

Sci Adv. 2017 Sep 22;3(9):e1701533. doi: 10.1126/sciadv.1701533. eCollection 2017 Sep.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用少量样本进行贝叶斯回归和稀有事件统计的输出加权最优采样。

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples.

作者信息

Sapsis Themistoklis P

机构信息

Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA.

出版信息

Proc Math Phys Eng Sci. 2020 Feb;476(2234):20190834. doi: 10.1098/rspa.2019.0834. Epub 2020 Feb 19.

DOI:10.1098/rspa.2019.0834

PMID:32201483

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7069488/

Abstract

摘要

使用少量样本进行贝叶斯回归和稀有事件统计的输出加权最优采样。

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

使用少量样本进行贝叶斯回归和稀有事件统计的输出加权最优采样。

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples.

作者信息

机构信息

出版信息