Suppr超能文献

多臂老虎机中先验设定错误和选择性停止的风险。

The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits.

作者信息

Loecher Markus

机构信息

Berlin School of Economics and Law, Berlin, Germany.

出版信息

Front Artif Intell. 2021 Jul 9;4:715690. doi: 10.3389/frai.2021.715690. eCollection 2021.

Abstract

The connection between optimal stopping times of American Options and multi-armed bandits is the subject of active research. This article investigates the effects of optional stopping in a particular class of multi-armed bandit experiments, which randomly allocates observations to arms proportional to the Bayesian posterior probability that each arm is optimal (). The interplay between optional stopping and prior mismatch is examined. We propose a novel partitioning of regret into peri/post testing. We further show a strong dependence of the parameters of interest on the assumed prior probability density.

摘要

美式期权的最优停止时间与多臂老虎机之间的联系是当前积极研究的主题。本文研究了在一类特定的多臂老虎机实验中选择性停止的影响,该实验根据每个臂是最优臂的贝叶斯后验概率,将观测值随机分配到各个臂上。本文还研究了选择性停止与先验不匹配之间的相互作用。我们提出了一种将遗憾分为测试前/测试后的新颖划分方法。我们进一步表明,感兴趣的参数强烈依赖于假定的先验概率密度。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验