Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control.

Authors

Zhang Wanrong, Mei Yajun

Affiliations

Harvard University.

Georgia Institute of Technology.

Publication information

Technometrics. 2023;65(1):33-43. doi: 10.1080/00401706.2022.2054861. Epub 2022 Apr 22.

Abstract

In many real-world problems that involve real-time monitoring of high-dimensional streaming data, one wants to detect an undesired event or change quickly once it occurs, but under a sampling control constraint: in resource-constrained environments, one might be able to observe or use only selected components of the data for decision-making at each time step. In this paper, we propose incorporating multi-armed bandit approaches into sequential change-point detection to develop an efficient bandit change-point detection algorithm, based on the limiting Bayesian approach, that incorporates prior knowledge of potential changes. Our proposed algorithm, termed Thompson-Sampling-Shiryaev-Roberts-Pollak (TSSRP), consists of two policies per time step: the adaptive sampling policy applies the Thompson Sampling algorithm to balance exploration for acquiring long-term knowledge against exploitation for immediate reward gain, and the statistical decision policy fuses the local Shiryaev-Roberts-Pollak statistics by sum-shrinkage techniques to determine whether to raise a global alarm. Extensive numerical simulations and case studies demonstrate the statistical and computational efficiency of our proposed TSSRP algorithm.
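
The two-policy loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation; the abstract gives no formulas, so the following are all assumptions made for illustration: unit-variance Gaussian streams with pre-change mean 0 and a known post-change mean `mu`; a Shiryaev-Roberts-type recursion in which an unobserved stream uses a likelihood ratio of 1; a pseudo-posterior change probability `R / (R + 1/rho)` with a small hypothetical prior parameter `rho`; and a Thompson-style step that draws a change indicator per stream and observes the `q` highest-ranked streams. The global statistic sums the `r` largest local statistics, as one simple form of sum shrinkage. The function name `tssrp_sketch` and all parameter names are hypothetical.

```python
import math
import random


def tssrp_sketch(data, q, r, threshold, mu=1.0, rho=0.01, seed=0):
    """Return the first time step (1-based) at which a global alarm is raised,
    or None if no alarm occurs.

    data: list of rows, one row of K stream values per time step.
    q: number of streams that may be observed per step (sampling constraint).
    r: number of largest local statistics summed into the global statistic.
    """
    rng = random.Random(seed)
    K = len(data[0])
    R = [0.0] * K  # local Shiryaev-Roberts-type statistics, one per stream

    for t, row in enumerate(data, start=1):
        # Adaptive sampling policy (assumed Thompson-style step): draw a
        # change indicator for each stream from a pseudo-posterior, then
        # rank streams with random tie-breaking and observe the top q.
        scores = []
        for k in range(K):
            p = R[k] / (R[k] + 1.0 / rho)
            indicator = 1.0 if rng.random() < p else 0.0
            scores.append(indicator + 0.5 * rng.random())
        observed = set(sorted(range(K), key=lambda k: -scores[k])[:q])

        # Local update: Gaussian mean-shift likelihood ratio for observed
        # streams, likelihood ratio 1 for unobserved streams (assumption).
        for k in range(K):
            lr = math.exp(mu * row[k] - 0.5 * mu * mu) if k in observed else 1.0
            R[k] = (1.0 + R[k]) * lr

        # Statistical decision policy: sum the r largest local statistics
        # and raise a global alarm once the sum reaches the threshold.
        global_stat = sum(sorted(R)[-r:])
        if global_stat >= threshold:
            return t
    return None
```

In this sketch, unobserved streams still grow by one per step, so a stream that has been ignored for a long time becomes more likely to be sampled again; that is one simple way to keep exploring under the sampling constraint. In practice the threshold would be calibrated, for example by simulation, to meet a false-alarm requirement.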
