Suppr超能文献

带删失数据的Q学习法

Q-LEARNING WITH CENSORED DATA.

作者信息

Goldberg Yair, Kosorok Michael R

机构信息

Department of Biostatistics, The University of North Carolina At Chapel Hill, Chapel Hill, NC 27599, U.S.A.

出版信息

Ann Stat. 2012 Feb 1;40(1):529-560. doi: 10.1214/12-AOS968.

Abstract

We develop methodology for a multistage-decision problem with flexible number of stages in which the rewards are survival times that are subject to censoring. We present a novel Q-learning algorithm that is adjusted for censored data and allows a flexible number of stages. We provide finite sample bounds on the generalization error of the policy learned by the algorithm, and show that when the optimal Q-function belongs to the approximation space, the expected survival time for policies obtained by the algorithm converges to that of the optimal policy. We simulate a multistage clinical trial with flexible number of stages and apply the proposed censored-Q-learning algorithm to find individualized treatment regimens. The methodology presented in this paper has implications in the design of personalized medicine trials in cancer and in other life-threatening diseases.

摘要

我们针对具有灵活阶段数的多阶段决策问题开发了一种方法,其中奖励是受删失影响的生存时间。我们提出了一种新颖的Q学习算法,该算法针对删失数据进行了调整,并允许灵活的阶段数。我们给出了算法学习到的策略的泛化误差的有限样本界,并表明当最优Q函数属于逼近空间时,算法得到的策略的预期生存时间收敛到最优策略的预期生存时间。我们模拟了一个具有灵活阶段数的多阶段临床试验,并应用所提出的删失Q学习算法来寻找个性化治疗方案。本文提出的方法对癌症和其他危及生命疾病的个性化医学试验设计具有重要意义。

相似文献

1
Q-LEARNING WITH CENSORED DATA.带删失数据的Q学习法
Ann Stat. 2012 Feb 1;40(1):529-560. doi: 10.1214/12-AOS968.
4
10

引用本文的文献

10
Reinforcement Learning for Precision Oncology.用于精准肿瘤学的强化学习
Cancers (Basel). 2021 Sep 15;13(18):4624. doi: 10.3390/cancers13184624.

本文引用的文献

9
An overview of statistical learning theory.统计学习理论概述。
IEEE Trans Neural Netw. 1999;10(5):988-99. doi: 10.1109/72.788640.
10
On an exponential bound for the Kaplan-Meier estimator.关于Kaplan-Meier估计量的指数界。
Lifetime Data Anal. 2007 Dec;13(4):481-96. doi: 10.1007/s10985-007-9055-z. Epub 2007 Aug 31.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验