• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于优化动态治疗方案的贝叶斯机器学习方法。

A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.

作者信息

Murray Thomas A, Yuan Ying, Thall Peter F

机构信息

Department of Biostatistics, MD Anderson Cancer Center.

出版信息

J Am Stat Assoc. 2018;113(523):1255-1267. doi: 10.1080/01621459.2017.1340887. Epub 2018 Oct 8.

DOI:10.1080/01621459.2017.1340887
PMID:30739965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6366650/
Abstract

Medical therapy often consists of multiple stages, with a treatment chosen by the physician at each stage based on the patient's history of treatments and clinical outcomes. These decisions can be formalized as a dynamic treatment regime. This paper describes a new approach for optimizing dynamic treatment regimes that bridges the gap between Bayesian inference and existing approaches, like Q-learning. The proposed approach fits a series of Bayesian regression models, one for each stage, in reverse sequential order. Each model uses as a response variable the remaining payoff assuming optimal actions are taken at subsequent stages, and as covariates the current history and relevant actions at that stage. The key difficulty is that the optimal decision rules at subsequent stages are unknown, and even if these decision rules were known the relevant response variables may be counterfactual. However, posterior distributions can be derived from the previously fitted regression models for the optimal decision rules and the counterfactual response variables under a particular set of rules. The proposed approach averages over these posterior distributions when fitting each regression model. An efficient sampling algorithm for estimation is presented, along with simulation studies that compare the proposed approach with Q-learning.

摘要

医学治疗通常包括多个阶段,医生在每个阶段会根据患者的治疗史和临床结果选择一种治疗方法。这些决策可以形式化为动态治疗方案。本文描述了一种优化动态治疗方案的新方法,该方法弥合了贝叶斯推断与现有方法(如Q学习)之间的差距。所提出的方法以反向顺序拟合一系列贝叶斯回归模型,每个阶段一个。每个模型将假设在后续阶段采取最优行动时的剩余收益作为响应变量,并将当前历史和该阶段的相关行动作为协变量。关键困难在于后续阶段的最优决策规则是未知的,而且即使这些决策规则已知,相关的响应变量也可能是反事实的。然而,在特定的一组规则下,可以从先前拟合的回归模型中推导出最优决策规则和反事实响应变量的后验分布。所提出的方法在拟合每个回归模型时对这些后验分布进行平均。还提出了一种用于估计的有效抽样算法,以及将所提出的方法与Q学习进行比较的模拟研究。

相似文献

1
A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.一种用于优化动态治疗方案的贝叶斯机器学习方法。
J Am Stat Assoc. 2018;113(523):1255-1267. doi: 10.1080/01621459.2017.1340887. Epub 2018 Oct 8.
2
Dynamic Treatment Regimes Using Bayesian Additive Regression Trees for Censored Outcomes.基于贝叶斯加性回归树的删失结局动态治疗方案。
Lifetime Data Anal. 2024 Jan;30(1):181-212. doi: 10.1007/s10985-023-09605-8. Epub 2023 Sep 2.
3
Decision making and uncertainty quantification for individualized treatments using Bayesian Additive Regression Trees.基于贝叶斯加法回归树的个体化治疗的决策制定与不确定性量化。
Stat Methods Med Res. 2019 Apr;28(4):1079-1093. doi: 10.1177/0962280217746191. Epub 2017 Dec 18.
4
C-learning: A new classification framework to estimate optimal dynamic treatment regimes.C学习法:一种用于估计最优动态治疗方案的新分类框架。
Biometrics. 2018 Sep;74(3):891-899. doi: 10.1111/biom.12836. Epub 2017 Dec 11.
5
Bayesian inference for optimal dynamic treatment regimes in practice.贝叶斯推断在实践中最优动态治疗方案的应用。
Int J Biostat. 2023 May 17;19(2):309-331. doi: 10.1515/ijb-2022-0073. eCollection 2023 Nov 1.
6
Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.用于多阶段多治疗决策的自适应对比度加权学习
Biometrics. 2017 Mar;73(1):145-155. doi: 10.1111/biom.12539. Epub 2016 May 23.
7
Multiobjective tree-based reinforcement learning for estimating tolerant dynamic treatment regimes.基于多目标树的强化学习估计宽容动态治疗方案。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad017.
8
Interactive Q-learning for Quantiles.用于分位数的交互式Q学习
J Am Stat Assoc. 2017;112(518):638-649. doi: 10.1080/01621459.2016.1155993. Epub 2017 Mar 31.
9
Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, Part I: main content.用于估计最优动态治疗方案的动态模式边际结构均值模型,第一部分:主要内容。
Int J Biostat. 2010;6(2):Article 8.
10
Entropy Learning for Dynamic Treatment Regimes.动态治疗方案的熵学习
Stat Sin. 2019;29(4):1633-1655. doi: 10.5705/ss.202018.0076.

引用本文的文献

1
Sparse 2-stage Bayesian meta-analysis for individualized treatments.用于个体化治疗的稀疏两阶段贝叶斯荟萃分析。
Biometrics. 2025 Jul 3;81(3). doi: 10.1093/biomtc/ujaf082.
2
Optimising dynamic treatment regimens using sequential multiple assignment randomised trials data with missing data.利用带有缺失数据的序贯多组分配随机试验数据优化动态治疗方案
BMC Med Res Methodol. 2025 Jul 1;25(1):162. doi: 10.1186/s12874-025-02595-1.
3
A Bayesian Approach to the G-Formula via Iterative Conditional Regression.一种通过迭代条件回归实现G公式的贝叶斯方法。

本文引用的文献

1
Bayesian Nonparametric Estimation for Dynamic Treatment Regimes with Sequential Transition Times.具有顺序转移时间的动态治疗方案的贝叶斯非参数估计
J Am Stat Assoc. 2016;111(515):921-935. doi: 10.1080/01621459.2015.1086353. Epub 2016 Oct 18.
2
Nonparametric survival analysis using Bayesian Additive Regression Trees (BART).使用贝叶斯加法回归树(BART)进行非参数生存分析。
Stat Med. 2016 Jul 20;35(16):2741-53. doi: 10.1002/sim.6893. Epub 2016 Feb 7.
3
Bayesian Dose-Finding in Two Treatment Cycles Based on the Joint Utility of Efficacy and Toxicity.
Stat Med. 2025 Jun;44(13-14):e70123. doi: 10.1002/sim.70123.
4
A Bayesian multivariate hierarchical model for developing a treatment benefit index using mixed types of outcomes.利用混合类型结局构建治疗获益指数的贝叶斯多元层次模型。
BMC Med Res Methodol. 2024 Sep 27;24(1):218. doi: 10.1186/s12874-024-02333-z.
5
Multiobjective tree-based reinforcement learning for estimating tolerant dynamic treatment regimes.基于多目标树的强化学习估计宽容动态治疗方案。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad017.
6
Percutaneous Coronary Intervention Mortality, Cost, Complications, and Disparities after Radiation Therapy: Artificial Intelligence-Augmented, Cost Effectiveness, and Computational Ethical Analysis.放射治疗后的经皮冠状动脉介入治疗死亡率、成本、并发症及差异:人工智能增强、成本效益及计算伦理分析
J Cardiovasc Dev Dis. 2023 Oct 30;10(11):445. doi: 10.3390/jcdd10110445.
7
Designing Optimal, Data-Driven Policies from Multisite Randomized Trials.从多中心随机试验中设计最佳的数据驱动政策。
Psychometrika. 2023 Dec;88(4):1171-1196. doi: 10.1007/s11336-023-09937-2. Epub 2023 Oct 24.
8
Interpreting Randomized Controlled Trials.解读随机对照试验
Cancers (Basel). 2023 Sep 22;15(19):4674. doi: 10.3390/cancers15194674.
9
Dynamic Treatment Regimes Using Bayesian Additive Regression Trees for Censored Outcomes.基于贝叶斯加性回归树的删失结局动态治疗方案。
Lifetime Data Anal. 2024 Jan;30(1):181-212. doi: 10.1007/s10985-023-09605-8. Epub 2023 Sep 2.
10
Bayesian Index Models for Heterogeneous Treatment Effects on a Binary Outcome.用于二元结局的异质性治疗效果的贝叶斯指数模型。
Stat Biosci. 2023;15(2):397-418. doi: 10.1007/s12561-023-09370-0. Epub 2023 May 19.
基于疗效和毒性联合效用的两治疗周期贝叶斯剂量探索
J Am Stat Assoc. 2015 Jun 1;110(510):711-722. doi: 10.1080/01621459.2014.926815.
4
New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes.用于估计最优动态治疗方案的新统计学习方法。
J Am Stat Assoc. 2015;110(510):583-598. doi: 10.1080/01621459.2014.937488.
5
Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.利用累积数据优化多阶段动态治疗方案。
Stat Med. 2015 Nov 20;34(26):3424-43. doi: 10.1002/sim.6558. Epub 2015 Jun 21.
6
Dynamic Treatment Regimes.动态治疗方案
Annu Rev Stat Appl. 2014;1:447-464. doi: 10.1146/annurev-statistics-022513-115553.
7
Dynamic treatment regimes: technical challenges and applications.动态治疗方案:技术挑战与应用
Electron J Stat. 2014;8(1):1225-1272. doi: 10.1214/14-ejs920.
8
Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions.用于序贯治疗决策的最优动态治疗方案的稳健估计。
Biometrika. 2013;100(3). doi: 10.1093/biomet/ast014.
9
Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme.使用自适应n选m自助法对最优动态治疗方案进行推断。
Biometrics. 2013 Sep;69(3):714-23. doi: 10.1111/biom.12052. Epub 2013 Jul 11.
10
Q-learning: a data analysis method for constructing adaptive interventions.Q 学习:一种构建自适应干预措施的数据分析方法。
Psychol Methods. 2012 Dec;17(4):478-94. doi: 10.1037/a0029373. Epub 2012 Oct 1.