• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.基于树的分步调整强化学习在使用测试和治疗观察数据评估嵌套动态治疗方案中的应用。
Stat Med. 2021 Nov 30;40(27):6164-6177. doi: 10.1002/sim.9177. Epub 2021 Sep 7.
2
TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES.基于树的强化学习用于估计最优动态治疗方案
Ann Appl Stat. 2018 Sep;12(3):1914-1938. doi: 10.1214/18-AOAS1137. Epub 2018 Sep 11.
3
Multiobjective tree-based reinforcement learning for estimating tolerant dynamic treatment regimes.基于多目标树的强化学习估计宽容动态治疗方案。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad017.
4
Restricted sub-tree learning to estimate an optimal dynamic treatment regime using observational data.基于观测数据的受限子树学习来估计最优动态治疗规则。
Stat Med. 2021 Nov 20;40(26):5796-5812. doi: 10.1002/sim.9155. Epub 2021 Aug 2.
5
Imputation-based Q-learning for optimizing dynamic treatment regimes with right-censored survival outcome.基于插补的 Q 学习优化右删失生存结局的动态治疗方案。
Biometrics. 2023 Dec;79(4):3676-3689. doi: 10.1111/biom.13872. Epub 2023 May 17.
6
Penalized Spline-Involved Tree-based (PenSIT) Learning for estimating an optimal dynamic treatment regime using observational data.基于惩罚样条的树状结构(PenSIT)学习法,用于利用观测数据估计最优动态治疗方案。
Stat Methods Med Res. 2022 Dec;31(12):2338-2351. doi: 10.1177/09622802221122397. Epub 2022 Oct 3.
7
Estimating tree-based dynamic treatment regimes using observational data with restricted treatment sequences.利用限制治疗序列的观测数据估计基于树的动态治疗规则。
Biometrics. 2023 Sep;79(3):2260-2271. doi: 10.1111/biom.13754. Epub 2022 Oct 9.
8
Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network.通过深度 Q 网络从医疗注册数据中学习动态治疗方案。
Sci Rep. 2019 Feb 6;9(1):1495. doi: 10.1038/s41598-018-37142-0.
9
Optimal dynamic treatment regime estimation using information extraction from unstructured clinical text.利用从非结构化临床文本中提取的信息进行最优动态治疗方案估计。
Biom J. 2022 Apr;64(4):805-817. doi: 10.1002/bimj.202100077. Epub 2022 Feb 3.
10
Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.利用累积数据优化多阶段动态治疗方案。
Stat Med. 2015 Nov 20;34(26):3424-43. doi: 10.1002/sim.6558. Epub 2015 Jun 21.

引用本文的文献

1
Reinforcement Learning and Its Clinical Applications Within Healthcare: A Systematic Review of Precision Medicine and Dynamic Treatment Regimes.强化学习及其在医疗保健领域的临床应用:精准医学与动态治疗方案的系统综述
Healthcare (Basel). 2025 Jul 19;13(14):1752. doi: 10.3390/healthcare13141752.
2
Energy landscape analysis and time-series clustering analysis of patient state multistability related to rheumatoid arthritis drug treatment: The KURAMA cohort study.能量景观分析和与类风湿关节炎药物治疗相关的患者状态多稳定性的时间序列聚类分析:KURAMA 队列研究。
PLoS One. 2024 May 6;19(5):e0302308. doi: 10.1371/journal.pone.0302308. eCollection 2024.

本文引用的文献

1
TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES.基于树的强化学习用于估计最优动态治疗方案
Ann Appl Stat. 2018 Sep;12(3):1914-1938. doi: 10.1214/18-AOAS1137. Epub 2018 Sep 11.
2
Optimizing Prostate Cancer Surveillance: Using Data-driven Models for Informed Decision-making.优化前列腺癌监测:使用数据驱动模型进行明智决策。
Eur Urol. 2019 Jun;75(6):918-919. doi: 10.1016/j.eururo.2018.12.006. Epub 2018 Dec 19.
3
HIGH-DIMENSIONAL A-LEARNING FOR OPTIMAL DYNAMIC TREATMENT REGIMES.用于优化动态治疗方案的高维A学习法
Ann Stat. 2018 Jun;46(3):925-957. doi: 10.1214/17-AOS1570. Epub 2018 May 3.
4
Multi-Objective Markov Decision Processes for Data-Driven Decision Support.用于数据驱动决策支持的多目标马尔可夫决策过程
J Mach Learn Res. 2016;17. Epub 2016 Dec 1.
5
Estimation of the optimal regime in treatment of prostate cancer recurrence from observational data using flexible weighting models.使用灵活加权模型从观察性数据估计前列腺癌复发治疗的最佳方案。
Biometrics. 2017 Jun;73(2):635-645. doi: 10.1111/biom.12621. Epub 2016 Nov 28.
6
Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.用于多阶段多治疗决策的自适应对比度加权学习
Biometrics. 2017 Mar;73(1):145-155. doi: 10.1111/biom.12539. Epub 2016 May 23.
7
New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes.用于估计最优动态治疗方案的新统计学习方法。
J Am Stat Assoc. 2015;110(510):583-598. doi: 10.1080/01621459.2014.937488.
8
Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.利用累积数据优化多阶段动态治疗方案。
Stat Med. 2015 Nov 20;34(26):3424-43. doi: 10.1002/sim.6558. Epub 2015 Jun 21.
9
Dynamic Treatment Regimes.动态治疗方案
Annu Rev Stat Appl. 2014;1:447-464. doi: 10.1146/annurev-statistics-022513-115553.
10
Prostate cancer: measuring PSA.前列腺癌:检测 PSA。
Intern Med J. 2014 May;44(5):433-40. doi: 10.1111/imj.12407.

基于树的分步调整强化学习在使用测试和治疗观察数据评估嵌套动态治疗方案中的应用。

Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

The James Buchanan Brady Urological Institute and Department of Urology, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA.

出版信息

Stat Med. 2021 Nov 30;40(27):6164-6177. doi: 10.1002/sim.9177. Epub 2021 Sep 7.

DOI:10.1002/sim.9177
PMID:34490942
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8595655/
Abstract

Dynamic treatment regimes (DTRs) include a sequence of treatment decision rules, in which treatment is adapted over time in response to the changes in an individual's disease progression and health care history. In medical practice, nested test-and-treat strategies are common to improve cost-effectiveness. For example, for patients at risk of prostate cancer, only patients who have high prostate-specific antigen (PSA) need a biopsy, which is costly and invasive, to confirm the diagnosis and help determine the treatment if needed. A decision about treatment happens after the biopsy, and is thus nested within the decision of whether to do the test. However, current existing statistical methods are not able to accommodate such a naturally embedded property of the treatment decision within the test decision. Therefore, we developed a new statistical learning method, step-adjusted tree-based reinforcement learning, to evaluate DTRs within such a nested multistage dynamic decision framework using observational data. At each step within each stage, we combined the robust semiparametric estimation via augmented inverse probability weighting with a tree-based reinforcement learning method to deal with the counterfactual optimization. The simulation studies demonstrated robust performance of the proposed methods under different scenarios. We further applied our method to evaluate the necessity of prostate biopsy and identify the optimal test-and-treat regimes for prostate cancer patients using data from the Johns Hopkins University prostate cancer active surveillance dataset.

摘要

动态治疗方案(DTR)包括一系列治疗决策规则,其中治疗会随着个体疾病进展和医疗史的变化而进行调整。在医疗实践中,嵌套式测试和治疗策略常用于提高成本效益。例如,对于有前列腺癌风险的患者,只有前列腺特异性抗原(PSA)较高的患者需要进行活检来确诊,并在必要时帮助确定治疗方案,而活检既昂贵又具侵入性。治疗决策是在活检后做出的,因此嵌套在是否进行测试的决策中。然而,当前现有的统计方法无法适应测试决策中治疗决策的这种自然嵌入特性。因此,我们开发了一种新的统计学习方法,即逐步调整基于树的强化学习,以使用观察数据在这种嵌套的多阶段动态决策框架内评估 DTR。在每个阶段的每个步骤中,我们将增强逆概率加权的稳健半参数估计与基于树的强化学习方法相结合,以处理反事实优化问题。模拟研究表明,在不同情况下,所提出的方法具有稳健的性能。我们进一步应用我们的方法来评估前列腺活检的必要性,并使用约翰霍普金斯大学前列腺癌主动监测数据集来确定前列腺癌患者的最佳测试和治疗方案。