精准医学时代的工具：如何使用Q学习从队列和登记数据中制定高度个性化的治疗建议。

Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

作者信息

Krakow Elizabeth F, Hemmer Michael, Wang Tao, Logan Brent, Arora Mukta, Spellman Stephen, Couriel Daniel, Alousi Amin, Pidala Joseph, Last Michael, Lachance Silvy, Moodie Erica E M

出版信息

Am J Epidemiol. 2017 Jul 15;186(2):160-172. doi: 10.1093/aje/kwx027.

DOI:10.1093/aje/kwx027

PMID:28472335

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6664807/

Abstract

Q-learning is a method of reinforcement learning that employs backwards stagewise estimation to identify sequences of actions that maximize some long-term reward. The method can be applied to sequential multiple-assignment randomized trials to develop personalized adaptive treatment strategies (ATSs)-longitudinal practice guidelines highly tailored to time-varying attributes of individual patients. Sometimes, the basis for choosing which ATSs to include in a sequential multiple-assignment randomized trial (or randomized controlled trial) may be inadequate. Nonrandomized data sources may inform the initial design of ATSs, which could later be prospectively validated. In this paper, we illustrate challenges involved in using nonrandomized data for this purpose with a case study from the Center for International Blood and Marrow Transplant Research registry (1995-2007) aimed at 1) determining whether the sequence of therapeutic classes used in graft-versus-host disease prophylaxis and in refractory graft-versus-host disease is associated with improved survival and 2) identifying donor and patient factors with which to guide individualized immunosuppressant selections over time. We discuss how to communicate the potential benefit derived from following an ATS at the population and subgroup levels and how to evaluate its robustness to modeling assumptions. This worked example may serve as a model for developing ATSs from registries and cohorts in oncology and other fields requiring sequential treatment decisions.

摘要

Q学习是一种强化学习方法，它采用反向逐步估计来识别能使某些长期奖励最大化的行动序列。该方法可应用于序贯多重分配随机试验，以制定个性化自适应治疗策略（ATS）——高度针对个体患者随时间变化的特征量身定制的纵向实践指南。有时，在序贯多重分配随机试验（或随机对照试验）中选择纳入哪些ATS的依据可能并不充分。非随机数据源可为ATS的初始设计提供信息，随后可对其进行前瞻性验证。在本文中，我们通过国际血液和骨髓移植研究中心登记处（1995 - 2007年）的一个案例研究，阐述了为此目的使用非随机数据所涉及的挑战，该研究旨在：1）确定移植物抗宿主病预防和难治性移植物抗宿主病中使用的治疗类别序列是否与生存率提高相关；2）识别随着时间推移可用于指导个体化免疫抑制剂选择的供体和患者因素。我们讨论了如何在总体和亚组层面传达遵循ATS所带来的潜在益处，以及如何评估其对建模假设的稳健性。这个实例可作为一个模型，用于从肿瘤学及其他需要序贯治疗决策的领域的登记处和队列中开发ATS。

相似文献

Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

Am J Epidemiol. 2017 Jul 15;186(2):160-172. doi: 10.1093/aje/kwx027.

Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network.

Sci Rep. 2019 Feb 6;9(1):1495. doi: 10.1038/s41598-018-37142-0.

Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.

Healthc Inform. 2017 Aug;2017:380-385. doi: 10.1109/ICHI.2017.45.

Comparison of graft-versus-host disease-free, relapse-free survival according to a variety of graft sources: antithymocyte globulin and single cord blood provide favorable outcomes in some subgroups.

Haematologica. 2016 Dec;101(12):1592-1602. doi: 10.3324/haematol.2016.149427. Epub 2016 Aug 4.

Comparison of Characteristics and Outcomes of Trial Participants and Nonparticipants: Example of Blood and Marrow Transplant Clinical Trials Network 0201 Trial.

Biol Blood Marrow Transplant. 2015 Oct;21(10):1815-22. doi: 10.1016/j.bbmt.2015.06.004. Epub 2015 Jun 11.

LIBERTI: A SMART study in plastic surgery.

Clin Trials. 2018 Jun;15(3):286-293. doi: 10.1177/1740774518762435. Epub 2018 Mar 25.

Machine learning applications and challenges in graft-versus-host disease: a scoping review.

Curr Opin Oncol. 2023 Nov 1;35(6):594-600. doi: 10.1097/CCO.0000000000000996. Epub 2023 Sep 1.

Improved graft-versus-host disease-free, relapse-free survival associated with bone marrow as the stem cell source in adults.

Haematologica. 2016 Jun;101(6):764-72. doi: 10.3324/haematol.2015.138990. Epub 2016 Apr 1.

Design of sequentially randomized trials for testing adaptive treatment strategies.

Stat Med. 2016 Mar 15;35(6):840-58. doi: 10.1002/sim.6747. Epub 2015 Sep 27.

Doubly-robust dynamic treatment regimen estimation via weighted least squares.

Biometrics. 2015 Sep;71(3):636-44. doi: 10.1111/biom.12306. Epub 2015 Apr 8.

引用本文的文献

Reinforcement Learning and Its Clinical Applications Within Healthcare: A Systematic Review of Precision Medicine and Dynamic Treatment Regimes.

Healthcare (Basel). 2025 Jul 19;13(14):1752. doi: 10.3390/healthcare13141752.

Using Pilot Data for Power Analysis of Observational Studies for the Estimation of Dynamic Treatment Regimes.

Obs Stud. 2023;9(4):25-48. doi: 10.1353/obs.2023.a906627.

The impact of using reinforcement learning to personalize communication on medication adherence: findings from the REINFORCE trial.

NPJ Digit Med. 2024 Feb 19;7(1):39. doi: 10.1038/s41746-024-01028-5.

Dynamic Treatment Regimes Using Bayesian Additive Regression Trees for Censored Outcomes.

Lifetime Data Anal. 2024 Jan;30(1):181-212. doi: 10.1007/s10985-023-09605-8. Epub 2023 Sep 2.

Prediction and recommendation by machine learning through repetitive internal validation for hepatic veno-occlusive disease/sinusoidal obstruction syndrome and early death after allogeneic hematopoietic cell transplantation.

Bone Marrow Transplant. 2022 Apr;57(4):538-546. doi: 10.1038/s41409-022-01583-z. Epub 2022 Jan 24.

REinforcement learning to improve non-adherence for diabetes treatments by Optimising Response and Customising Engagement (REINFORCE): study protocol of a pragmatic randomised trial.

BMJ Open. 2021 Dec 3;11(12):e052091. doi: 10.1136/bmjopen-2021-052091.

Reinforcement Learning for Precision Oncology.

Cancers (Basel). 2021 Sep 15;13(18):4624. doi: 10.3390/cancers13184624.

A scoping review of studies using observational data to optimise dynamic treatment regimens.

BMC Med Res Methodol. 2021 Feb 22;21(1):39. doi: 10.1186/s12874-021-01211-2.

Artificial Intelligence in Dermatology: A Practical Introduction to a Paradigm Shift.

Indian Dermatol Online J. 2020 Nov 8;11(6):881-889. doi: 10.4103/idoj.IDOJ_388_20. eCollection 2020 Nov-Dec.

Can the Risk of Severe Depression-Related Outcomes Be Reduced by Tailoring the Antidepressant Therapy to Patient Characteristics?

Am J Epidemiol. 2021 Jul 1;190(7):1210-1219. doi: 10.1093/aje/kwaa260.

本文引用的文献

Predictive Bayesian inference and dynamic treatment regimes.

Biom J. 2015 Nov;57(6):941-58. doi: 10.1002/bimj.201400153. Epub 2015 Aug 11.

Doubly-robust dynamic treatment regimen estimation via weighted least squares.

Biometrics. 2015 Sep;71(3):636-44. doi: 10.1111/biom.12306. Epub 2015 Apr 8.

Causal Inference Under Multiple Versions of Treatment.

J Causal Inference. 2013 May 1;1(1):1-20. doi: 10.1515/jci-2012-0002.

SMART designs in cancer research: Past, present, and future.

Clin Trials. 2014 Aug;11(4):445-456. doi: 10.1177/1740774514525691. Epub 2014 Apr 14.

Q-learning for estimating optimal dynamic treatment rules from observational data.

Can J Stat. 2012 Dec 1;40(4):629-645. doi: 10.1002/cjs.11162. Epub 2012 Nov 7.

Does antithymocyte globulin have a place in reduced-intensity conditioning for allogeneic hematopoietic stem cell transplantation?

Hematology Am Soc Hematol Educ Program. 2012;2012:246-50. doi: 10.1182/asheducation-2012.1.246.

Analysis of multi-stage treatments for recurrent diseases.

Stat Med. 2012 Oct 30;31(24):2805-21. doi: 10.1002/sim.5456. Epub 2012 Jul 24.

Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, Part I: main content.

Int J Biostat. 2010;6(2):Article 8.

Impact of immune modulation with anti-T-cell antibodies on the outcome of reduced-intensity allogeneic hematopoietic stem cell transplantation for hematologic malignancies.

Blood. 2011 Jun 23;117(25):6963-70. doi: 10.1182/blood-2011-01-332007. Epub 2011 Apr 4.

Variable Selection for Qualitative Interactions.

Stat Methodol. 2011 Jan 30;1(8):42-55. doi: 10.1016/j.stamet.2009.05.003.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

精准医学时代的工具：如何使用Q学习从队列和登记数据中制定高度个性化的治疗建议。

Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

作者信息

Krakow Elizabeth F, Hemmer Michael, Wang Tao, Logan Brent, Arora Mukta, Spellman Stephen, Couriel Daniel, Alousi Amin, Pidala Joseph, Last Michael, Lachance Silvy, Moodie Erica E M

出版信息

Am J Epidemiol. 2017 Jul 15;186(2):160-172. doi: 10.1093/aje/kwx027.

DOI:10.1093/aje/kwx027

PMID:28472335

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6664807/

Abstract

摘要

精准医学时代的工具：如何使用Q学习从队列和登记数据中制定高度个性化的治疗建议。

Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

作者信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

精准医学时代的工具：如何使用Q学习从队列和登记数据中制定高度个性化的治疗建议。

Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

作者信息

出版信息