Q 学习：一种构建自适应干预措施的数据分析方法。

Q-learning: a data analysis method for constructing adaptive interventions.

机构信息

Institute for Social Research, University of Michigan, Ann Arbor, MI 48106, USA.

出版信息

Psychol Methods. 2012 Dec;17(4):478-94. doi: 10.1037/a0029373. Epub 2012 Oct 1.

DOI:10.1037/a0029373

PMID:23025434

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3747013/

Abstract

Increasing interest in individualizing and adapting intervention services over time has led to the development of adaptive interventions. Adaptive interventions operationalize the individualization of a sequence of intervention options over time via the use of decision rules that input participant information and output intervention recommendations. We introduce Q-learning, which is a generalization of regression analysis to settings in which a sequence of decisions regarding intervention options or services is made. The use of Q is to indicate that this method is used to assess the relative quality of the intervention options. In particular, we use Q-learning with linear regression to estimate the optimal (i.e., most effective) sequence of decision rules. We illustrate how Q-learning can be used with data from sequential multiple assignment randomized trials (SMARTs; Murphy, 2005) to inform the construction of a more deeply tailored sequence of decision rules than those embedded in the SMART design. We also discuss the advantages of Q-learning compared to other data analysis approaches. Finally, we use the Adaptive Interventions for Children With ADHD SMART study (Center for Children and Families, University at Buffalo, State University of New York, William E. Pelham as principal investigator) for illustration.

摘要

随着人们对干预服务个性化和适应性的兴趣日益增加，适应性干预措施应运而生。适应性干预措施通过使用决策规则来实现干预选项的个性化和适应性，这些决策规则输入参与者信息并输出干预建议。我们引入了 Q 学习，它是回归分析在涉及干预选项或服务的一系列决策中的推广。使用 Q 表示该方法用于评估干预选项的相对质量。具体来说，我们使用 Q 学习和线性回归来估计最优（即最有效）的决策规则序列。我们展示了如何使用来自序贯多项随机试验（SMARTs；Murphy，2005）的数据来使用 Q 学习来通知构建比 SMART 设计中嵌入的决策规则更深入定制的决策规则序列。我们还讨论了 Q 学习与其他数据分析方法相比的优势。最后，我们使用注意力缺陷多动障碍儿童适应性干预措施 SMART 研究（纽约州立大学布法罗分校儿童与家庭中心，William E. Pelham 为首席研究员）进行说明。

相似文献

Q-learning: a data analysis method for constructing adaptive interventions.

Psychol Methods. 2012 Dec;17(4):478-94. doi: 10.1037/a0029373. Epub 2012 Oct 1.

Experimental design and primary data analysis methods for comparing adaptive interventions.

Psychol Methods. 2012 Dec;17(4):457-477. doi: 10.1037/a0029372. Epub 2012 Oct 1.

Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia.

Stat Med. 2016 Jun 15;35(13):2221-34. doi: 10.1002/sim.6859. Epub 2016 Jan 10.

A SMART data analysis method for constructing adaptive treatment strategies for substance use disorders.

Addiction. 2017 May;112(5):901-909. doi: 10.1111/add.13743. Epub 2017 Feb 18.

Dynamic treatment regimes for managing chronic health conditions: a statistical perspective.

Am J Public Health. 2011 Jan;101(1):40-5. doi: 10.2105/AJPH.2010.198937. Epub 2010 Nov 18.

SMART longitudinal analysis: A tutorial for using repeated outcome measures from SMART studies to compare adaptive interventions.

Psychol Methods. 2020 Feb;25(1):1-29. doi: 10.1037/met0000219. Epub 2019 Jul 18.

Noninferiority and equivalence tests in sequential, multiple assignment, randomized trials (SMARTs).

Psychol Methods. 2020 Apr;25(2):182-205. doi: 10.1037/met0000232. Epub 2019 Sep 9.

A Data Analysis Method for Using Longitudinal Binary Outcome Data from a SMART to Compare Adaptive Interventions.

Multivariate Behav Res. 2019 Sep-Oct;54(5):613-636. doi: 10.1080/00273171.2018.1558042. Epub 2019 Jan 20.

A multiple imputation strategy for sequential multiple assignment randomized trials.

Stat Med. 2014 Oct 30;33(24):4202-14. doi: 10.1002/sim.6223. Epub 2014 Jun 11.

Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

Am J Epidemiol. 2017 Jul 15;186(2):160-172. doi: 10.1093/aje/kwx027.

引用本文的文献

Optimising dynamic treatment regimens using sequential multiple assignment randomised trials data with missing data.

BMC Med Res Methodol. 2025 Jul 1;25(1):162. doi: 10.1186/s12874-025-02595-1.

Simulating A/B testing versus SMART designs for LLM-driven patient engagement to close preventive care gaps.

NPJ Digit Med. 2024 Nov 18;7(1):322. doi: 10.1038/s41746-024-01330-2.

The eACT study design and methods: A sequential, multiple assignment, randomized trial of A novel adherence intervention for youth with epilepsy.

Contemp Clin Trials. 2024 Dec;147:107739. doi: 10.1016/j.cct.2024.107739. Epub 2024 Nov 10.

Reinforcement learning for individualized lung cancer screening schedules: A nested case-control study.

Cancer Med. 2024 Jul;13(13):e7436. doi: 10.1002/cam4.7436.

Learning optimal biomarker-guided treatment policy for chronic disorders.

Stat Med. 2024 Jun 30;43(14):2765-2782. doi: 10.1002/sim.10099. Epub 2024 May 3.

The impact of using reinforcement learning to personalize communication on medication adherence: findings from the REINFORCE trial.

NPJ Digit Med. 2024 Feb 19;7(1):39. doi: 10.1038/s41746-024-01028-5.

Modified interactive Q-learning for attenuating the impact of model misspecification with treatment effect heterogeneity.

Stat Methods Med Res. 2023 Nov;32(11):2240-2253. doi: 10.1177/09622802231206471. Epub 2023 Oct 20.

Outcome trajectory estimation for optimal dynamic treatment regimes with repeated measures.

J R Stat Soc Ser C Appl Stat. 2023 May 22;72(4):976-991. doi: 10.1093/jrsssc/qlad037. eCollection 2023 Aug.

Dynamic Treatment Regimes Using Bayesian Additive Regression Trees for Censored Outcomes.

Lifetime Data Anal. 2024 Jan;30(1):181-212. doi: 10.1007/s10985-023-09605-8. Epub 2023 Sep 2.

Design of experiments with sequential randomizations on multiple timescales: the hybrid experimental design.

Behav Res Methods. 2024 Mar;56(3):1770-1792. doi: 10.3758/s13428-023-02119-z. Epub 2023 May 8.

本文引用的文献

Multiple Imputation for Multivariate Missing-Data Problems: A Data Analyst's Perspective.

Multivariate Behav Res. 1998 Oct 1;33(4):545-71. doi: 10.1207/s15327906mbr3304_5.

Experimental design and primary data analysis methods for comparing adaptive interventions.

Psychol Methods. 2012 Dec;17(4):457-477. doi: 10.1037/a0029372. Epub 2012 Oct 1.

Comparison of variable selection approaches for dynamic treatment regimes.

Int J Biostat. 2010;6(1):Article 6. doi: 10.2202/1557-4679.1178.

Variable Selection for Qualitative Interactions.

Stat Methodol. 2011 Jan 30;1(8):42-55. doi: 10.1016/j.stamet.2009.05.003.

A Simulation Study of Mediated Effect Measures.

Multivariate Behav Res. 1995 Jan 1;30(1):41. doi: 10.1207/s15327906mbr3001_3.

Adaptive Interventions in Drug Court: A Pilot Experiment.

Crim Justice Rev. 2008;33(3):343-360. doi: 10.1177/0734016808320325.

Inference for non-regular parameters in optimal dynamic treatment regimes.

Stat Methods Med Res. 2010 Jun;19(3):317-43. doi: 10.1177/0962280209105013. Epub 2009 Jul 16.

Adaptive designs for randomized trials in public health.

Annu Rev Public Health. 2009;30:1-25. doi: 10.1146/annurev.publhealth.031308.100223.

Evidence-based psychosocial treatments for attention-deficit/hyperactivity disorder.

J Clin Child Adolesc Psychol. 2008 Jan;37(1):184-214. doi: 10.1080/15374410701818681.

An adaptive approach to family intervention: linking engagement in family-centered intervention to reductions in adolescent problem behavior.

J Consult Clin Psychol. 2007 Aug;75(4):568-79. doi: 10.1037/0022-006X.75.4.568.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Q 学习：一种构建自适应干预措施的数据分析方法。

Q-learning: a data analysis method for constructing adaptive interventions.

机构信息

Institute for Social Research, University of Michigan, Ann Arbor, MI 48106, USA.

出版信息

Psychol Methods. 2012 Dec;17(4):478-94. doi: 10.1037/a0029373. Epub 2012 Oct 1.

DOI:10.1037/a0029373

PMID:23025434

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3747013/

Abstract

摘要

Q 学习：一种构建自适应干预措施的数据分析方法。

Q-learning: a data analysis method for constructing adaptive interventions.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Q 学习：一种构建自适应干预措施的数据分析方法。

Q-learning: a data analysis method for constructing adaptive interventions.

机构信息

出版信息