• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Proper Inference for Value Function in High-Dimensional Q-Learning for Dynamic Treatment Regimes.动态治疗方案的高维Q学习中价值函数的正确推断
J Am Stat Assoc. 2019;114(527):1404-1417. doi: 10.1080/01621459.2018.1506341. Epub 2018 Oct 29.
2
HIGH-DIMENSIONAL A-LEARNING FOR OPTIMAL DYNAMIC TREATMENT REGIMES.用于优化动态治疗方案的高维A学习法
Ann Stat. 2018 Jun;46(3):925-957. doi: 10.1214/17-AOS1570. Epub 2018 May 3.
3
Bayesian inference for optimal dynamic treatment regimes in practice.贝叶斯推断在实践中最优动态治疗方案的应用。
Int J Biostat. 2023 May 17;19(2):309-331. doi: 10.1515/ijb-2022-0073. eCollection 2023 Nov 1.
4
Entropy Learning for Dynamic Treatment Regimes.动态治疗方案的熵学习
Stat Sin. 2019;29(4):1633-1655. doi: 10.5705/ss.202018.0076.
5
A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.一种用于优化动态治疗方案的贝叶斯机器学习方法。
J Am Stat Assoc. 2018;113(523):1255-1267. doi: 10.1080/01621459.2017.1340887. Epub 2018 Oct 8.
6
Constructing dynamic treatment regimes with shared parameters for censored data.为删失数据构建具有共享参数的动态治疗方案。
Stat Med. 2020 Apr 30;39(9):1250-1263. doi: 10.1002/sim.8473. Epub 2020 Jan 17.
7
Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme.使用自适应n选m自助法对最优动态治疗方案进行推断。
Biometrics. 2013 Sep;69(3):714-23. doi: 10.1111/biom.12052. Epub 2013 Jul 11.
8
Dynamic Treatment Regimes.动态治疗方案
Annu Rev Stat Appl. 2014;1:447-464. doi: 10.1146/annurev-statistics-022513-115553.
9
On optimal treatment regimes selection for mean survival time.关于平均生存时间的最优治疗方案选择
Stat Med. 2015 Mar 30;34(7):1169-84. doi: 10.1002/sim.6397. Epub 2014 Dec 16.
10
Resampling-based confidence intervals for model-free robust inference on optimal treatment regimes.基于重抽样的无模型稳健推断最优治疗方案的置信区间。
Biometrics. 2021 Jun;77(2):465-476. doi: 10.1111/biom.13337. Epub 2020 Aug 21.

引用本文的文献

1
Ranking tailoring variables for constructing individualized treatment rules: an application to schizophrenia.构建个体化治疗规则的排序调整变量:在精神分裂症中的应用
J R Stat Soc Ser C Appl Stat. 2022 Mar;71(2):309-330. doi: 10.1111/rssc.12533. Epub 2022 Mar 20.
2
Noninferiority and equivalence tests in sequential, multiple assignment, randomized trials (SMARTs).序贯、多次分配、随机试验(SMARTs)中的非劣效性和等效性检验。
Psychol Methods. 2020 Apr;25(2):182-205. doi: 10.1037/met0000232. Epub 2019 Sep 9.

本文引用的文献

1
STATISTICAL INFERENCE FOR THE MEAN OUTCOME UNDER A POSSIBLY NON-UNIQUE OPTIMAL TREATMENT STRATEGY.在可能非唯一的最优治疗策略下对平均结果的统计推断。
Ann Stat. 2016 Apr;44(2):713-742. doi: 10.1214/15-AOS1384. Epub 2016 Mar 17.
2
Penalized Q-Learning for Dynamic Treatment Regimens.用于动态治疗方案的惩罚性Q学习
Stat Sin. 2015 Jul;25(3):901-920. doi: 10.5705/ss.2012.364.
3
Dynamic treatment regimes: technical challenges and applications.动态治疗方案:技术挑战与应用
Electron J Stat. 2014;8(1):1225-1272. doi: 10.1214/14-ejs920.
4
Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme.使用自适应n选m自助法对最优动态治疗方案进行推断。
Biometrics. 2013 Sep;69(3):714-23. doi: 10.1111/biom.12052. Epub 2013 Jul 11.
5
Non-Concave Penalized Likelihood with NP-Dimensionality.具有NP维数的非凹惩罚似然法
IEEE Trans Inf Theory. 2011 Aug;57(8):5467-5484. doi: 10.1109/TIT.2011.2158486.
6
PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES.个性化治疗规则的性能保证
Ann Stat. 2011 Apr 1;39(2):1180-1210. doi: 10.1214/10-AOS864.
7
Estimating Optimal Dynamic Regimes: Correcting Bias under the Null: [Optimal dynamic regimes: bias correction].估计最优动态策略:在原假设下校正偏差:[最优动态策略:偏差校正]
Scand Stat Theory Appl. 2009 Sep 22;37(1):126-146. doi: 10.1111/j.1467-9469.2009.00661.x.
8
Inference for non-regular parameters in optimal dynamic treatment regimes.最优动态治疗方案中非正则参数的推断。
Stat Methods Med Res. 2010 Jun;19(3):317-43. doi: 10.1177/0962280209105013. Epub 2009 Jul 16.
9
Sequenced treatment alternatives to relieve depression (STAR*D): rationale and design.缓解抑郁症的序贯治疗方案(STAR*D):原理与设计
Control Clin Trials. 2004 Feb;25(1):119-42. doi: 10.1016/s0197-2456(03)00112-0.
10
Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study.缓解抑郁症的序贯治疗方案(STAR*D)研究的背景与原理
Psychiatr Clin North Am. 2003 Jun;26(2):457-94, x. doi: 10.1016/s0193-953x(02)00107-7.

动态治疗方案的高维Q学习中价值函数的正确推断

Proper Inference for Value Function in High-Dimensional Q-Learning for Dynamic Treatment Regimes.

作者信息

Zhu Wensheng, Zeng Donglin, Song Rui

机构信息

Key Laboratory for Applied Statistics of MOE,School of Mathematics and Statistics, Northeast Normal University, Changchun 130024, China (

Departments of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA (

出版信息

J Am Stat Assoc. 2019;114(527):1404-1417. doi: 10.1080/01621459.2018.1506341. Epub 2018 Oct 29.

DOI:10.1080/01621459.2018.1506341
PMID:31929664
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6953729/
Abstract

Dynamic treatment regimes are a set of decision rules and each treatment decision is tailored over time according to patients' responses to previous treatments as well as covariate history. There is a growing interest in development of correct statistical inference for optimal dynamic treatment regimes to handle the challenges of non-regularity problems in the presence of non-respondents who have zero-treatment effects, especially when the dimension of the tailoring variables is high. In this paper, we propose a high-dimensional Q-learning (HQ-learning) to facilitate the inference of optimal values and parameters. The proposed method allows us to simultaneously estimate the optimal dynamic treatment regimes and select the important variables that truly contribute to the individual reward. At the same time, hard thresholding is introduced in the method to eliminate the effects of the non-respondents. The asymptotic properties for the parameter estimators as well as the estimated optimal value function are then established by adjusting the bias due to thresholding. Both simulation studies and real data analysis demonstrate satisfactory performance for obtaining the proper inference for the value function for the optimal dynamic treatment regimes.

摘要

动态治疗方案是一组决策规则,每个治疗决策会随着时间推移,根据患者对先前治疗的反应以及协变量历史进行调整。针对最优动态治疗方案,如何进行正确的统计推断以应对存在零治疗效果的无反应者情况下的非正则性问题挑战,尤其是在定制变量维度较高时,人们的兴趣日益浓厚。在本文中,我们提出了一种高维Q学习(HQ学习)方法,以促进对最优值和参数的推断。所提出的方法使我们能够同时估计最优动态治疗方案,并选择真正对个体奖励有贡献的重要变量。同时,该方法引入了硬阈值化来消除无反应者的影响。通过调整阈值化引起的偏差,建立了参数估计器以及估计的最优值函数的渐近性质。模拟研究和实际数据分析均表明,在获得最优动态治疗方案价值函数的正确推断方面,该方法具有令人满意的性能。