• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用限制治疗序列的观测数据估计基于树的动态治疗规则。

Estimating tree-based dynamic treatment regimes using observational data with restricted treatment sequences.

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Institute for Social Research, University of Michigan, Ann Arbor, Michigan, USA.

出版信息

Biometrics. 2023 Sep;79(3):2260-2271. doi: 10.1111/biom.13754. Epub 2022 Oct 9.

DOI:10.1111/biom.13754
PMID:36063542
Abstract

A dynamic treatment regime (DTR) is a sequence of decision rules that provide guidance on how to treat individuals based on their static and time-varying status. Existing observational data are often used to generate hypotheses about effective DTRs. A common challenge with observational data, however, is the need for analysts to consider "restrictions" on the treatment sequences. Such restrictions may be necessary for settings where (1) one or more treatment sequences that were offered to individuals when the data were collected are no longer considered viable in practice, (2) specific treatment sequences are no longer available, or (3) the scientific focus of the analysis concerns a specific type of treatment sequences (eg, "stepped-up" treatments). To address this challenge, we propose a restricted tree-based reinforcement learning (RT-RL) method that searches for an interpretable DTR with the maximum expected outcome, given a (set of) user-specified restriction(s), which specifies treatment options (at each stage) that ought not to be considered as part of the estimated tree-based DTR. In simulations, we evaluate the performance of RT-RL versus the standard approach of ignoring the partial data for individuals not following the (set of) restriction(s). The method is illustrated using an observational data set to estimate a two-stage stepped-up DTR for guiding the level of care placement for adolescents with substance use disorder.

摘要

动态治疗方案(DTR)是一系列决策规则,用于根据个体的静态和时变状态提供治疗指导。现有的观察性数据通常用于生成关于有效 DTR 的假设。然而,观察性数据通常存在一个挑战,即分析师需要考虑治疗方案的“限制”。这些限制可能是必要的,例如在以下情况下:(1)在数据收集时提供给个体的一个或多个治疗方案在实践中不再可行;(2)特定的治疗方案不再可用;(3)分析的科学重点关注特定类型的治疗方案(例如“逐步升级”的治疗)。为了解决这个挑战,我们提出了一种受限的基于树的强化学习(RT-RL)方法,该方法在给定(一组)用户指定的限制的情况下,搜索具有最大预期结果的可解释 DTR,该限制指定了在每个阶段不应该考虑的治疗方案(作为估计的基于树的 DTR 的一部分)。在模拟中,我们评估了 RT-RL 与忽略不符合(一组)限制的个体的部分数据的标准方法的性能。该方法使用一个观察性数据集进行了说明,以估计用于指导有物质使用障碍的青少年护理水平的两阶段逐步升级 DTR。

相似文献

1
Estimating tree-based dynamic treatment regimes using observational data with restricted treatment sequences.利用限制治疗序列的观测数据估计基于树的动态治疗规则。
Biometrics. 2023 Sep;79(3):2260-2271. doi: 10.1111/biom.13754. Epub 2022 Oct 9.
2
TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES.基于树的强化学习用于估计最优动态治疗方案
Ann Appl Stat. 2018 Sep;12(3):1914-1938. doi: 10.1214/18-AOAS1137. Epub 2018 Sep 11.
3
Restricted sub-tree learning to estimate an optimal dynamic treatment regime using observational data.基于观测数据的受限子树学习来估计最优动态治疗规则。
Stat Med. 2021 Nov 20;40(26):5796-5812. doi: 10.1002/sim.9155. Epub 2021 Aug 2.
4
Multiobjective tree-based reinforcement learning for estimating tolerant dynamic treatment regimes.基于多目标树的强化学习估计宽容动态治疗方案。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad017.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.基于树的分步调整强化学习在使用测试和治疗观察数据评估嵌套动态治疗方案中的应用。
Stat Med. 2021 Nov 30;40(27):6164-6177. doi: 10.1002/sim.9177. Epub 2021 Sep 7.
7
Q-learning for estimating optimal dynamic treatment rules from observational data.用于从观察数据中估计最优动态治疗规则的Q学习法。
Can J Stat. 2012 Dec 1;40(4):629-645. doi: 10.1002/cjs.11162. Epub 2012 Nov 7.
8
Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.用于多阶段多治疗决策的自适应对比度加权学习
Biometrics. 2017 Mar;73(1):145-155. doi: 10.1111/biom.12539. Epub 2016 May 23.
9
Estimating the causal effect of treatment regimes for organ transplantation.估计器官移植治疗方案的因果效应。
Biometrics. 2018 Dec;74(4):1407-1416. doi: 10.1111/biom.12921. Epub 2018 Jul 10.
10
Penalized Spline-Involved Tree-based (PenSIT) Learning for estimating an optimal dynamic treatment regime using observational data.基于惩罚样条的树状结构(PenSIT)学习法,用于利用观测数据估计最优动态治疗方案。
Stat Methods Med Res. 2022 Dec;31(12):2338-2351. doi: 10.1177/09622802221122397. Epub 2022 Oct 3.

引用本文的文献

1
An Infrastructure Framework for Remote Patient Monitoring Interventions and Research.远程患者监测干预和研究的基础设施框架。
J Med Internet Res. 2024 May 30;26:e51234. doi: 10.2196/51234.