NCSU, Raleigh, NC, USA.
Purdue Pharma L.P., Stamford, CT, USA.
Stat Med. 2018 Apr 30;37(9):1407-1418. doi: 10.1002/sim.7566. Epub 2018 Feb 21.
There is growing interest and investment in precision medicine as a means to provide the best possible health care. A treatment regime formalizes precision medicine as a sequence of decision rules, one per clinical intervention period, that specify if, when and how current treatment should be adjusted in response to a patient's evolving health status. It is standard to define a regime as optimal if, when applied to a population of interest, it maximizes the mean of some desirable clinical outcome, such as efficacy. However, in many clinical settings, a high-quality treatment regime must balance multiple competing outcomes; eg, when a high dose is associated with substantial symptom reduction but a greater risk of an adverse event. We consider the problem of estimating the most efficacious treatment regime subject to constraints on the risk of adverse events. We combine nonparametric Q-learning with policy-search to estimate a high-quality yet parsimonious treatment regime. This estimator applies to both observational and randomized data, as well as settings with variable, outcome-dependent follow-up, mixed treatment types, and multiple time points. This work is motivated by and framed in the context of dosing for chronic pain; however, the proposed framework can be applied generally to estimate a treatment regime which maximizes the mean of one primary outcome subject to constraints on one or more secondary outcomes. We illustrate the proposed method using data pooled from 5 open-label flexible dosing clinical trials for chronic pain.
人们对精准医学越来越感兴趣并投入其中,将其作为提供最佳医疗保健的一种手段。治疗方案将精准医学形式化为一系列决策规则,每个规则对应一个临床干预期,规定应如何根据患者不断变化的健康状况调整当前治疗。如果将一种治疗方案应用于感兴趣的人群中,能使某种理想的临床结果(如疗效)的平均值最大化,则将其定义为最优方案是很常见的。然而,在许多临床环境中,高质量的治疗方案必须平衡多种相互竞争的结果;例如,当高剂量与显著的症状减轻相关联,但却伴随着更大的不良事件风险时。我们考虑了在限制不良事件风险的情况下,估计最有效的治疗方案的问题。我们将非参数 Q 学习与策略搜索相结合,以估计高质量但简约的治疗方案。该估计器适用于观察性和随机数据,以及具有可变、依赖于结果的随访、混合治疗类型和多个时间点的设置。这项工作的动机和框架是为慢性疼痛的剂量设定;然而,所提出的框架可以一般地应用于估计一种治疗方案,该方案可以使一个主要结果的平均值最大化,同时限制一个或多个次要结果。我们使用来自 5 项慢性疼痛开放标签灵活剂量临床试验的数据进行了演示。