I-BioStat, KU Leuven, Leuven, Belgium.
Janssen Pharmaceutical Companies of Johnson & Johnson, Beerse, Belgium.
Stat Med. 2018 Dec 20;37(29):4525-4538. doi: 10.1002/sim.7939. Epub 2018 Aug 23.
The maximum entropy principle offers a constructive criterion for setting up probability distributions on the basis of partial knowledge. In the present work, the principle is applied to tackle an important problem in the surrogate marker field, namely, the evaluation of a binary outcome as a putative surrogate for a binary true endpoint within a causal inference framework. In the first step, the maximum entropy principle is used to determine the relative frequencies associated with the values of the vector of potential outcomes. Subsequently, in the second step, these relative frequencies are used in combination with two newly proposed metrics of surrogacy, the so-called individual causal association and the surrogate predictive function, to assess the validity of the surrogate. The procedure is conceptually similar to the use of noninformative or reference priors in Bayesian statistics. Additionally, approximate, identifiable bounds are proposed for the estimands of interest, and their performance is studied via simulations. The methods are illustrated using data from a clinical trial involving schizophrenic patients, and a newly developed and user-friendly R package Surrogate is provided to carry out the validation exercise.
最大熵原理为基于部分知识为概率分布建立提供了一个建设性的准则。在本研究中,该原理被应用于解决替代标志物领域的一个重要问题,即在因果推理框架内,将二元结局评估为二元真实结局的潜在替代物。在第一步中,最大熵原理用于确定与潜在结果向量值相关联的相对频率。随后,在第二步中,这些相对频率与两个新提出的替代物度量,即所谓的个体因果关联和替代预测函数,一起用于评估替代物的有效性。该程序在概念上类似于在贝叶斯统计学中使用非信息或参考先验。此外,还提出了感兴趣的估计量的近似可识别界限,并通过模拟研究了它们的性能。该方法使用涉及精神分裂症患者的临床试验数据进行说明,并提供了一个新开发的用户友好的 R 包 Surrogate 来进行验证。