Meisner Allison, Parikh Chirag R, Kerr Kathleen F
Department of Biostatistics, Johns Hopkins University, Baltimore, Maryland, USA.
Division of Nephrology, Johns Hopkins University, Baltimore, Maryland, USA.
Stat Med. 2020 Oct 30;39(24):3412-3426. doi: 10.1002/sim.8673. Epub 2020 Aug 13.
Motivated by a study of acute kidney injury, we consider the setting of biomarker studies involving patients at multiple centers where the goal is to develop a biomarker combination for diagnosis, prognosis, or screening. As biomarker studies become larger, this type of data structure will be encountered more frequently. In the presence of multiple centers, one way to assess the predictive capacity of a given combination is to consider the center-adjusted area under the receiver operating characteristic curve (aAUC), a summary of the ability of the combination to discriminate between cases and controls in each center. Rather than using a general method, such as logistic regression, to construct the biomarker combination, we propose directly maximizing the aAUC. Furthermore, it may be desirable to have a biomarker combination with similar performance across centers. To that end, we allow for penalization of the variability in the center-specific AUCs. We demonstrate desirable asymptotic properties of the resulting combinations. Simulations provide small-sample evidence that maximizing the aAUC can lead to combinations with improved performance. We also use simulated data to illustrate the utility of constructing combinations by maximizing the aAUC while penalizing variability. Finally, we apply these methods to data from the study of acute kidney injury.
受一项关于急性肾损伤研究的启发,我们考虑生物标志物研究的情形,该研究涉及多个中心的患者,目标是开发用于诊断、预后或筛查的生物标志物组合。随着生物标志物研究规模的扩大,这种数据结构会更频繁地出现。在存在多个中心的情况下,评估给定组合预测能力的一种方法是考虑中心调整后的受试者工作特征曲线下面积(aAUC),它总结了该组合在每个中心区分病例和对照的能力。我们不是使用诸如逻辑回归等通用方法来构建生物标志物组合,而是建议直接最大化aAUC。此外,可能希望有一个在各中心具有相似性能的生物标志物组合。为此,我们允许对各中心特定AUC的变异性进行惩罚。我们证明了所得组合具有理想的渐近性质。模拟提供了小样本证据,表明最大化aAUC可以得到性能更优的组合。我们还使用模拟数据来说明在惩罚变异性的同时通过最大化aAUC构建组合的效用。最后,我们将这些方法应用于急性肾损伤研究的数据。