Chen Kan, Heng Siyu, Long Qi, Zhang Bo
Graduate Group of Applied Mathematics and Computational Science, School of Arts and Sciences, University of Pennsylvania, Philadelphia, Pennsylvania, U.S.A.
Department of Biostatistics, School of Global Public Health, New York University, New York City, New York, U.S.A.
J Comput Graph Stat. 2023;32(2):528-538. doi: 10.1080/10618600.2022.2116447. Epub 2022 Oct 19.
One central goal of design of observational studies is to embed non-experimental data into an approximate randomized controlled trial using statistical matching. Despite empirical researchers' best intention and effort to create high-quality matched samples, residual imbalance due to observed covariates not being well matched often persists. Although statistical tests have been developed to test the randomization assumption and its implications, few provide a means to quantify the level of residual confounding due to observed covariates not being well matched in matched samples. In this article, we develop two generic classes of exact statistical tests for a biased randomization assumption. One important by-product of our testing framework is a quantity called (RSV), which provides a means to quantify the level of residual confounding due to imperfect matching of observed covariates in a matched sample. We advocate taking into account RSV in the downstream primary analysis. The proposed methodology is illustrated by re-examining a famous observational study concerning the effect of right heart catheterization (RHC) in the initial care of critically ill patients. Code implementing the method can be found in the supplementary materials.
观察性研究设计的一个核心目标是利用统计匹配将非实验数据嵌入到近似随机对照试验中。尽管实证研究人员尽了最大的努力来创建高质量的匹配样本,但由于观察到的协变量没有得到很好的匹配而导致的残余不平衡往往仍然存在。虽然已经开发了统计检验来检验随机化假设及其影响,但很少有检验能提供一种方法来量化由于观察到的协变量在匹配样本中没有得到很好的匹配而导致的残余混杂水平。在本文中,我们针对有偏随机化假设开发了两类精确的统计检验。我们检验框架的一个重要副产品是一个称为(RSV)的量,它提供了一种方法来量化由于匹配样本中观察到的协变量匹配不完善而导致的残余混杂水平。我们主张在下游的主要分析中考虑RSV。通过重新审视一项关于右心导管插入术(RHC)在危重病患者初始护理中的作用的著名观察性研究,对所提出的方法进行了说明。实现该方法的代码可在补充材料中找到。