Shook-Sa Bonnie E, Hudgens Michael G, Knittel Andrea K, Edmonds Andrew, Ramirez Catalina, Cole Stephen R, Cohen Mardge, Adedimeji Adebola, Taylor Tonya, Michel Katherine G, Kovacs Andrea, Cohen Jennifer, Donohue Jessica, Foster Antonina, Fischl Margaret A, Long Dustin, Adimora Adaora A
Department of Biostatistics, University of North Carolina at Chapel Hill.
School of Medicine, University of North Carolina at Chapel Hill.
Ann Appl Stat. 2024 Sep;18(3):2147-2165. doi: 10.1214/24-aoas1874. Epub 2024 Aug 5.
Causal inference methods can be applied to estimate the effect of a point exposure or treatment on an outcome of interest using data from observational studies. For example, in the Women's Interagency HIV Study, it is of interest to understand the effects of incarceration on the number of sexual partners and the number of cigarettes smoked after incarceration. In settings like this where the outcome is a count, the estimand is often the causal mean ratio, i.e., the ratio of the counterfactual mean count under exposure to the counterfactual mean count under no exposure. This paper considers estimators of the causal mean ratio based on inverse probability of treatment weights, the parametric g-formula, and doubly robust estimation, each of which can account for overdispersion, zero-inflation, and heaping in the measured outcome. Methods are compared in simulations and are applied to data from the Women's Interagency HIV Study.
因果推断方法可用于利用观察性研究的数据,估计某一点暴露或治疗对感兴趣结局的影响。例如,在女性机构间HIV研究中,了解监禁对性伴侣数量以及监禁后吸烟数量的影响很有意义。在这样的情形中,当结局是一个计数时,估计量通常是因果均值比,即暴露下的反事实均值计数与无暴露下的反事实均值计数之比。本文考虑基于治疗权重的逆概率、参数化g公式和双重稳健估计的因果均值比估计量,每一种估计量都可以解释测量结局中的过度离散、零膨胀和堆积现象。在模拟中对这些方法进行了比较,并将其应用于女性机构间HIV研究的数据。