Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA.
Int J Epidemiol. 2024 Oct 13;53(6). doi: 10.1093/ije/dyae139.
We outline a geometric perspective on causal inference in cohort studies that can help epidemiologists understand the role of standardization in controlling for confounding. For simplicity, we focus on a binary exposure X, a binary outcome D, and a binary confounder C that is not causally affected by X. Rothman diagrams plot the risk of disease in the unexposed on the x-axis and the risk in the exposed on the y-axis. The crude risks define a point in the unit square, and the stratum-specific risks at each level of C define two other points in the unit square. Standardization produces points along the line segment connecting the stratum-specific points. When there is confounding by C, the crude point is off this line segment. The set of all possible crude points is a rectangle with corners at the stratum-specific points and sides parallel to the axes. When there are more than two strata, standardization produces points in the convex hull of the stratum-specific points, and there is confounding if the crude point is outside this convex hull. We illustrate these ideas using data from a study in Newcastle, United Kingdom, in which the causal effect of smoking on 20-year mortality was confounded by age.
我们概述了一种在队列研究中进行因果推断的几何视角,这有助于流行病学家理解标准化在控制混杂因素方面的作用。为了简单起见,我们重点关注二元暴露因素 X、二元结果 D 和不受 X 因果影响的二元混杂因素 C。Rothman 图将未暴露组的疾病风险绘制在 x 轴上,将暴露组的风险绘制在 y 轴上。粗风险定义了单位正方形中的一个点,每个 C 水平的分层特异性风险定义了单位正方形中的另外两个点。标准化会在分层特异性点之间的线段上生成点。当 C 存在混杂时,粗风险点会偏离这条线段。所有可能的粗风险点集是一个矩形,其角在分层特异性点处,边与坐标轴平行。当存在多个分层时,标准化会在分层特异性点的凸包内生成点,如果粗风险点在凸包之外,则存在混杂。我们使用来自英国纽卡斯尔的一项研究的数据来说明这些想法,该研究中吸烟对 20 年死亡率的因果效应受到年龄的混杂。