Garcia-Huidobro Diego, Michael Oakes J
Department of Family Medicine, School of Medicine, Pontificia Universidad Catolica de Chile, Santiago, Chile.
Department of Family Social Science, University of Minnesota, Saint Paul, MN, USA.
Int J Psychol. 2017 Apr;52(2):96-105. doi: 10.1002/ijop.12275. Epub 2016 Apr 20.
Randomised controlled trials (RCTs) are typically viewed as the gold standard for causal inference. This is because effects of interest can be identified with the fewest assumptions, especially imbalance in background characteristics. Yet because conducting RCTs are expensive, time consuming and sometimes unethical, observational studies are frequently used to study causal associations. In these studies, imbalance, or confounding, is usually controlled with multiple regression, which entails strong assumptions. The purpose of this manuscript is to describe strengths and weaknesses of several methods to control for confounding in observational studies, and to demonstrate their use in cross-sectional dataset that use patient registration data from the Juan Pablo II Primary Care Clinic in La Pintana-Chile. The dataset contains responses from 5855 families who provided complete information on family socio-demographics, family functioning and health problems among their family members. We employ regression adjustment, stratification, restriction, matching, propensity score matching, standardisation and inverse probability weighting to illustrate the approaches to better causal inference in non-experimental data and compare results. By applying study design and data analysis techniques that control for confounding in different ways than regression adjustment, researchers may strengthen the scientific relevance of observational studies.
随机对照试验(RCTs)通常被视为因果推断的黄金标准。这是因为可以用最少的假设来识别感兴趣的效应,尤其是背景特征方面的不平衡。然而,由于进行随机对照试验成本高昂、耗时且有时不道德,观察性研究经常被用于研究因果关联。在这些研究中,不平衡或混杂通常通过多元回归来控制,而这需要很强的假设。本手稿的目的是描述观察性研究中几种控制混杂的方法的优缺点,并展示它们在横断面数据集中的应用,该数据集使用了智利拉平塔纳市胡安·巴勃罗二世初级保健诊所的患者登记数据。该数据集包含5855个家庭的回复,这些家庭提供了关于家庭社会人口统计学、家庭功能以及家庭成员健康问题的完整信息。我们采用回归调整、分层、限制、匹配倾向得分匹配、标准化和逆概率加权来说明在非实验数据中进行更好因果推断的方法,并比较结果。通过应用以与回归调整不同的方式控制混杂的研究设计和数据分析技术,研究人员可以增强观察性研究的科学相关性。