Department of Health Behavior, The State University of New York at Buffalo, Buffalo, NY 14214-8028, United States.
Addict Behav. 2010 Jun;35(6):558-63. doi: 10.1016/j.addbeh.2010.01.002. Epub 2010 Feb 1.
Analyzing data that arises from correlated observations such as husband-wife pairs, siblings, or repeated assessments of the same individuals over time requires more specialized analytic tools. Additionally, outcomes that are not normally distributed such as count data, (e.g., number of symptoms or number of problems endorsed) also require specialized analytic tools. Generalized estimating equations (GEE) are a very flexible tool for dealing with correlated data (such as data derived from related individuals such as families). The objective of this report was to compare traditional ordinary least squares regression (OLS) to a GEE approach for analyzing family data.
Using data from an ongoing five-wave longitudinal study of newlywed couples, we examined a subset of 173 families with children between the ages of 4 and 11 at two data collection points. The relation between parental risk factors (e.g., heavy drinking, aggression, marital quality) and child internalizing symptoms was examined within the context of two regression-based models: traditional OLS regression and a GEE approach.
Overall, the GEE approach allowed a more complete use of the available data, provided more robust findings, and produced more reliable parameter estimates.
GEE models are a flexible regression-based approach for dealing with related data that arises from correlated data such as family data. Further, given the availability of the models in common statistical programs, family researchers should consider these models for their work.
分析相关观察数据,如夫妻、兄弟姐妹或同一人群随时间的重复评估,需要更专业的分析工具。此外,像计数数据(例如,症状或问题的数量)这样的非正态分布的结果也需要专门的分析工具。广义估计方程(GEE)是一种非常灵活的工具,可用于处理相关数据(例如来自相关个体的数据,如家庭)。本报告的目的是比较传统的普通最小二乘法回归(OLS)和 GEE 方法在分析家庭数据中的应用。
使用正在进行的新婚夫妇五波纵向研究的数据,我们在两个数据收集点检查了 173 个有 4 至 11 岁儿童的家庭的一个子集。在两个基于回归的模型中,考察了父母风险因素(例如,大量饮酒、攻击、婚姻质量)与儿童内化症状之间的关系:传统的 OLS 回归和 GEE 方法。
总体而言,GEE 方法允许更完整地利用可用数据,提供更稳健的发现,并产生更可靠的参数估计。
GEE 模型是一种灵活的基于回归的方法,用于处理相关数据,这些数据源于相关数据,如家庭数据。此外,鉴于这些模型在常见统计程序中的可用性,家庭研究人员应该考虑在他们的工作中使用这些模型。