Zhang Y, Glynn R J, Felson D T
Boston University Arthritis Center, Boston University Medical Center, MA 02118, USA.
J Rheumatol. 1996 Jul;23(7):1130-4.
To illustrate newly developed statistical methods in analysis of correlated binary outcome data in musculoskeletal (MSK) disease.
We applied 3 alternative statistical approaches to evaluate the relation of several risk factors to presence of knee osteoarthritis using data from the Framingham Osteoarthritis Study. The methods were (1) an ordinary logistic regression model using each knee as an independent unit of observation; (2) an ordinary logistic regression model treating each person rather than the knee as the unit of analysis; and (3) generalized estimating equation (GEE) and polychotomous logistic regression (PCHLE) using each knee as the unit of analysis but accounting for the correlation between fellow knees. We discuss the advantages and disadvantages of each method with respect to validity, precision, and interpretability.
The GEE and PCHLE models had clear advantages. They simultaneously evaluated the effects of person specific and knee specific risk factors, increased precision, enhanced the interpretability of variables, and provided new insights about how risk factors act.
While the choice of statistical approach depends critically on the scientific question of interest, the GEE and PCHLE approaches will often be optimal in assessments of factors associated with MSK conditions affecting multiple correlated sites within the body, especially when the interest of the study focuses on site specific risk factors.
阐述肌肉骨骼(MSK)疾病相关二元结局数据分析中新开发的统计方法。
我们应用3种替代统计方法,利用弗明汉骨关节炎研究的数据评估多种风险因素与膝关节骨关节炎存在情况之间的关系。这些方法分别为:(1)以每个膝关节作为独立观察单位的普通逻辑回归模型;(2)以每个人而非膝关节作为分析单位的普通逻辑回归模型;(3)以每个膝关节作为分析单位但考虑同侧膝关节之间相关性的广义估计方程(GEE)和多分类逻辑回归(PCHLE)。我们从有效性、精确性和可解释性方面讨论了每种方法的优缺点。
GEE和PCHLE模型具有明显优势。它们同时评估了个体特异性和膝关节特异性风险因素的影响,提高了精确性,增强了变量的可解释性,并提供了关于风险因素作用方式的新见解。
虽然统计方法的选择主要取决于感兴趣的科学问题,但在评估与影响身体内多个相关部位的MSK疾病相关因素时,GEE和PCHLE方法通常是最佳选择,尤其是当研究重点关注部位特异性风险因素时。