Beyerlein Andreas, Fahrmeir Ludwig, Mansmann Ulrich, Toschke André M
Ludwig-Maximilians University of Munich, Division of Pediatric Epidemiology, Institute of Social Pediatrics and Adolescent Medicine, Munich, Germany.
BMC Med Res Methodol. 2008 Sep 8;8:59. doi: 10.1186/1471-2288-8-59.
Body mass index (BMI) data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations.
Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs), quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS). We analyzed data of 4967 children participating in the school entry health examination in Bavaria, Germany, from 2001 to 2002. TV watching, meal frequency, breastfeeding, smoking in pregnancy, maternal obesity, parental social class and weight gain in the first 2 years of life were considered as risk factors for obesity.
GAMLSS showed a much better fit regarding the estimation of risk factors effects on transformed and untransformed BMI data than common GLMs with respect to the generalized Akaike information criterion. In comparison with GAMLSS, quantile regression allowed for additional interpretation of prespecified distribution quantiles, such as quantiles referring to overweight or obesity. The variables TV watching, maternal BMI and weight gain in the first 2 years were directly, and meal frequency was inversely significantly associated with body composition in any model type examined. In contrast, smoking in pregnancy was not directly, and breastfeeding and parental social class were not inversely significantly associated with body composition in GLM models, but in GAMLSS and partly in quantile regression models. Risk factor specific BMI percentile curves could be estimated from GAMLSS and quantile regression models.
GAMLSS and quantile regression seem to be more appropriate than common GLMs for risk factor modeling of BMI data.
体重指数(BMI)数据通常具有偏态分布,对于此类数据,常见的统计建模方法(如简单线性或逻辑回归)存在局限性。
通过拟合优度测量和解释方法比较了不同的回归方法来预测儿童BMI,包括广义线性模型(GLM)、分位数回归以及位置、尺度和形状广义相加模型(GAMLSS)。我们分析了2001年至2002年在德国巴伐利亚州参加入学健康检查的4967名儿童的数据。看电视、进餐频率、母乳喂养、孕期吸烟、母亲肥胖、父母社会阶层以及生命最初2年的体重增加被视为肥胖的风险因素。
就广义赤池信息准则而言,与常见的GLM相比,GAMLSS在估计风险因素对转换和未转换的BMI数据的影响方面拟合效果要好得多。与GAMLSS相比,分位数回归允许对预先指定的分布分位数进行额外解释,例如指超重或肥胖的分位数。在所检查的任何模型类型中,看电视、母亲BMI和最初2年的体重增加与身体组成直接相关,进餐频率与身体组成呈显著负相关。相比之下,孕期吸烟与身体组成无直接关联,母乳喂养和父母社会阶层在GLM模型中与身体组成无显著负相关,但在GAMLSS模型和部分分位数回归模型中有此关联。可从GAMLSS和分位数回归模型估计特定风险因素的BMI百分位数曲线。
对于BMI数据进行风险因素建模,GAMLSS和分位数回归似乎比常见的GLM更合适。