Helfenstein U, Steiner M, Menghini G
Department of Biostatistics, University of Zurich, Switzerland.
Community Dent Health. 1997 Dec;14(4):221-6.
Ordinary multiple regression and logistic multiple regression are widely applied statistical methods which allow a researcher to 'explain' or 'predict' a response variable from a set of explanatory variables or predictors. In these models it is usually assumed that quantitative predictors such as age enter linearly into the model.
During recent years these methods have been further developed to allow more flexibility in the way explanatory variables 'act' on a response variable. The methods are called 'generalised additive models' (GAM). The rigid linear terms characterising the association between response and predictors are replaced in an optimal way by flexible curved functions of the predictors (the 'profiles'). Plotting the 'profiles' allows the researcher to visualise easily the shape by which predictors 'act' over the whole range of values. The method facilitates detection of particular shapes such as 'bumps', 'U-shapes', 'J-shapes, 'threshold values' etc. Information about the shape of the association is not revealed by traditional methods. The shapes of the profiles may be checked by performing a Monte Carlo simulation ('bootstrapping').
After the presentation of the GAM a relevant case study is presented in order to demonstrate application and use of the method. The dependence of caries in primary teeth on a set of explanatory variables is investigated. Since GAMs may not be easily accessible to dentists, this article presents them in an introductory condensed form. It was thought that a nonmathematical summary and a worked example might encourage readers to consider the methods described.
GAMs may be of great value to dentists in allowing visualisation of the shape by which predictors 'act' and obtaining a better understanding of the complex relationships between predictors and response.
普通多元回归和逻辑多元回归是广泛应用的统计方法,可让研究人员从一组解释变量或预测变量中“解释”或“预测”响应变量。在这些模型中,通常假定年龄等定量预测变量以线性方式进入模型。
近年来,这些方法得到了进一步发展,以使解释变量对响应变量“起作用”的方式更具灵活性。这些方法被称为“广义相加模型”(GAM)。表征响应与预测变量之间关联的刚性线性项以最佳方式被预测变量的灵活曲线函数(“轮廓”)所取代。绘制“轮廓”可让研究人员轻松直观地看到预测变量在整个取值范围内“起作用”的形状。该方法有助于检测特定形状,如“凸起”“U形”“J形”“阈值”等。传统方法无法揭示有关关联形状的信息。轮廓的形状可通过进行蒙特卡罗模拟(“自展法”)来检验。
在介绍广义相加模型之后,给出一个相关案例研究以演示该方法的应用。研究乳牙龋齿对一组解释变量的依赖性。由于牙医可能不太容易接触到广义相加模型,本文以简介浓缩的形式呈现它们。认为非数学性的总结和一个实例可能会鼓励读者考虑所描述的方法。
广义相加模型对牙医可能具有很大价值,可让他们直观看到预测变量“起作用”的形状,并更好地理解预测变量与响应之间的复杂关系。