Lewsey J D, Gilthorpe M S, Bulman J S, Bedi R
Biostatistics Unit, Eastman Dental Institute for Oral Health Care Services, University College London, United Kingdom.
Community Dent Health. 2000 Dec;17(4):212-7.
To introduce and encourage the use of generalised linear models (GLMs) in analysing caries data that do not require the response to be treated necessarily as a sample from a normal distribution.
At the present time, it is most likely that the sampling distribution of dmf/DMF in industrialised countries will not approximate normality. Generalised linear modelling can be conducted assuming many underlying distributions which, in fact, includes the normal distribution. In this paper three GLMs are employed (normal, Poisson, negative binomial) for modelling an example caries data set. In addition, a binomial model is used to model the dichotomous outcome of caries-free/caries-present.
The data comprised 871 Old Trafford, Manchester primary school children aged between 4 years 0 months and 5 years 11 months.
The effect of one study covariate was prominent in a normal model applied to all available dmf data but not in two non-normal models which used dmf > 0 data only. Furthermore, the same covariate was significant at the 5% level in a binomial model indicating that it influenced whether or not caries was present and not the level of dmf.
A suitable modelling approach for caries data is to employ a Poisson or a negative binomial model for the dmf/DMF response and a binomial model for the caries-free/caries-present outcome. This allows separate estimation of those factors which influence the magnitude of caries and those factors which influence whether caries is actually present or not.
介绍并鼓励使用广义线性模型(GLMs)分析龋病数据,这类模型不要求将反应必然视为来自正态分布的样本。
目前,在工业化国家,dmf/DMF的抽样分布很可能不接近正态分布。广义线性建模可以在假设许多潜在分布的情况下进行,实际上这些分布包括正态分布。本文采用三种广义线性模型(正态、泊松、负二项式)对一个龋病数据集示例进行建模。此外,使用二项式模型对无龋/患龋的二分结果进行建模。
数据包括871名来自曼彻斯特老特拉福德的4岁0个月至5岁11个月的小学生。
在应用于所有可用dmf数据的正态模型中,一个研究协变量的效应显著,但在仅使用dmf>0数据的两个非正态模型中不显著。此外,在二项式模型中,相同的协变量在5%水平上显著,表明它影响的是是否患龋,而不是dmf水平。
对于龋病数据,合适的建模方法是对dmf/DMF反应采用泊松或负二项式模型,对无龋/患龋结果采用二项式模型。这样可以分别估计影响龋病严重程度的因素和影响是否实际患龋的因素。