Lee H, Stram D O
Department of Biostatistics, Harvard University, Cambridge, MA, USA.
Am J Hum Genet. 1996 Jan;58(1):213-24.
The present article discusses the use of computational methods based on generalized estimating equations (GEE), as a potential alternative to full maximum-likelihood methods, for performing segregation analysis of continuous phenotypes by using randomly selected family data. The method that we propose can estimate effect and degree of dominance of a major gene in the presence of additional nongenetic or polygenetic familial associations, by relating sample moments to their expectations calculated under the genetic model. It is known that all parameters in basic major-gene models cannot be identified, for estimation purposes, solely in terms of the first two sample moments of data from randomly selected families. Thus, we propose the use of higher (third order) sample moments to resolve this identifiability problem, in a pseudo-profile likelihood estimation scheme. In principle, our methods may be applied to fitting genetic models by using complex pedigrees and for estimation in the presence of missing phenotype data for family members. In order to assess its statistical efficiency we compare several variants of the method with each other and with maximum-likelihood estimates provided by the SAGE computer package in a simulation study.
本文讨论了基于广义估计方程(GEE)的计算方法,作为全最大似然方法的一种潜在替代方法,用于通过使用随机选择的家系数据对连续性状进行分离分析。我们提出的方法可以通过将样本矩与其在遗传模型下计算的期望值相关联,在存在额外的非遗传或多基因家族关联的情况下,估计主基因的效应和显性程度。众所周知,仅根据随机选择家系数据的前两个样本矩,无法从估计目的上识别基本主基因模型中的所有参数。因此,我们提出在伪轮廓似然估计方案中使用更高阶(三阶)样本矩来解决这个可识别性问题。原则上,我们的方法可应用于使用复杂家系拟合遗传模型,以及在家庭成员存在缺失表型数据的情况下进行估计。为了评估其统计效率,我们在模拟研究中将该方法的几个变体相互比较,并与SAGE计算机软件包提供的最大似然估计进行比较。