Bergemann T L, Huang Z
Biostatistics, University of Minnesota, Minneapolis, MN 55455, USA.
Hum Hered. 2009;68(4):268-77. doi: 10.1159/000228924. Epub 2009 Jul 22.
BACKGROUND/AIMS: The case-parent triad design is commonly used in genetic association studies. Generally, samples are drawn from an affected offspring, manifesting a phenotype of interest, as well as from the parents. The trio genotypes may be analyzed using a variety of available methods, but we focus on log-linear models because they test for genetic association and additionally estimate the relative risks of transmission. The models need to be modified to adjust for missing genotypes. Furthermore, instability in the parameter estimates can arise when certain kinds of genotype combinations do not appear in the dataset.
In this paper, we kill two birds with one stone. We propose a new method to simultaneously account for missing genotype data and genotype combinations with zero counts. This method solves a zero-inflated Poisson (ZIP) regression likelihood. The maximum likelihood estimates yield relative risks and the information matrix gives appropriate variance estimates for inference. A likelihood ratio test determines the significance of genetic association.
We compared the ZIP regression to previously proposed methods in both simulation studies and in a dataset that investigates the risk of orofacial clefts. The ZIP likelihood estimates regression coefficients with less bias than other methods when the minor allele frequency is small.
背景/目的:病例-双亲三联体设计常用于基因关联研究。通常,样本取自表现出感兴趣表型的患病后代以及其父母。三联体基因型可使用多种现有方法进行分析,但我们专注于对数线性模型,因为它们可检验基因关联并额外估计传递的相对风险。需要对模型进行修改以调整缺失的基因型。此外,当数据集中未出现某些类型的基因型组合时,参数估计可能会不稳定。
在本文中,我们一举两得。我们提出了一种新方法,可同时处理缺失的基因型数据和计数为零的基因型组合。此方法解决了零膨胀泊松(ZIP)回归似然问题。最大似然估计得出相对风险,信息矩阵给出用于推断的适当方差估计。似然比检验确定基因关联的显著性。
我们在模拟研究和一个调查口腔面部裂隙风险的数据集里,将ZIP回归与先前提出的方法进行了比较。当次要等位基因频率较小时,ZIP似然估计回归系数的偏差比其他方法小。