Rampersaud E, Morris R W, Weinberg C R, Speer M C, Martin E R
Center for Human Genetics, Duke University Medical Center, Durham, NC 27710, USA.
Genet Epidemiol. 2007 Jan;31(1):18-30. doi: 10.1002/gepi.20189.
Genotype-based likelihood-ratio tests (LRT) of association that examine maternal and parent-of-origin effects have been previously developed in the framework of log-linear and conditional logistic regression models. In the situation where parental genotypes are missing, the expectation-maximization (EM) algorithm has been incorporated in the log-linear approach to allow incomplete triads to contribute to the LRT. We present an extension to this model which we call the Combined_LRT that incorporates additional information from the genotypes of unaffected siblings to improve assignment of incompletely typed families to mating type categories, thereby improving inference of missing parental data. Using simulations involving a realistic array of family structures, we demonstrate the validity of the Combined_LRT under the null hypothesis of no association and provide power comparisons under varying levels of missing data and using sibling genotype data. We demonstrate the improved power of the Combined_LRT compared with the family-based association test (FBAT), another widely used association test. Lastly, we apply the Combined_LRT to a candidate gene analysis in Autism families, some of which have missing parental genotypes. We conclude that the proposed log-linear model will be an important tool for future candidate gene studies, for many complex diseases where unaffected siblings can often be ascertained and where epigenetic factors such as imprinting may play a role in disease etiology.
先前已在对数线性和条件逻辑回归模型的框架内开发了基于基因型的关联似然比检验(LRT),用于检验母体效应和起源亲本效应。在亲本基因型缺失的情况下,期望最大化(EM)算法已被纳入对数线性方法中,以使不完全三联体能够对LRT有所贡献。我们提出了对该模型的一种扩展,我们称之为Combined_LRT,它纳入了来自未受影响同胞基因型的额外信息,以改善将不完全分型家庭分配到交配类型类别的情况,从而改进对缺失亲本数据的推断。通过涉及一系列实际家庭结构的模拟,我们证明了在无关联的零假设下Combined_LRT的有效性,并在不同程度的缺失数据情况下以及使用同胞基因型数据时提供了效能比较。我们证明了Combined_LRT与另一种广泛使用的关联检验——基于家庭的关联检验(FBAT)相比,具有更高的效能。最后,我们将Combined_LRT应用于自闭症家庭的候选基因分析,其中一些家庭存在亲本基因型缺失的情况。我们得出结论,对于许多复杂疾病,在通常可以确定未受影响的同胞且印记等表观遗传因素可能在疾病病因中起作用的情况下,所提出的对数线性模型将成为未来候选基因研究的重要工具。