Fan Ruzong, Jung Jeesun, Jin Lei
Department of Statistics, Texas A&M University, College Station, Texas 77843, USA.
Genetics. 2006 Jan;172(1):663-86. doi: 10.1534/genetics.105.046417. Epub 2005 Sep 19.
In this article, population-based regression models are proposed for high-resolution linkage disequilibrium mapping of quantitative trait loci (QTL). Two regression models, the "genotype effect model" and the "additive effect model," are proposed to model the association between the markers and the trait locus. The marker can be either diallelic or multiallelic. If only one marker is used, the method is similar to a classical setting by Nielsen and Weir, and the additive effect model is equivalent to the haplotype trend regression (HTR) method by Zaykin et al. If two/multiple marker data with phase ambiguity are used in the analysis, the proposed models can be used to analyze the data directly. By analytical formulas, we show that the genotype effect model can be used to model the additive and dominance effects simultaneously; the additive effect model takes care of the additive effect only. On the basis of the two models, F-test statistics are proposed to test association between the QTL and markers. By a simulation study, we show that the two models have reasonable type I error rates for a data set of moderate sample size. The noncentrality parameter approximations of F-test statistics are derived to make power calculation and comparison. By a simulation study, it is found that the noncentrality parameter approximations of F-test statistics work very well. Using the noncentrality parameter approximations, we compare the power of the two models with that of the HTR. In addition, a simulation study is performed to make a comparison on the basis of the haplotype frequencies of 10 SNPs of angiotensin-1 converting enzyme (ACE) genes.
在本文中,我们提出了基于群体的回归模型,用于数量性状基因座(QTL)的高分辨率连锁不平衡定位。提出了两种回归模型,即“基因型效应模型”和“加性效应模型”,以模拟标记与性状基因座之间的关联。标记可以是双等位基因或多等位基因。如果仅使用一个标记,该方法类似于Nielsen和Weir的经典设置,并且加性效应模型等同于Zaykin等人的单倍型趋势回归(HTR)方法。如果在分析中使用具有相位模糊性的两个/多个标记数据,则所提出的模型可直接用于分析数据。通过解析公式,我们表明基因型效应模型可用于同时模拟加性和显性效应;加性效应模型仅考虑加性效应。基于这两种模型,提出了F检验统计量来检验QTL与标记之间的关联。通过模拟研究,我们表明对于中等样本量的数据集,这两种模型具有合理的I型错误率。推导了F检验统计量的非中心参数近似值,以进行功效计算和比较。通过模拟研究发现,F检验统计量的非中心参数近似值效果很好。使用非中心参数近似值,我们将这两种模型的功效与HTR的功效进行了比较。此外,基于血管紧张素转换酶(ACE)基因的10个单核苷酸多态性(SNP)的单倍型频率进行了模拟研究以作比较。