Zhang Luyan, Li Huihui, Li Zhonglai, Wang Jiankang
School of Mathematical Sciences, Beijing Normal University, Beijing 100875, China.
Genetics. 2008 Oct;180(2):1177-90. doi: 10.1534/genetics.108.092122. Epub 2008 Sep 9.
F(2) populations are commonly used in genetic studies of animals and plants. For simplicity, most quantitative trait locus or loci (QTL) mapping methods have been developed on the basis of populations having two distinct genotypes at each polymorphic marker or gene locus. In this study, we demonstrate that dominance can cause the interactions between markers and propose an inclusive linear model that includes marker variables and marker interactions so as to completely control both additive and dominance effects of QTL. The proposed linear model is the theoretical basis for inclusive composite-interval QTL mapping (ICIM) for F(2) populations, which consists of two steps: first, the best regression model is selected by stepwise regression, which approximately identifies markers and marker interactions explaining both additive and dominance variations; second, the interval mapping approach is applied to the phenotypic values adjusted by the regression model selected in the first step. Due to the limited mapping population size, the large number of variables, and multicollinearity between variables, coefficients in the inclusive linear model cannot be accurately determined in the first step. Interval mapping is necessary in the second step to fine tune the QTL to their true positions. The efficiency of including marker interactions in mapping additive and dominance QTL was demonstrated by extensive simulations using three QTL distribution models with two population sizes and an actual rice F(2) population.
F(2)群体常用于动植物的遗传学研究。为简单起见,大多数数量性状基因座(QTL)定位方法都是基于在每个多态性标记或基因座具有两种不同基因型的群体而开发的。在本研究中,我们证明显性会导致标记间的相互作用,并提出一个包含标记变量和标记相互作用的包容性线性模型,以便完全控制QTL的加性和显性效应。所提出的线性模型是F(2)群体包容性复合区间QTL定位(ICIM)的理论基础,它包括两个步骤:首先,通过逐步回归选择最佳回归模型,该模型近似识别解释加性和显性变异的标记及标记相互作用;其次,将区间定位方法应用于由第一步选择的回归模型调整后的表型值。由于定位群体规模有限、变量数量众多以及变量间的多重共线性,包容性线性模型中的系数在第一步无法准确确定。第二步需要进行区间定位,以便将QTL精细定位到其真实位置。通过使用具有两种群体规模的三种QTL分布模型以及一个实际的水稻F(2)群体进行广泛模拟,证明了在定位加性和显性QTL时纳入标记相互作用的有效性。