Deng Weiping, Chen Hanfeng, Li Zhaohai
Department of Statistics, George Washington University, Washington, District of Columbia 20052, USA.
Genetics. 2006 Feb;172(2):1349-58. doi: 10.1534/genetics.105.047241. Epub 2005 Nov 4.
Often in genetic research, presence or absence of a disease is affected by not only the trait locus genotypes but also some covariates. The finite logistic regression mixture models and the methods under the models are developed for detection of a binary trait locus (BTL) through an interval-mapping procedure. The maximum-likelihood estimates (MLEs) of the logistic regression parameters are asymptotically unbiased. The null asymptotic distributions of the likelihood-ratio test (LRT) statistics for detection of a BTL are found to be given by the supremum of a chi2-process. The limiting null distributions are free of the null model parameters and are determined explicitly through only four (backcross case) or nine (intercross case) independent standard normal random variables. Therefore a threshold for detecting a BTL in a flanking marker interval can be approximated easily by using a Monte Carlo method. It is pointed out that use of a threshold incorrectly determined by reading off a chi2-probability table can result in an excessive false BTL detection rate much more severely than many researchers might anticipate. Simulation results show that the BTL detection procedures based on the thresholds determined by the limiting distributions perform quite well when the sample sizes are moderately large.
在基因研究中,疾病的存在与否通常不仅受性状位点基因型的影响,还受一些协变量的影响。有限逻辑回归混合模型及其模型下的方法是通过区间定位程序来检测二元性状位点(BTL)而开发的。逻辑回归参数的最大似然估计(MLE)是渐近无偏的。发现用于检测BTL的似然比检验(LRT)统计量的零渐近分布由卡方过程的上确界给出。极限零分布与零模型参数无关,仅通过四个(回交情况)或九个(杂交情况)独立的标准正态随机变量明确确定。因此,通过使用蒙特卡罗方法可以很容易地近似在侧翼标记区间中检测BTL的阈值。需要指出的是,使用通过读取卡方概率表错误确定的阈值可能导致过高的假BTL检测率,其严重程度远超许多研究人员的预期。模拟结果表明,当样本量适中时,基于由极限分布确定的阈值的BTL检测程序表现良好。