Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, USA.
Genetics. 2010 May;185(1):349-59. doi: 10.1534/genetics.110.114280. Epub 2010 Feb 15.
Genomewide multiple-loci mapping can be viewed as a challenging variable selection problem where the major objective is to select genetic markers related to a trait of interest. It is challenging because the number of genetic markers is large (often much larger than the sample size) and there is often strong linkage or linkage disequilibrium between markers. In this article, we developed two methods for genomewide multiple loci mapping: the Bayesian adaptive Lasso and the iterative adaptive Lasso. Compared with eight existing methods, the proposed methods have improved variable selection performance in both simulation and real data studies. The advantages of our methods come from the assignment of adaptive weights to different genetic makers and the iterative updating of these adaptive weights. The iterative adaptive Lasso is also computationally much more efficient than the commonly used marginal regression and stepwise regression methods. Although our methods are motivated by multiple-loci mapping, they are general enough to be applied to other variable selection problems.
全基因组多位点映射可以被视为一个具有挑战性的变量选择问题,其主要目标是选择与感兴趣的性状相关的遗传标记。之所以具有挑战性,是因为遗传标记的数量很大(通常比样本量要大得多),并且标记之间常常存在强连锁或连锁不平衡。在本文中,我们开发了两种全基因组多位点映射方法:贝叶斯自适应套索和迭代自适应套索。与八种现有方法相比,所提出的方法在模拟和真实数据研究中都提高了变量选择性能。我们方法的优势来自于对不同遗传标记赋予自适应权重以及这些自适应权重的迭代更新。迭代自适应套索在计算上也比常用的边际回归和逐步回归方法更加高效。尽管我们的方法是受多位点映射启发的,但它们足够通用,可以应用于其他变量选择问题。