Biswas Shameek, Storey John D, Akey Joshua M
Department of Genome Sciences, University of Washington, 1705 NE Pacific Street, Seattle, WA 98195, USA.
BMC Bioinformatics. 2008 May 20;9:244. doi: 10.1186/1471-2105-9-244.
The combination of gene expression profiling with linkage analysis has become a powerful paradigm for mapping gene expression quantitative trait loci (eQTL). To date, most studies have searched for eQTL by analyzing gene expression traits one at a time. As thousands of expression traits are typically analyzed, this can reduce power because of the need to correct for the number of hypothesis tests performed. In addition, gene expression traits exhibit a complex correlation structure, which is ignored when analyzing traits individually.
To address these issues, we applied two different multivariate dimension reduction techniques, the Singular Value Decomposition (SVD) and Independent Component Analysis (ICA) to gene expression traits derived from a cross between two strains of Saccharomyces cerevisiae. Both methods decompose the data into a set of meta-traits, which are linear combinations of all the expression traits. The meta-traits were enriched for several Gene Ontology categories including metabolic pathways, stress response, RNA processing, ion transport, retro-transposition and telomeric maintenance. Genome-wide linkage analysis was performed on the top 20 meta-traits from both techniques. In total, 21 eQTL were found, of which 11 are novel. Interestingly, both cis and trans-linkages to the meta-traits were observed.
These results demonstrate that dimension reduction methods are a useful and complementary approach for probing the genetic architecture of gene expression variation.
基因表达谱分析与连锁分析相结合已成为绘制基因表达数量性状位点(eQTL)的强大范例。迄今为止,大多数研究一次仅分析一个基因表达性状来寻找eQTL。由于通常要分析数千个表达性状,由于需要对所进行的假设检验数量进行校正,这可能会降低检验效能。此外,基因表达性状呈现出复杂的相关结构,单独分析性状时会忽略这一点。
为了解决这些问题,我们将两种不同的多变量降维技术,即奇异值分解(SVD)和独立成分分析(ICA)应用于源自酿酒酵母两个菌株杂交的基因表达性状。两种方法都将数据分解为一组元性状,这些元性状是所有表达性状的线性组合。元性状在几个基因本体论类别中富集,包括代谢途径、应激反应、RNA加工、离子转运、逆转座和端粒维持。对这两种技术的前20个元性状进行全基因组连锁分析。总共发现了21个eQTL,其中11个是新的。有趣的是,观察到了与元性状的顺式和反式连锁。
这些结果表明,降维方法是探索基因表达变异遗传结构的一种有用且互补的方法。