Institut de Biologie Intégrative et des Systèmes, Université Laval, QC, Canada.
Mol Ecol. 2010 Mar;19 Suppl 1:115-31. doi: 10.1111/j.1365-294X.2009.04477.x.
Next-generation sequencing allows the discovery of large numbers of single nucleotide polymorphisms (SNPs) in species where little genomic information was previously available. Here, we assembled, de novo, over 130 mb of non-normalized cDNA using 454 pyrosequencing data from dwarf and normal lake whitefish and backcross hybrids. Our main goals were to gather a large data set of SNP markers, document their distribution within coding regions, evaluate the effect of species divergence on allele frequencies and combine results with previous genomic studies to identify candidate genes underlying the adaptive divergence of lake whitefish. We identified 6094 putative SNPs in 2674 contigs (mean size: 576 bp, range: 101-6116) and 1540 synonymous and 1734 non-synonymous mutations for a genome-wide non-synonymous to synonymous substitution rate ratio (p(N)/p(S)) of 0.37. As expected based on the young age (<15 000 years) of whitefish species pair, the overall level of divergence between them was relatively weak. Yet, 89 SNPs showed pronounced allele frequency differences between sympatric normal and dwarf whitefish. Among these, SNPs in genes annotated to energy metabolic functions were the most abundant and this, in addition to previous experimental data at the gene expression and phenotypic level, brings compelling evidence that genes involved in energy metabolism are prime candidates explaining the adaptive divergence of lake whitefish species pairs. Finally, we unexpectedly identified 44 contigs annotated to transposable elements and these were predominantly composed of backcross hybrids sequences. This indicates an elevated activity of transposable elements, which could potentially contribute to the reduced fitness of hybrids previously documented.
下一代测序技术允许在以前基因组信息很少的物种中发现大量的单核苷酸多态性 (SNP)。在这里,我们使用来自矮化和正常湖白鱼以及回交杂种的 454 焦磷酸测序数据,从头组装了超过 130 Mb 的非标准化 cDNA。我们的主要目标是收集大量的 SNP 标记数据集,记录它们在编码区域内的分布,评估物种分化对等位基因频率的影响,并将结果与以前的基因组研究相结合,以确定导致湖白鱼适应性分化的候选基因。我们在 2674 个连续体中鉴定出 6094 个假定 SNP(平均大小:576 bp,范围:101-6116)和 1540 个同义突变和 1734 个非同义突变,全基因组非同义到同义取代率比值(p(N)/p(S))为 0.37。基于白鱼物种对的年轻年龄(<15000 年),它们之间的整体分化水平相对较弱,这是意料之中的。然而,89 个 SNP 在同域正常和矮化白鱼之间表现出明显的等位基因频率差异。在这些 SNP 中,注释为能量代谢功能的基因中的 SNP 最为丰富,这加上基因表达和表型水平的先前实验数据,提供了令人信服的证据,表明参与能量代谢的基因是解释湖白鱼物种对适应性分化的主要候选基因。最后,我们意外地鉴定出 44 个注释为转座元件的连续体,这些连续体主要由回交杂种序列组成。这表明转座元件的活性升高,这可能导致先前记录的杂种适应性降低。