Department of Entomology, University of Illinois, 320 Morrill Hall, 505 S, Goodwin Ave, Urbana, Illinois 61801, USA.
BMC Genomics. 2009 Dec 27;10:633. doi: 10.1186/1471-2164-10-633.
The full power of modern genetics has been applied to the study of speciation in only a small handful of genetic model species--all of which speciated allopatrically. Here we report the first large expressed sequence tag (EST) study of a candidate for ecological sympatric speciation, the apple maggot Rhagoletis pomonella, using massively parallel pyrosequencing on the Roche 454-FLX platform. To maximize transcript diversity we created and sequenced separate libraries from larvae, pupae, adult heads, and headless adult bodies.
We obtained 239,531 sequences which assembled into 24,373 contigs. A total of 6810 unique protein coding genes were identified among the contigs and long singletons, corresponding to 48% of all known Drosophila melanogaster protein-coding genes. Their distribution across GO classes suggests that we have obtained a representative sample of the transcriptome. Among these sequences are many candidates for potential R. pomonella "speciation genes" (or "barrier genes") such as those controlling chemosensory and life-history timing processes. Furthermore, we identified important marker loci including more than 40,000 single nucleotide polymorphisms (SNPs) and over 100 microsatellites. An initial search for SNPs at which the apple and hawthorn host races differ suggested at least 75 loci warranting further work. We also determined that developmental expression differences remained even after normalization; transcripts expected to show different expression levels between larvae and pupae in D. melanogaster also did so in R. pomonella. Preliminary comparative analysis of transcript presences and absences revealed evidence of gene loss in Drosophila and gain in the higher dipteran clade Schizophora.
These data provide a much needed resource for exploring mechanisms of divergence in this important model for sympatric ecological speciation. Our description of ESTs from a substantial portion of the R. pomonella transcriptome will facilitate future functional studies of candidate genes for olfaction and diapause-related life history timing, and will enable large scale expression studies. Similarly, the identification of new SNP and microsatellite markers will facilitate future population and quantitative genetic studies of divergence between the apple and hawthorn-infesting host races.
现代遗传学的全部力量仅应用于少数几种遗传模式物种的物种形成研究——所有这些物种都是异域物种形成的。在这里,我们报告了第一个关于生态同域物种形成候选者的大规模表达序列标签(EST)研究,使用 Roche 454-FLX 平台上的大规模平行焦磷酸测序。为了最大限度地提高转录多样性,我们分别从小型幼虫、蛹、成虫头部和无头成虫体中创建和测序了单独的文库。
我们获得了 239531 条序列,组装成 24373 个连续序列。在连续序列和长单序列中总共鉴定出 6810 个独特的蛋白质编码基因,对应于所有已知黑腹果蝇蛋白质编码基因的 48%。它们在 GO 类别的分布表明,我们已经获得了转录组的代表性样本。在这些序列中,有许多可能是 R. pomonella“物种形成基因”(或“隔离基因”)的候选基因,例如那些控制化学感觉和生活史时间进程的基因。此外,我们还鉴定了重要的标记基因座,包括超过 40000 个单核苷酸多态性(SNP)和 100 多个微卫星。对苹果和山楂宿主种群之间差异的 SNP 进行的初步搜索表明,至少有 75 个基因座值得进一步研究。我们还确定,即使在归一化后,发育表达差异仍然存在;在黑腹果蝇中预期在幼虫和蛹之间表现出不同表达水平的转录本在 R. pomonella 中也是如此。对转录本存在和缺失的初步比较分析表明,在果蝇中存在基因丢失,而在更高的双翅目分支 Schizophora 中存在基因获得。
这些数据为探索这个重要的同域生态物种形成模型中的分歧机制提供了急需的资源。我们对 R. pomonella 转录组的大部分 EST 的描述将有助于嗅觉和滞育相关生活史时间的候选基因的未来功能研究,并将能够进行大规模表达研究。同样,新的 SNP 和微卫星标记的鉴定将有助于未来对苹果和山楂寄生宿主种群之间的分歧进行种群和数量遗传研究。