Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand.
Genome Biol Evol. 2010 Jul 12;2:410-24. doi: 10.1093/gbe/evq029.
Second-generation sequencing technology has allowed a very large increase in sequencing throughput. In order to make use of this high throughput, we have developed a pipeline for sequencing and de novo assembly of multiple mitochondrial genomes without the costs of indexing. Simulation studies on a mixture of diverse animal mitochondrial genomes showed that mitochondrial genomes could be reassembled from a high coverage of short (35 nt) reads, such as those generated by a second-generation Illumina Genome Analyzer. We then assessed this experimentally with long-range polymerase chain reaction products from mitochondria of a human, a rat, a bird, a frog, an insect, and a mollusc. Comparison with reference genomes was used for deconvolution of the assembled contigs rather than for mapping of sequence reads. As proof of concept, we report the complete mollusc mitochondrial genome of an olive shell (Amalda northlandica). It has a very unusual putative control region, which contains a structure that would probably only be detectable by next-generation sequencing. The general approach has considerable potential, especially when combined with indexed sequencing of different groups of genomes.
第二代测序技术使得测序通量有了很大的提高。为了利用这种高通量,我们开发了一种无需索引即可对多个线粒体基因组进行测序和从头组装的流水线。在对不同动物线粒体基因组混合物的模拟研究中,我们发现线粒体基因组可以从高覆盖率的短(35nt)reads 中重新组装,例如第二代 Illumina Genome Analyzer 生成的那些。然后,我们用来自人类、大鼠、鸟类、青蛙、昆虫和软体动物线粒体的长距离聚合酶链反应产物对此进行了实验评估。与参考基因组的比较用于对组装的连续序列进行分解,而不是用于对序列读取进行映射。作为概念验证,我们报告了一种橄榄壳(Amalda northlandica)的完整软体动物线粒体基因组。它具有一个非常不寻常的假定控制区,其中包含一个结构,可能只有下一代测序才能检测到。这种通用方法具有很大的潜力,特别是与不同基因组组的索引测序相结合时。