MOH Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China.
PLoS One. 2012;7(12):e51188. doi: 10.1371/journal.pone.0051188. Epub 2012 Dec 7.
Next generation sequencing (NGS) technologies have greatly changed the landscape of transcriptomic studies of non-model organisms. Since there is no reference genome available, de novo assembly methods play key roles in the analysis of these data sets. Because of the huge amount of data generated by NGS technologies for each run, many assemblers, e.g., ABySS, Velvet and Trinity, are developed based on a de Bruijn graph due to its time- and space-efficiency. However, most of these assemblers were developed initially for the Illumina/Solexa platform. The performance of these assemblers on 454 transcriptomic data is unknown. In this study, we evaluated and compared the relative performance of these de Bruijn graph based assemblers on both simulated and real 454 transcriptomic data. The results suggest that Trinity, the Illumina/Solexa-specialized transcriptomic assembler, performs the best among the multiple de Bruijn graph assemblers, comparable to or even outperforming the standard 454 assembler Newbler which is based on the overlap-layout-consensus algorithm. Our evaluation is expected to provide helpful guidance for researchers to choose assemblers when analyzing 454 transcriptomic data.
下一代测序 (NGS) 技术极大地改变了非模式生物转录组研究的格局。由于没有参考基因组,从头组装方法在这些数据集的分析中起着关键作用。由于 NGS 技术为每个运行生成的大量数据,许多组装器,例如 ABySS、Velvet 和 Trinity,由于其时间和空间效率,都是基于 de Bruijn 图开发的。然而,这些组装器中的大多数最初是为 Illumina/Solexa 平台开发的。这些组装器在 454 转录组数据上的性能尚不清楚。在这项研究中,我们评估和比较了这些基于 de Bruijn 图的组装器在模拟和真实 454 转录组数据上的相对性能。结果表明,在多个 de Bruijn 图组装器中,Illumina/Solexa 专用转录组组装器 Trinity 的性能最佳,与基于重叠布局共识算法的标准 454 组装器 Newbler 相当,甚至优于后者。我们的评估有望为研究人员在分析 454 转录组数据时选择组装器提供有帮助的指导。