Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China.
Sci Rep. 2011;1:55. doi: 10.1038/srep00055. Epub 2011 Aug 5.
The rapid development of next generation sequencing (NGS) technology provides a new chance to extend the scale and resolution of genomic research. How to efficiently map millions of short reads to the reference genome and how to make accurate SNP calls are two major challenges in taking full advantage of NGS. In this article, we reviewed the current software tools for mapping and SNP calling, and evaluated their performance on samples from The Cancer Genome Atlas (TCGA) project. We found that BWA and Bowtie are better than the other alignment tools in comprehensive performance for Illumina platform, while NovoalignCS showed the best overall performance for SOLiD. Furthermore, we showed that next-generation sequencing platform has significantly lower coverage and poorer SNP-calling performance in the CpG islands, promoter and 5'-UTR regions of the genome. NGS experiments targeting for these regions should have higher sequencing depth than the normal genomic region.
下一代测序(NGS)技术的快速发展为扩展基因组研究的规模和分辨率提供了新的机会。如何有效地将数百万个短读段映射到参考基因组上,以及如何进行准确的 SNP 调用,是充分利用 NGS 的两个主要挑战。在本文中,我们回顾了当前用于映射和 SNP 调用的软件工具,并在来自癌症基因组图谱(TCGA)项目的样本上评估了它们的性能。我们发现,BWA 和 Bowtie 在综合性能方面优于 Illumina 平台的其他对齐工具,而 NovoalignCS 在 SOLiD 方面表现出最佳的整体性能。此外,我们还表明,在基因组的 CpG 岛、启动子和 5'-UTR 区域,下一代测序平台的覆盖度显著降低,SNP 调用性能较差。针对这些区域的 NGS 实验应具有比正常基因组区域更高的测序深度。