BaseClear BV, Leiden, The Netherlands.
Genome Biol. 2012 Jun 25;13(6):R56. doi: 10.1186/gb-2012-13-6-r56.
De novo assembly is a commonly used application of next-generation sequencing experiments. The ultimate goal is to puzzle millions of reads into one complete genome, although draft assemblies usually result in a number of gapped scaffold sequences. In this paper we propose an automated strategy, called GapFiller, to reliably close gaps within scaffolds using paired reads. The method shows good results on both bacterial and eukaryotic datasets, allowing only few errors. As a consequence, the amount of additional wetlab work needed to close a genome is drastically reduced. The software is available at http://www.baseclear.com/bioinformatics-tools/.
从头组装是下一代测序实验中常用的一种应用。其最终目标是将数以百万计的读取片段组合成一个完整的基因组,尽管草稿组装通常会产生许多有缺口的支架序列。在本文中,我们提出了一种自动化策略,称为 GapFiller,使用配对读取可靠地闭合支架内的缺口。该方法在细菌和真核数据集上都显示出了良好的效果,仅产生少量错误。因此,需要进行额外的湿实验来闭合基因组的工作量大大减少。该软件可在 http://www.baseclear.com/bioinformatics-tools/ 获取。