Suppr超能文献

FGAP:一种自动缺口闭合工具。

FGAP: an automated gap closing tool.

作者信息

Piro Vitor C, Faoro Helisson, Weiss Vinicius A, Steffens Maria B R, Pedrosa Fabio O, Souza Emanuel M, Raittz Roberto T

机构信息

Laboratory of Bioinformatics, Professional and Technological Education Sector, Federal University of Paraná, Curitiba, PR, Brazil, Rua Dr, Alcides Vieira Arcoverde 1225, Curitiba, Paraná, Brazil.

出版信息

BMC Res Notes. 2014 Jun 18;7:371. doi: 10.1186/1756-0500-7-371.

Abstract

BACKGROUND

The fast reduction of prices of DNA sequencing allowed rapid accumulation of genome data. However, the process of obtaining complete genome sequences is still very time consuming and labor demanding. In addition, data produced from various sequencing technologies or alternative assemblies remain underexplored to improve assembly of incomplete genome sequences.

FINDINGS

We have developed FGAP, a tool for closing gaps of draft genome sequences that takes advantage of different datasets. FGAP uses BLAST to align multiple contigs against a draft genome assembly aiming to find sequences that overlap gaps. The algorithm selects the best sequence to fill and eliminate the gap.

CONCLUSIONS

FGAP reduced the number of gaps by 78% in an E. coli draft genome assembly using two different sequencing technologies, Illumina and 454. Using PacBio long reads, 98% of gaps were solved. In human chromosome 14 assemblies, FGAP reduced the number of gaps by 35%. All the inserted sequences were validated with a reference genome using QUAST. The source code and a web tool are available at http://www.bioinfo.ufpr.br/fgap/.

摘要

背景

DNA测序价格的快速下降使得基因组数据得以迅速积累。然而,获得完整基因组序列的过程仍然非常耗时且费力。此外,来自各种测序技术或替代组装产生的数据在改善不完整基因组序列的组装方面仍未得到充分探索。

研究结果

我们开发了FGAP,这是一种利用不同数据集来填补基因组草图序列缺口的工具。FGAP使用BLAST将多个重叠群与基因组草图组装进行比对,旨在找到与缺口重叠的序列。该算法选择最佳序列来填补并消除缺口。

结论

在使用Illumina和454这两种不同测序技术的大肠杆菌基因组草图组装中,FGAP将缺口数量减少了78%。使用PacBio长读长,98%的缺口得到了解决。在人类14号染色体组装中,FGAP将缺口数量减少了35%。所有插入序列均使用QUAST通过参考基因组进行了验证。源代码和网络工具可在http://www.bioinfo.ufpr.br/fgap/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e712/4091766/7e4b66a67104/1756-0500-7-371-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验