Pantoja Yan, Pinheiro Kenny, Veras Allan, Araújo Fabrício, Lopes de Sousa Ailton, Guimarães Luis Carlos, Silva Artur, Ramos Rommel T J
Institute of Biological Sciences, Federal University of Pará, Belém, Pará, Brazil.
PLoS One. 2017 May 24;12(5):e0178154. doi: 10.1371/journal.pone.0178154. eCollection 2017.
Since the advent of next-generation sequencing (NGS), the increased production of genomic data has created a need for new bioinformatics tools and fields of study, such as comparative genomics. In comparative genomics, the genetic material of one organism is compared directly with that of another to better understand the species involved. Moreover, the exponentially growing number of deposited prokaryotic genomes has enabled the investigation of genomic characteristics that are intrinsic to particular species. This led to a new approach to comparative genomics, termed pan-genomics, in which multiple organisms of the same species or genus are compared. Many tools can currently perform pan-genomic analyses, including PGAP (Pan-Genome Analysis Pipeline), Panseq (Pan-Genome Sequence Analysis Program) and PGAT (Prokaryotic Genome Analysis Tool). Of these, PGAP was developed in the Perl scripting language. Its reliance on UNIX terminals and its extensive, heavily parameterized command line can be an obstacle for users without prior computational experience. The aim of this study was therefore to develop a web application, PanWeb, that serves as a graphical interface for PGAP. In addition, the application uses the output files of the PGAP pipeline to generate graphics with custom scripts written in the R programming language. PanWeb is freely available at http://www.computationalbiology.ufpa.br/panweb.
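To make the idea of comparing several organisms of one species concrete, the following is a minimal Python sketch of the usual pan-genome partition: gene families shared by every strain (core), families found in only one strain (unique), and everything in between (accessory). It is an illustration only and does not reproduce PGAP or PanWeb. The function name `partition_pan_genome`, the strain labels, and the gene labels are all hypothetical, and the sketch assumes that genes have already been clustered into families by some upstream step.

```python
from typing import Dict, Set


def partition_pan_genome(strains: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Split gene families into pan, core, accessory, and unique sets.

    `strains` maps a strain label to the set of gene-family identifiers
    found in that strain (hypothetical input format, assumed pre-clustered).
    """
    if not strains:
        raise ValueError("at least one strain is required")

    gene_sets = list(strains.values())

    # Pan-genome: every gene family seen in at least one strain.
    pan = set().union(*gene_sets)

    # Core genome: gene families present in every strain.
    core = set.intersection(*gene_sets)

    # Count how many strains carry each gene family.
    counts = {gene: sum(gene in s for s in gene_sets) for gene in pan}

    # Unique: found in exactly one strain (and not already core,
    # which only matters when a single strain is supplied).
    unique = {gene for gene, n in counts.items() if n == 1} - core

    # Accessory: shared by some, but not all, strains.
    accessory = pan - core - unique

    return {"pan": pan, "core": core, "accessory": accessory, "unique": unique}


# Toy input, invented purely for illustration.
toy_strains = {
    "strain_A": {"dnaA", "gyrB", "recA", "toxX"},
    "strain_B": {"dnaA", "gyrB", "recA", "pilY"},
    "strain_C": {"dnaA", "gyrB", "pilY"},
}

result = partition_pan_genome(toy_strains)
for name in ("pan", "core", "accessory", "unique"):
    print(name, sorted(result[name]))
```

Representing each strain as a set keeps the partition to a few set operations. A real pipeline would first have to decide which genes belong to the same family, and that clustering step is the part this sketch deliberately leaves out.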