Dhapola Parashar, Chowdhury Shantanu
GNR Knowledge Center for Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, New Delhi 110 025, India.
GNR Knowledge Center for Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, New Delhi 110 025, India Proteomics and Structural Biology Unit, CSIR-Institute of Genomics and Integrative Biology, New Delhi 110 025, India
Nucleic Acids Res. 2016 Jul 8;44(W1):W277-83. doi: 10.1093/nar/gkw425. Epub 2016 May 16.
DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that can allow batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with Ensembl Compara database, to allow users mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way.
DNA鸟嘌呤四链体(G4s)是非经典的DNA二级结构,会影响复制、转录和重组等基因组过程。G4s通过特定的核苷酸基序在计算机上识别,这些基序也被称为推定G4(PG4)基序。尽管这些结构具有普遍相关性,但目前还没有工具能够以用户友好的界面进行批量查询和全基因组范围的这些基序分析。QuadBase2(quadbase.igib.res.in)展示了先前发布的QuadBase数据库全新的网络服务器版本。QuadBase2使用户能够通过EuQuad模块在多达178种真核生物中挖掘PG4基序。该模块与Ensembl Compara数据库对接,允许用户在跨真核生物的感兴趣基因的直系同源物中挖掘PG4基序。通过ProQuad模块,可以在1719种原核生物的基因及其启动子序列中挖掘PG4基序。该模块具有一项功能,允许在全基因组范围内挖掘PG4基序并将其可视化为圆形直方图。TetraplexFinder是用于在用户提供的序列中挖掘PG4基序的模块,现在能够处理多达20MB的数据。QuadBase2是一个全面的PG4基序挖掘工具,以用户友好的方式进一步扩展了挖掘PG4基序的配置和算法。