

UpCoT: an integrated pipeline tool for clustering upstream DNA sequences of orthologous genes in prokaryotic genomes.

Author information

Arun P V Parvati Sai, Prakash Jogadhenu S S

Affiliations

Department of Plant Sciences, School of Life Sciences, University of Hyderabad, Hyderabad, 500046, India.

Department of Biotechnology and Bioinformatics, School of Life Sciences, University of Hyderabad, P. O. Central University, Hyderabad, 500046, India.

Publication information

3 Biotech. 2016 Jun;6(1):74. doi: 10.1007/s13205-016-0363-4. Epub 2016 Feb 16.

Abstract

UpCoT is a pipeline tool that automates the series of steps involved in predicting cis-regulatory elements. UpCoT identifies orthologs for each gene in the target genome using bidirectional best BLAST hits against the reference genomes, then identifies potential orthologous transcriptional units using intergenic distance. Finally, it generates FASTA files containing the upstream sequences of the orthologous transcriptional units of each gene in the target genome. The inputs to UpCoT are protein sequence files (.faa), genome sequence files (.fna) and gene coordinate files (.ptt) for the target and reference genomes. The clustered upstream DNA sequences can be passed to motif prediction tools such as MEME, BioProspector, Gibbs Motif Sampler and MDscan to predict conserved DNA elements. We tested the performance of UpCoT using the genome of Synechocystis sp. PCC 6803 as the target and 13 different cyanobacterial genomes as references. The clustered upstream sequences that UpCoT generated for groES, ycf24 and nirA were used for cis-regulatory element prediction, and the results were consistent with the experimentally identified cis-regulatory elements. Therefore, UpCoT is a reliable, automated pipeline package for predicting orthologs, orthologous transcriptional units and orthologous upstream sequences of a selected prokaryotic genome. UpCoT can be downloaded from http://jssplab.uohyd.ac.in/upcot/.
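
The three steps named in the abstract (bidirectional best hits, grouping genes into transcriptional units by intergenic distance, and extracting upstream sequence) can be illustrated with a minimal Python sketch. This is not UpCoT's code: the function names, the `Gene` record, the 50 bp gap threshold and the 300 bp upstream length are all hypothetical choices made for illustration, and the hit lists stand in for parsed BLAST output rather than being read from real files.

```python
from collections import namedtuple

# Hypothetical gene record; coordinates are 1-based and inclusive.
Gene = namedtuple("Gene", "name start end strand")


def best_hits(hits):
    """Map each query to its highest-scoring subject.

    `hits` is an iterable of (query, subject, bitscore) rows.
    """
    best = {}
    for query, subject, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def bidirectional_best_hits(target_vs_ref, ref_vs_target):
    """Keep target/reference pairs that are each other's best hit."""
    forward = best_hits(target_vs_ref)
    reverse = best_hits(ref_vs_target)
    return {t: r for t, r in forward.items() if reverse.get(r) == t}


def transcriptional_units(genes, max_gap=50):
    """Group adjacent same-strand genes whose gap is at most max_gap bp.

    max_gap=50 is an assumed threshold, not a value taken from UpCoT.
    """
    units = []
    for gene in sorted(genes, key=lambda g: g.start):
        prev = units[-1][-1] if units else None
        if (prev is not None and prev.strand == gene.strand
                and gene.start - prev.end - 1 <= max_gap):
            units[-1].append(gene)
        else:
            units.append([gene])
    return units


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def upstream_sequence(genome, unit, length=300):
    """Return up to `length` bp upstream of a unit's first transcribed gene.

    `unit` is a list of Gene records sorted by start coordinate.
    """
    if unit[0].strand == "+":
        first = unit[0].start - 1  # 0-based index of the first coding base
        return genome[max(0, first - length):first]
    # On the minus strand the first transcribed gene has the highest
    # coordinate, so the upstream region lies to its right.
    end = unit[-1].end
    return reverse_complement(genome[end:end + length])
```

In this sketch a gene is compared only with the gene immediately before it in coordinate order, so a gene on the opposite strand always ends the current unit. A real pipeline would also need to parse the .ptt coordinate tables, handle circular chromosomes and write FASTA records, none of which is shown here.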


