• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

同时预测一组原核生物基因组中的转录因子结合位点。

Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes.

机构信息

Department of Bioinformatics and Genomics, Center for Bioinformatics Research, the University of North Carolina at Charlotte, Charlotte, NC 28223, USA.

出版信息

BMC Bioinformatics. 2010 Jul 23;11:397. doi: 10.1186/1471-2105-11-397.

DOI:10.1186/1471-2105-11-397
PMID:20653963
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2920276/
Abstract

BACKGROUND

Our current understanding of transcription factor binding sites (TFBSs) in sequenced prokaryotic genomes is very limited due to the lack of an accurate and efficient computational method for the prediction of TFBSs at a genome scale. In an attempt to change this situation, we have recently developed a comparative genomics based algorithm called GLECLUBS for de novo genome-wide prediction of TFBSs in a target genome. Although GLECLUBS has achieved rather high prediction accuracy of TFBSs in a target genome, it is still not efficient enough to be applied to all the sequenced prokaryotic genomes.

RESULTS

Here, we designed a new algorithm based on GLECLUBS called extended GLECLUBS (eGLECLUBS) for simultaneous prediction of TFBSs in a group of related prokaryotic genomes. When tested on a group of gamma-proteobacterial genomes including E. coli K12, a group of firmicutes genomes including B. subtilis and a group of cyanobacterial genomes using the same parameter settings, eGLECLUBS predicts more than 82% of known TFBSs in extracted inter-operonic sequences in both E. coli K12 and B. subtilis. Because each genome in a group is equally treated, it is highly likely that similar prediction accuracy has been achieved for each genome in the group.

CONCLUSIONS

We have developed a new algorithm for genome-wide de novo prediction of TFBSs in a group of related prokaryotic genomes. The algorithm has achieved the same level of accuracy and robustness as its predecessor GLECLUBS, but can work on dozens of genomes at the same time.

摘要

背景

由于缺乏一种准确有效的计算方法来预测基因组范围内的转录因子结合位点(TFBS),我们目前对测序原核基因组中的 TFBS 的理解非常有限。为了改变这种情况,我们最近开发了一种基于比较基因组学的算法,称为 GLECLUBS,用于在目标基因组中进行 TFBS 的从头全基因组预测。尽管 GLECLUBS 在目标基因组中实现了相当高的 TFBS 预测准确性,但它的效率仍然不够高,无法应用于所有测序的原核基因组。

结果

在这里,我们设计了一种基于 GLECLUBS 的新算法,称为扩展 GLECLUBS(eGLECLUBS),用于同时预测一组相关原核基因组中的 TFBS。当使用相同的参数设置在包括 E. coli K12 的一组γ-变形杆菌基因组、包括 B. subtilis 的一组Firmicutes 基因组和一组蓝藻基因组上进行测试时,eGLECLUBS 在提取的 E. coli K12 和 B. subtilis 中的种间序列中预测了超过 82%的已知 TFBS。由于组中的每个基因组都被平等对待,因此很可能在组中的每个基因组中都实现了类似的预测准确性。

结论

我们已经开发了一种新的算法,用于在一组相关的原核基因组中进行全基因组从头预测 TFBS。该算法与其前身 GLECLUBS 具有相同的准确性和稳健性,但可以同时处理几十个基因组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/6499909cce3e/1471-2105-11-397-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/ddfd7704b45f/1471-2105-11-397-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/17d541987bf4/1471-2105-11-397-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/1148fea8ffe2/1471-2105-11-397-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/48c5d081312d/1471-2105-11-397-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/6499909cce3e/1471-2105-11-397-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/ddfd7704b45f/1471-2105-11-397-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/17d541987bf4/1471-2105-11-397-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/1148fea8ffe2/1471-2105-11-397-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/48c5d081312d/1471-2105-11-397-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e7/2920276/6499909cce3e/1471-2105-11-397-5.jpg

相似文献

1
Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes.同时预测一组原核生物基因组中的转录因子结合位点。
BMC Bioinformatics. 2010 Jul 23;11:397. doi: 10.1186/1471-2105-11-397.
2
Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes.原核生物顺式调控结合位点的全基因组从头预测。
Nucleic Acids Res. 2009 Jun;37(10):e72. doi: 10.1093/nar/gkp248. Epub 2009 Apr 21.
3
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.一种用于原核生物基因组中顺式调控基序识别的综合且适用的系统发育足迹分析框架。
BMC Genomics. 2016 Aug 9;17:578. doi: 10.1186/s12864-016-2982-x.
4
Predicted transcription factor binding sites as predictors of operons in Escherichia coli and Streptomyces coelicolor.预测的转录因子结合位点作为大肠杆菌和天蓝色链霉菌操纵子的预测指标。
BMC Genomics. 2008 Feb 12;9:79. doi: 10.1186/1471-2164-9-79.
5
Genome-Wide De Novo Prediction of Cis-Regulatory Binding Sites in Mycobacterium tuberculosis H37Rv.结核分枝杆菌H37Rv中顺式调控结合位点的全基因组从头预测。
PLoS One. 2016 Feb 17;11(2):e0148965. doi: 10.1371/journal.pone.0148965. eCollection 2016.
6
Assessing phylogenetic motif models for predicting transcription factor binding sites.评估用于预测转录因子结合位点的系统发育基序模型。
Bioinformatics. 2009 Jun 15;25(12):i339-47. doi: 10.1093/bioinformatics/btp201.
7
A universal operon predictor for prokaryotic genomes.一种用于原核生物基因组的通用操纵子预测工具。
J Bioinform Comput Biol. 2009 Feb;7(1):19-38. doi: 10.1142/s0219720009003984.
8
Bioinformatic prediction of transcription factor binding sites at promoter regions of genes for photoperiod and vernalization responses in model and temperate cereal plants.模式植物和温带谷类作物中光周期及春化反应相关基因启动子区域转录因子结合位点的生物信息学预测
BMC Genomics. 2016 Aug 8;17:573. doi: 10.1186/s12864-016-2916-7.
9
MED: a new non-supervised gene prediction algorithm for bacterial and archaeal genomes.MED:一种用于细菌和古细菌基因组的新型无监督基因预测算法。
BMC Bioinformatics. 2007 Mar 16;8:97. doi: 10.1186/1471-2105-8-97.
10

引用本文的文献

1
Towards a map of cis-regulatory sequences in the human genome.构建人类基因组顺式调控序列图谱
Nucleic Acids Res. 2018 Jun 20;46(11):5395-5409. doi: 10.1093/nar/gky338.
2
CLIMP: Clustering Motifs via Maximal Cliques with Parallel Computing Design.CLIMP:通过具有并行计算设计的最大团进行基序聚类
PLoS One. 2016 Aug 3;11(8):e0160435. doi: 10.1371/journal.pone.0160435. eCollection 2016.
3
Genome-Wide De Novo Prediction of Cis-Regulatory Binding Sites in Mycobacterium tuberculosis H37Rv.结核分枝杆菌H37Rv中顺式调控结合位点的全基因组从头预测。

本文引用的文献

1
Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes.原核生物顺式调控结合位点的全基因组从头预测。
Nucleic Acids Res. 2009 Jun;37(10):e72. doi: 10.1093/nar/gkp248. Epub 2009 Apr 21.
2
Computational prediction of cAMP receptor protein (CRP) binding sites in cyanobacterial genomes.蓝藻基因组中cAMP受体蛋白(CRP)结合位点的计算预测。
BMC Genomics. 2009 Jan 15;10:23. doi: 10.1186/1471-2164-10-23.
3
DOOR: a database for prokaryotic operons.DOOR:一个原核生物操纵子数据库。
PLoS One. 2016 Feb 17;11(2):e0148965. doi: 10.1371/journal.pone.0148965. eCollection 2016.
4
De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets.通过对大量染色质免疫沉淀数据集进行综合分析,从头预测顺式调控元件和模块。
BMC Genomics. 2014 Dec 2;15:1047. doi: 10.1186/1471-2164-15-1047.
5
SPIC: a novel similarity metric for comparing transcription factor binding site motifs based on information contents.SPIC:一种基于信息内容比较转录因子结合位点基序的新型相似性度量。
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S14. doi: 10.1186/1752-0509-7-S2-S14. Epub 2013 Dec 17.
6
Phylogenetic footprinting: a boost for microbial regulatory genomics.系统发育足迹分析:促进微生物调控基因组学研究。
Protoplasma. 2012 Oct;249(4):901-7. doi: 10.1007/s00709-011-0351-9. Epub 2011 Nov 24.
7
MotifClick: prediction of cis-regulatory binding sites via merging cliques.MotifClick:通过合并节点预测顺式调控结合位点。
BMC Bioinformatics. 2011 Jun 16;12:238. doi: 10.1186/1471-2105-12-238.
Nucleic Acids Res. 2009 Jan;37(Database issue):D459-63. doi: 10.1093/nar/gkn757. Epub 2008 Nov 6.
4
The cis-regulatory map of Shewanella genomes.希瓦氏菌基因组的顺式调控图谱。
Nucleic Acids Res. 2008 Sep;36(16):5376-90. doi: 10.1093/nar/gkn515. Epub 2008 Aug 13.
5
The relative value of operon predictions.操纵子预测的相对价值。
Brief Bioinform. 2008 Sep;9(5):367-75. doi: 10.1093/bib/bbn019. Epub 2008 Apr 17.
6
The impact of next-generation sequencing technology on genetics.下一代测序技术对遗传学的影响。
Trends Genet. 2008 Mar;24(3):133-41. doi: 10.1016/j.tig.2007.12.007. Epub 2008 Feb 11.
7
RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.RegulonDB(版本6.0):大肠杆菌K-12超越转录的基因调控模型、活跃(实验性)注释启动子及Textpresso导航
Nucleic Acids Res. 2008 Jan;36(Database issue):D120-4. doi: 10.1093/nar/gkm994. Epub 2007 Dec 23.
8
DBD--taxonomically broad transcription factor predictions: new content and functionality.DBD——分类学范围广泛的转录因子预测:新内容与功能
Nucleic Acids Res. 2008 Jan;36(Database issue):D88-92. doi: 10.1093/nar/gkm964. Epub 2007 Dec 11.
9
DBTBS: a database of transcriptional regulation in Bacillus subtilis containing upstream intergenic conservation information.DBTBS:一个包含上游基因间保守信息的枯草芽孢杆菌转录调控数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D93-6. doi: 10.1093/nar/gkm910. Epub 2007 Oct 25.
10
Computational prediction of Pho regulons in cyanobacteria.蓝藻中Pho调控子的计算预测
BMC Genomics. 2007 Jun 8;8:156. doi: 10.1186/1471-2164-8-156.