基于序列的新型方法用于鉴定原核基因组中的转录因子结合位点。

Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

机构信息

Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63108, USA.

出版信息

Bioinformatics. 2010 Nov 1;26(21):2672-7. doi: 10.1093/bioinformatics/btq501. Epub 2010 Aug 31.

DOI:10.1093/bioinformatics/btq501

PMID:20807838

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2981494/

Abstract

MOTIVATION

Computational techniques for microbial genomic sequence analysis are becoming increasingly important. With next-generation sequencing technology and the human microbiome project underway, current sequencing capacity is significantly greater than the speed at which organisms of interest can be studied experimentally. Most related computational work has been focused on sequence assembly, gene annotation and metabolic network reconstruction. We have developed a method that will primarily use available sequence data in order to determine prokaryotic transcription factor (TF) binding specificities.

RESULTS

Specificity determining residues (critical residues) were identified from crystal structures of DNA-protein complexes and TFs with the same critical residues were grouped into specificity classes. The putative binding regions for each class were defined as the set of promoters for each TF itself (autoregulatory) and the immediately upstream and downstream operons. MEME was used to find putative motifs within each separate class. Tests on the LacI and TetR TF families, using RegulonDB annotated sites, showed the sensitivity of prediction 86% and 80%, respectively.

AVAILABILITY

http://ural.wustl.edu/∼gsahota/HTHmotif/

摘要

动机

微生物基因组序列分析的计算技术变得越来越重要。随着下一代测序技术和人类微生物组计划的进行，目前的测序能力大大超过了感兴趣的生物体可以进行实验研究的速度。大多数相关的计算工作都集中在序列组装、基因注释和代谢网络重建上。我们开发了一种方法，主要利用现有的序列数据来确定原核转录因子（TF）的结合特异性。

结果

从 DNA-蛋白质复合物的晶体结构和具有相同关键残基的 TF 中确定了特异性决定残基（关键残基），并将具有相同关键残基的 TF 分为特异性类别。每个类别的确切结合区域被定义为每个 TF 自身的启动子集（自身调控）以及紧邻的上下游操纵子。MEME 用于在每个单独的类别中找到假定的基序。在使用 RegulonDB 注释的 LacI 和 TetR TF 家族上进行的测试中，预测的敏感性分别为 86%和 80%。

可用性

http://ural.wustl.edu/∼gsahota/HTHmotif/

相似文献

Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

Bioinformatics. 2010 Nov 1;26(21):2672-7. doi: 10.1093/bioinformatics/btq501. Epub 2010 Aug 31.

BinDNase: a discriminatory approach for transcription factor binding prediction using DNase I hypersensitivity data.

Bioinformatics. 2015 Sep 1;31(17):2852-9. doi: 10.1093/bioinformatics/btv294. Epub 2015 May 7.

Predicting the binding preference of transcription factors to individual DNA k-mers.

Bioinformatics. 2009 Apr 15;25(8):1012-8. doi: 10.1093/bioinformatics/btn645. Epub 2008 Dec 16.

Using sequence-specific chemical and structural properties of DNA to predict transcription factor binding sites.

PLoS Comput Biol. 2010 Nov 18;6(11):e1001007. doi: 10.1371/journal.pcbi.1001007.

Binding site graphs: a new graph theoretical framework for prediction of transcription factor binding sites.

PLoS Comput Biol. 2007 May;3(5):e90. doi: 10.1371/journal.pcbi.0030090. Epub 2007 Apr 10.

A generic approach to identify Transcription Factor-specific operator motifs; Inferences for LacI-family mediated regulation in Lactobacillus plantarum WCFS1.

BMC Genomics. 2008 Mar 27;9:145. doi: 10.1186/1471-2164-9-145.

Computational promoter analysis of mouse, rat and human antimicrobial peptide-coding genes.

BMC Bioinformatics. 2006 Dec 18;7 Suppl 5(Suppl 5):S8. doi: 10.1186/1471-2105-7-S5-S8.

Identification of context-dependent motifs by contrasting ChIP binding data.

Bioinformatics. 2010 Nov 15;26(22):2826-32. doi: 10.1093/bioinformatics/btq546. Epub 2010 Sep 23.

Computational approach towards promoter sequence comparison via TF mapping using a new distance measure.

Interdiscip Sci. 2011 Mar;3(1):43-9. doi: 10.1007/s12539-011-0057-x. Epub 2011 Mar 3.

引用本文的文献

Interfacial water confers transcription factors with dinucleotide specificity.

Nat Struct Mol Biol. 2025 Apr;32(4):650-661. doi: 10.1038/s41594-024-01449-6. Epub 2025 Jan 3.

MucR protein: Three decades of studies have led to the identification of a new H-NS-like protein.

Mol Microbiol. 2025 Feb;123(2):154-167. doi: 10.1111/mmi.15261. Epub 2024 Apr 15.

The PhoPQ Two-Component System Is the Major Regulator of Cell Surface Properties, Stress Responses and Plant-Derived Substrate Utilisation During Development of -Host Plant Pathosystems.

Front Microbiol. 2021 Jan 15;11:621391. doi: 10.3389/fmicb.2020.621391. eCollection 2020.

Identification of Position-Specific Correlations between DNA-Binding Domains and Their Binding Sites. Application to the MerR Family of Transcription Factors.

PLoS One. 2016 Sep 30;11(9):e0162681. doi: 10.1371/journal.pone.0162681. eCollection 2016.

Global transcriptional regulator TrmB family members in prokaryotes.

J Microbiol. 2016 Oct;54(10):639-45. doi: 10.1007/s12275-016-6362-7. Epub 2016 Sep 30.

Assessment of transfer methods for comparative genomics of regulatory networks in bacteria.

BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):277. doi: 10.1186/s12859-016-1113-7.

σ54-dependent regulome in Desulfovibrio vulgaris Hildenborough.

BMC Genomics. 2015 Nov 10;16:919. doi: 10.1186/s12864-015-2176-y.

An SOS Regulon under Control of a Noncanonical LexA-Binding Motif in the Betaproteobacteria.

J Bacteriol. 2015 Aug;197(16):2622-30. doi: 10.1128/JB.00035-15. Epub 2015 May 18.

Substrate-dependent activation of the Vibrio cholerae vexAB RND efflux system requires vexR.

PLoS One. 2015 Feb 19;10(2):e0117890. doi: 10.1371/journal.pone.0117890. eCollection 2015.

Inference of expanded Lrp-like feast/famine transcription factor targets in a non-model organism using protein structure-based prediction.

PLoS One. 2014 Sep 25;9(9):e107863. doi: 10.1371/journal.pone.0107863. eCollection 2014.

本文引用的文献

A human gut microbial gene catalogue established by metagenomic sequencing.

Nature. 2010 Mar 4;464(7285):59-65. doi: 10.1038/nature08821.

A new generation of homology search tools based on probabilistic inference.

Genome Inform. 2009 Oct;23(1):205-11.

Sites Inferred by Metabolic Background Assertion Labeling (SIMBAL): adapting the Partial Phylogenetic Profiling algorithm to scan sequences for signatures that predict protein function.

BMC Bioinformatics. 2010 Jan 26;11:52. doi: 10.1186/1471-2105-11-52.

The Pfam protein families database.

Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22. doi: 10.1093/nar/gkp985. Epub 2009 Nov 17.

Comparison of DNA binding across protein superfamilies.

Proteins. 2010 Jan;78(1):52-62. doi: 10.1002/prot.22525.

Fast UniFrac: facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data.

ISME J. 2010 Jan;4(1):17-27. doi: 10.1038/ismej.2009.97. Epub 2009 Aug 27.

A parsimony approach to biological pathway reconstruction/inference for genomes and metagenomes.

PLoS Comput Biol. 2009 Aug;5(8):e1000465. doi: 10.1371/journal.pcbi.1000465. Epub 2009 Aug 14.

An ORFome assembly approach to metagenomics sequences analysis.

J Bioinform Comput Biol. 2009 Jun;7(3):455-71. doi: 10.1142/s0219720009004151.

Genome assembly reborn: recent computational challenges.

Brief Bioinform. 2009 Jul;10(4):354-66. doi: 10.1093/bib/bbp026. Epub 2009 May 29.

Diversity of 23S rRNA genes within individual prokaryotic genomes.

PLoS One. 2009;4(5):e5437. doi: 10.1371/journal.pone.0005437. Epub 2009 May 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于序列的新型方法用于鉴定原核基因组中的转录因子结合位点。

Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

机构信息

Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63108, USA.

出版信息

Bioinformatics. 2010 Nov 1;26(21):2672-7. doi: 10.1093/bioinformatics/btq501. Epub 2010 Aug 31.

DOI:10.1093/bioinformatics/btq501

PMID:20807838

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2981494/

Abstract

MOTIVATION

RESULTS

AVAILABILITY

http://ural.wustl.edu/∼gsahota/HTHmotif/

摘要

动机

结果

可用性

http://ural.wustl.edu/∼gsahota/HTHmotif/

基于序列的新型方法用于鉴定原核基因组中的转录因子结合位点。

Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于序列的新型方法用于鉴定原核基因组中的转录因子结合位点。

Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性