通过功能保守性预测协同转录因子

Prediction of synergistic transcription factors by function conservation.

作者信息

Hu Zihua, Hu Boyu, Collins James F

机构信息

New York State Center of Excellence in Bioinformatics and Life Sciences, Department of Biostatistics, Department of Medicine, University at Buffalo, State University of New York (SUNY), Buffalo, NY 14260, USA.

出版信息

Genome Biol. 2007;8(12):R257. doi: 10.1186/gb-2007-8-12-r257.

DOI:10.1186/gb-2007-8-12-r257

PMID:18053230

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2246259/

Abstract

BACKGROUND

Previous methods employed for the identification of synergistic transcription factors (TFs) are based on either TF enrichment from co-regulated genes or phylogenetic footprinting. Despite the success of these methods, both have limitations.

RESULTS

We propose a new strategy to identify synergistic TFs by function conservation. Rather than aligning the regulatory sequences from orthologous genes and then identifying conserved TF binding sites (TFBSs) in the alignment, we developed computational approaches to implement the novel strategy. These methods include combinatorial TFBS enrichment utilizing distance constraints followed by enrichment of overlapping orthologous genes from human and mouse, whose regulatory sequences contain the enriched TFBS combinations. Subsequently, integration of function conservation from both TFBS and overlapping orthologous genes was achieved by correlation analyses. These techniques have been used for genome-wide promoter analyses, which have led to the identification of 51 homotypic TF combinations; the validity of these approaches has been exemplified by both known TF-TF interactions and function coherence analyses. We further provide computational evidence that our novel methods were able to identify synergistic TFs to a much greater extent than phylogenetic footprinting.

CONCLUSION

Function conservation based on the concordance of combinatorial TFBS enrichment along with enrichment of overlapping orthologous genes has been proven to be a successful means for the identification of synergistic TFs. This approach avoids the limitations of phylogenetic footprinting as it does not depend upon sequence alignment. It utilizes existing gene annotation data, such as those available in GO, thus providing an alternative method for functional TF discovery and annotation.

摘要

背景

先前用于识别协同转录因子（TFs）的方法要么基于从共调控基因中富集TF，要么基于系统发育足迹法。尽管这些方法取得了成功，但两者都有局限性。

结果

我们提出了一种通过功能保守性来识别协同TFs的新策略。我们不是对齐直系同源基因的调控序列，然后在比对中识别保守的TF结合位点（TFBSs），而是开发了计算方法来实施这一新策略。这些方法包括利用距离约束进行组合TFBS富集，随后从人和小鼠中富集重叠的直系同源基因，其调控序列包含富集的TFBS组合。随后，通过相关性分析实现了TFBS和重叠直系同源基因两者功能保守性的整合。这些技术已用于全基因组启动子分析，从而识别出51种同型TF组合；已知的TF-TF相互作用和功能一致性分析都例证了这些方法的有效性。我们进一步提供了计算证据，表明我们的新方法比系统发育足迹法能在更大程度上识别协同TFs。

结论

基于组合TFBS富集与重叠直系同源基因富集一致性的功能保守性已被证明是识别协同TFs的一种成功方法。这种方法避免了系统发育足迹法的局限性，因为它不依赖于序列比对。它利用了现有的基因注释数据，如GO中可用的数据，从而为功能性TF的发现和注释提供了一种替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5af6/2246259/bce52123f10c/gb-2007-8-12-r257-1.jpg

相似文献

Prediction of synergistic transcription factors by function conservation.

Genome Biol. 2007;8(12):R257. doi: 10.1186/gb-2007-8-12-r257.

Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model.

BMC Genomics. 2016 Aug 27;17(1):686. doi: 10.1186/s12864-016-3025-3.

Evaluating phylogenetic footprinting for human-rodent comparisons.

Bioinformatics. 2006 Feb 15;22(4):430-7. doi: 10.1093/bioinformatics/bti819. Epub 2005 Dec 6.

Integrating genomic data to predict transcription factor binding.

Genome Inform. 2005;16(1):83-94.

Incorporating evolution of transcription factor binding sites into annotated alignments.

J Biosci. 2007 Aug;32(5):841-50. doi: 10.1007/s12038-007-0084-2.

Discovering approximate-associated sequence patterns for protein-DNA interactions.

Bioinformatics. 2011 Feb 15;27(4):471-8. doi: 10.1093/bioinformatics/btq682. Epub 2010 Dec 30.

Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data.

BMC Bioinformatics. 2006 Apr 27;7:229. doi: 10.1186/1471-2105-7-229.

Identifying functional transcription factor binding sites in yeast by considering their positional preference in the promoters.

PLoS One. 2013 Dec 26;8(12):e83791. doi: 10.1371/journal.pone.0083791. eCollection 2013.

PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.

PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.

Identifying cooperative transcription factors in yeast using multiple data sources.

BMC Syst Biol. 2014;8 Suppl 5(Suppl 5):S2. doi: 10.1186/1752-0509-8-S5-S2. Epub 2014 Dec 12.

引用本文的文献

BestCRM: An Exhaustive Search for Optimal Cis-Regulatory Modules in Promoters Accelerated by the Multidimensional Hash Function.

Int J Mol Sci. 2024 Feb 5;25(3):1903. doi: 10.3390/ijms25031903.

Removing Background Co-occurrences of Transcription Factor Binding Sites Greatly Improves the Prediction of Specific Transcription Factor Cooperations.

Front Genet. 2018 May 29;9:189. doi: 10.3389/fgene.2018.00189. eCollection 2018.

Investigating transcription factor synergism in humans.

DNA Res. 2018 Feb 1;25(1):103-112. doi: 10.1093/dnares/dsx041.

Combinatorial Cis-regulation in Saccharomyces Species.

G3 (Bethesda). 2016 Jan 15;6(3):653-67. doi: 10.1534/g3.115.024331.

PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information.

BMC Bioinformatics. 2015 Dec 1;16:400. doi: 10.1186/s12859-015-0827-2.

Identification of HMX1 target genes: a predictive promoter model approach.

Mol Vis. 2013 Aug 6;19:1779-94. eCollection 2013.

Searching for synergies: matrix algebraic approaches for efficient pair screening.

PLoS One. 2013 Jul 25;8(7):e68598. doi: 10.1371/journal.pone.0068598. Print 2013.

Simplified method to predict mutual interactions of human transcription factors based on their primary structure.

PLoS One. 2011;6(7):e21887. doi: 10.1371/journal.pone.0021887. Epub 2011 Jul 5.

Genome-wide identification of conserved regulatory function in diverged sequences.

Genome Res. 2011 Jul;21(7):1139-49. doi: 10.1101/gr.119016.110. Epub 2011 May 31.

Transcriptional regulation of the Menkes copper ATPase (Atp7a) gene by hypoxia-inducible factor (HIF2{alpha}) in intestinal epithelial cells.

Am J Physiol Cell Physiol. 2011 Jun;300(6):C1298-305. doi: 10.1152/ajpcell.00023.2011. Epub 2011 Feb 23.

本文引用的文献

Promoter analysis of intestinal genes induced during iron-deprivation reveals enrichment of conserved SP1-like binding sites.

BMC Genomics. 2007 Nov 15;8:420. doi: 10.1186/1471-2164-8-420.

Tissue-specific transcriptional regulation has diverged significantly between human and mouse.

Nat Genet. 2007 Jun;39(6):730-2. doi: 10.1038/ng2047. Epub 2007 May 21.

Zic2 and Zic3 synergistically control neurulation and segmentation of paraxial mesoderm in mouse embryo.

Dev Biol. 2007 Jun 15;306(2):669-84. doi: 10.1016/j.ydbio.2007.04.003. Epub 2007 Apr 12.

SP1 transcription factors in male germ cell development and differentiation.

Mol Cell Endocrinol. 2007 May 30;270(1-2):1-7. doi: 10.1016/j.mce.2007.03.001. Epub 2007 Mar 12.

Distinct and Overlapping Roles for E2F Family Members in Transcription, Proliferation and Apoptosis.

Curr Mol Med. 2006 Nov;6(7):739-48. doi: 10.2174/1566524010606070739.

Transcription factor TFIIIB and transcription by RNA polymerase III.

Biochem Soc Trans. 2006 Dec;34(Pt 6):1082-7. doi: 10.1042/BST0341082.

Computational analysis of tissue-specific combinatorial gene regulation: predicting interaction between transcription factors in human tissues.

Nucleic Acids Res. 2006;34(17):4925-36. doi: 10.1093/nar/gkl595. Epub 2006 Sep 18.

Identification of common transcriptional regulatory elements in interleukin-17 target genes.

J Biol Chem. 2006 Aug 25;281(34):24138-48. doi: 10.1074/jbc.M604597200. Epub 2006 Jun 23.

Regulation of the expression of human organic anion transporter 3 by hepatocyte nuclear factor 1alpha/beta and DNA methylation.

Mol Pharmacol. 2006 Sep;70(3):887-96. doi: 10.1124/mol.106.025494. Epub 2006 Jun 22.

Unbiased location analysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome.

Genome Res. 2006 May;16(5):595-605. doi: 10.1101/gr.4887606. Epub 2006 Apr 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过功能保守性预测协同转录因子

Prediction of synergistic transcription factors by function conservation.

作者信息

Hu Zihua, Hu Boyu, Collins James F

机构信息