ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species.

Affiliation

Department of Genetics, Washington University Medical School, St Louis, MO, USA.

Publication information

Nucleic Acids Res. 2012 Jan;40(Database issue):D162-8. doi: 10.1093/nar/gkr1180. Epub 2011 Dec 2.

DOI:10.1093/nar/gkr1180

PMID:22140105

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3245033/

Abstract

Saccharomyces cerevisiae is a primary model for studies of transcriptional control, and the specificities of most yeast transcription factors (TFs) have been determined by multiple methods. However, it is unclear which position weight matrices (PWMs) are most useful; for the roughly 200 TFs in yeast, there are over 1200 PWMs in the literature. To address this issue, we created ScerTF, a comprehensive database of 1226 motifs from 11 different sources. We identified a single matrix for each TF that best predicts in vivo data by benchmarking matrices against chromatin immunoprecipitation and TF deletion experiments. We also used in vivo data to optimize thresholds for identifying regulatory sites with each matrix. To correct for biases from different methods, we developed a strategy to combine matrices. These aligned matrices outperform the best available matrix for several TFs. We used the matrices to predict co-occurring regulatory elements in the genome and identified many known TF combinations. In addition, we predict new combinations and provide evidence of combinatorial regulation from gene expression data. The database is available through a web interface at http://ural.wustl.edu/ScerTF. The site allows users to search the database with a regulatory site or matrix to identify the TFs most likely to bind the input sequence.
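The abstract describes scoring candidate regulatory sites with a PWM and applying a per-matrix threshold. A minimal sketch of that general idea follows, assuming a log-odds PWM built from a count matrix with a uniform background and a pseudocount. The counts, the threshold of 4.0, and all function names are hypothetical and chosen only for illustration; none of them come from ScerTF, and this is not the authors' benchmarking or threshold-optimization procedure.

```python
import math

# Hypothetical count matrix for a 4-bp motif (one dict per position).
# These counts are invented for illustration; they are not ScerTF data.
COUNTS = [
    {"A": 8, "C": 0, "G": 1, "T": 1},
    {"A": 0, "C": 9, "G": 1, "T": 0},
    {"A": 0, "C": 0, "G": 10, "T": 0},
    {"A": 1, "C": 1, "G": 0, "T": 8},
]

# Assumption: uniform background frequencies and a pseudocount of 1 per base.
BACKGROUND = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
PSEUDOCOUNT = 1.0


def to_log_odds(counts, background, pseudocount):
    """Convert per-position counts into log2 odds against the background."""
    pwm = []
    for column in counts:
        total = sum(column.values()) + pseudocount * len(column)
        pwm.append({
            base: math.log2(((column[base] + pseudocount) / total) / background[base])
            for base in column
        })
    return pwm


def score_window(pwm, window):
    """Sum the log-odds of each base in a window as wide as the matrix."""
    return sum(column[base] for column, base in zip(pwm, window))


def scan(pwm, sequence, threshold):
    """Return (start, window, score) for forward-strand windows at or above threshold."""
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = score_window(pwm, window)
        if score >= threshold:
            hits.append((start, window, score))
    return hits


PWM = to_log_odds(COUNTS, BACKGROUND, PSEUDOCOUNT)
HITS = scan(PWM, "TTACGTAACGTT", threshold=4.0)  # threshold is an arbitrary example value

for start, window, score in HITS:
    print(start, window, round(score, 2))
```

This sketch scans only the forward strand and expects sequences containing only A, C, G and T. A practical scan would usually also score the reverse complement and use a background estimated from the genome being searched.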

Similar articles

YeTFaSCo: a database of evaluated yeast transcription factor sequence specificities.

Nucleic Acids Res. 2012 Jan;40(Database issue):D169-79. doi: 10.1093/nar/gkr993. Epub 2011 Nov 18.

MYBS: a comprehensive web server for mining transcription factor binding sites in yeast.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W221-6. doi: 10.1093/nar/gkm379. Epub 2007 May 30.

FlyFactorSurvey: a database of Drosophila transcription factor binding specificities determined using the bacterial one-hybrid system.

Nucleic Acids Res. 2011 Jan;39(Database issue):D111-7. doi: 10.1093/nar/gkq858. Epub 2010 Nov 19.

YEASTRACT: providing a programmatic access to curated transcriptional regulatory associations in Saccharomyces cerevisiae through a web services interface.

Nucleic Acids Res. 2011 Jan;39(Database issue):D136-40. doi: 10.1093/nar/gkq964. Epub 2010 Oct 23.

The YEASTRACT database: a tool for the analysis of transcription regulatory associations in Saccharomyces cerevisiae.

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D446-51. doi: 10.1093/nar/gkj013.

abc4pwm: affinity based clustering for position weight matrices in applications of DNA sequence analysis.

BMC Bioinformatics. 2022 Mar 3;23(1):83. doi: 10.1186/s12859-022-04615-z.

High-resolution DNA-binding specificity analysis of yeast transcription factors.

Genome Res. 2009 Apr;19(4):556-66. doi: 10.1101/gr.090233.108. Epub 2009 Jan 21.

Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices.

Nucleic Acids Res. 2012 Mar;40(5):e38. doi: 10.1093/nar/gkr1252. Epub 2011 Dec 19.

SwissRegulon, a database of genome-wide annotations of regulatory sites: recent updates.

Nucleic Acids Res. 2013 Jan;41(Database issue):D214-20. doi: 10.1093/nar/gks1145. Epub 2012 Nov 24.

Cited by

Identification of DNA motif pairs on paired sequences based on composite heterogeneous graph.

Front Genet. 2024 Jun 17;15:1424085. doi: 10.3389/fgene.2024.1424085. eCollection 2024.

Investigating pioneer factor activity and its coordination with chromatin remodelers using integrated synthetic oligo assay.

STAR Protoc. 2023 Jun 7;4(2):102279. doi: 10.1016/j.xpro.2023.102279.

Differential Hsp90-dependent gene expression is strain-specific and common among yeast strains.

iScience. 2023 Apr 10;26(5):106635. doi: 10.1016/j.isci.2023.106635. eCollection 2023 May 19.

Zinc cluster transcription factors frequently activate target genes using a non-canonical half-site binding mode.

Nucleic Acids Res. 2023 Jun 9;51(10):5006-5021. doi: 10.1093/nar/gkad320.

Chance promoter activities illuminate the origins of eukaryotic intergenic transcriptions.

Nat Commun. 2023 Apr 1;14(1):1826. doi: 10.1038/s41467-023-37610-w.

Origin recognition complex harbors an intrinsic nucleosome remodeling activity.

Proc Natl Acad Sci U S A. 2022 Oct 18;119(42):e2211568119. doi: 10.1073/pnas.2211568119. Epub 2022 Oct 10.

TFLink: an integrated gateway to access transcription factor-target gene interactions for multiple species.

Database (Oxford). 2022 Sep 16;2022. doi: 10.1093/database/baac083.

Nucleosome-directed replication origin licensing independent of a consensus DNA sequence.

Nat Commun. 2022 Aug 23;13(1):4947. doi: 10.1038/s41467-022-32657-7.

Predicting which genes will respond to transcription factor perturbations.

G3 (Bethesda). 2022 Jul 29;12(8). doi: 10.1093/g3journal/jkac144.

Cis-regulatory variants affect gene expression dynamics in yeast.

Elife. 2021 Aug 9;10:e68469. doi: 10.7554/eLife.68469.

References

Quantitative analysis demonstrates most transcription factors require only simple models of specificity.

Nat Biotechnol. 2011 Jun 7;29(6):480-3. doi: 10.1038/nbt.1893.

De novo identification and biophysical characterization of transcription-factor binding sites with microfluidic affinity analysis.

Nat Biotechnol. 2010 Sep;28(9):970-5. doi: 10.1038/nbt.1675. Epub 2010 Aug 29.

Comprehensive reanalysis of transcription factor knockout expression data in Saccharomyces cerevisiae reveals many new targets.

Nucleic Acids Res. 2010 Aug;38(14):4768-77. doi: 10.1093/nar/gkq232. Epub 2010 Apr 12.

Nucleic Acids Res. 2010 Jan;38(3):738-49. doi: 10.1093/nar/gkp989. Epub 2009 Nov 19.

The Gene Ontology in 2010: extensions and refinements.

Nucleic Acids Res. 2010 Jan;38(Database issue):D331-5. doi: 10.1093/nar/gkp1018. Epub 2009 Nov 17.

JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles.

Nucleic Acids Res. 2010 Jan;38(Database issue):D105-10. doi: 10.1093/nar/gkp950. Epub 2009 Nov 11.

Saccharomyces Genome Database provides mutant phenotype data.

Nucleic Acids Res. 2010 Jan;38(Database issue):D433-6. doi: 10.1093/nar/gkp917. Epub 2009 Nov 11.

Distinguishing direct versus indirect transcription factor-DNA interactions.

Genome Res. 2009 Nov;19(11):2090-100. doi: 10.1101/gr.094144.109. Epub 2009 Aug 3.

Diversity and complexity in DNA recognition by transcription factors.

Science. 2009 Jun 26;324(5935):1720-3. doi: 10.1126/science.1162327. Epub 2009 May 14.

Universal protein-binding microarrays for the comprehensive characterization of the DNA-binding specificities of transcription factors.

Nat Protoc. 2009;4(3):393-411. doi: 10.1038/nprot.2008.195.
