• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

转录程序:转录调控中高阶结构的建模

Transcriptional programs: modelling higher order structure in transcriptional control.

作者信息

Reid John E, Ott Sascha, Wernisch Lorenz

机构信息

MRC Biostatistics Unit, Institute of Public Health, University Forvie Site, Cambridge CB2 0SR, UK.

出版信息

BMC Bioinformatics. 2009 Jul 16;10:218. doi: 10.1186/1471-2105-10-218.

DOI:10.1186/1471-2105-10-218
PMID:19607663
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2725141/
Abstract

BACKGROUND

Transcriptional regulation is an important part of regulatory control in eukaryotes. Even if binding motifs for transcription factors are known, the task of finding binding sites by scanning sequences is plagued by false positives. One way to improve the detection of binding sites from motifs is by taking cooperativity of transcription factor binding into account. We propose a non-parametric probabilistic model, similar to a document topic model, for detecting transcriptional programs, groups of cooperative transcription factors and co-regulated genes. The analysis results in transcriptional programs which generalise both transcriptional modules and TF-target gene incidence matrices and provide a higher-level summary of these structures. The method is independent of prior specification of training sets of genes, for example, via gene expression data. The analysis is based on known binding motifs.

RESULTS

We applied our method to putative regulatory regions of 18,445 Mus musculus genes. We discovered just 68 transcriptional programs that effectively summarised the action of 149 transcription factors on these genes. Several of these programs were significantly enriched for known biological processes and signalling pathways. One transcriptional program has a significant overlap with a reference set of cell cycle specific transcription factors.

CONCLUSION

Our method is able to pick out higher order structure from noisy sequence analyses. The transcriptional programs it identifies potentially represent common mechanisms of regulatory control across the genome. It simultaneously predicts which genes are co-regulated and which sets of transcription factors cooperate to achieve this co-regulation. The programs we discovered enable biologists to choose new genes and transcription factors to study in specific transcriptional regulatory systems.

摘要

背景

转录调控是真核生物调控控制的重要组成部分。即使转录因子的结合基序已知,通过扫描序列来寻找结合位点的任务也会受到假阳性的困扰。从基序中提高结合位点检测的一种方法是考虑转录因子结合的协同性。我们提出了一种非参数概率模型,类似于文档主题模型,用于检测转录程序、协同转录因子组和共调控基因。分析结果得到的转录程序概括了转录模块和转录因子-靶基因关联矩阵,并对这些结构提供了更高级别的总结。该方法独立于例如通过基因表达数据对基因训练集的先验指定。分析基于已知的结合基序。

结果

我们将我们的方法应用于18445个小家鼠基因的假定调控区域。我们发现仅68个转录程序就有效地概括了149个转录因子对这些基因的作用。其中几个程序在已知的生物学过程和信号通路中显著富集。一个转录程序与一组细胞周期特异性转录因子的参考集有显著重叠。

结论

我们的方法能够从嘈杂的序列分析中挑选出高阶结构。它识别出的转录程序可能代表了全基因组调控控制的共同机制。它同时预测哪些基因是共调控的,以及哪些转录因子组协同实现这种共调控。我们发现的程序使生物学家能够选择新的基因和转录因子,以便在特定的转录调控系统中进行研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/4bc815fb1de9/1471-2105-10-218-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/9606e6d94413/1471-2105-10-218-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/dd57b1f06e69/1471-2105-10-218-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/481435de8522/1471-2105-10-218-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/43afb7f8f11a/1471-2105-10-218-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/eff5fd2c4dfd/1471-2105-10-218-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/8c3eb7733743/1471-2105-10-218-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/4bc815fb1de9/1471-2105-10-218-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/9606e6d94413/1471-2105-10-218-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/dd57b1f06e69/1471-2105-10-218-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/481435de8522/1471-2105-10-218-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/43afb7f8f11a/1471-2105-10-218-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/eff5fd2c4dfd/1471-2105-10-218-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/8c3eb7733743/1471-2105-10-218-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31da/2725141/4bc815fb1de9/1471-2105-10-218-7.jpg

相似文献

1
Transcriptional programs: modelling higher order structure in transcriptional control.转录程序:转录调控中高阶结构的建模
BMC Bioinformatics. 2009 Jul 16;10:218. doi: 10.1186/1471-2105-10-218.
2
Genome-wide prediction of transcriptional regulatory elements of human promoters using gene expression and promoter analysis data.利用基因表达和启动子分析数据对人类启动子的转录调控元件进行全基因组预测。
BMC Bioinformatics. 2006 Jul 4;7:330. doi: 10.1186/1471-2105-7-330.
3
Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells.全基因组计算机模拟鉴定调控人类细胞周期的转录调节因子。
Genome Res. 2003 May;13(5):773-80. doi: 10.1101/gr.947203.
4
A discriminative model for identifying spatial cis-regulatory modules.一种用于识别空间顺式调控模块的判别模型。
J Comput Biol. 2005 Jul-Aug;12(6):822-34. doi: 10.1089/cmb.2005.12.822.
5
Ab initio identification of putative human transcription factor binding sites by comparative genomics.通过比较基因组学从头鉴定假定的人类转录因子结合位点
BMC Bioinformatics. 2005 May 2;6:110. doi: 10.1186/1471-2105-6-110.
6
Unraveling transcriptional regulatory programs by integrative analysis of microarray and transcription factor binding data.通过微阵列和转录因子结合数据的综合分析来揭示转录调控程序。
Bioinformatics. 2008 Sep 1;24(17):1874-80. doi: 10.1093/bioinformatics/btn332. Epub 2008 Jun 27.
7
Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model.用于生成具有生物学意义和准确的系统发育足迹分析的TF-DNA结合的分子和结构考量:以LysR型转录调节因子家族作为研究模型
BMC Genomics. 2016 Aug 27;17(1):686. doi: 10.1186/s12864-016-3025-3.
8
Predicting distinct organization of transcription factor binding sites on the promoter regions: a new genome-based approach to expand human embryonic stem cell regulatory network.预测启动子区域转录因子结合位点的不同组织:一种新的基于基因组的方法来扩展人类胚胎干细胞调控网络。
Gene. 2013 Dec 1;531(2):212-9. doi: 10.1016/j.gene.2013.09.011. Epub 2013 Sep 13.
9
Combining sequence and time series expression data to learn transcriptional modules.结合序列和时间序列表达数据以学习转录模块。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jul-Sep;2(3):194-202. doi: 10.1109/TCBB.2005.34.
10
PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information.PC-TraFF:使用点互信息识别潜在协作转录因子。
BMC Bioinformatics. 2015 Dec 1;16:400. doi: 10.1186/s12859-015-0827-2.

引用本文的文献

1
A novel nodal enhancer dependent on pluripotency factors and smad2/3 signaling conditions a regulatory switch during epiblast maturation.一种新型节点增强子依赖于多能性因子和 smad2/3 信号,在胚胎外胚层成熟过程中调节开关。
PLoS Biol. 2014 Jun 24;12(6):e1001890. doi: 10.1371/journal.pbio.1001890. eCollection 2014 Jun.
2
STEME: efficient EM to find motifs in large data sets.STEME:高效的 EM 算法,用于在大数据集中发现模式。
Nucleic Acids Res. 2011 Oct;39(18):e126. doi: 10.1093/nar/gkr574. Epub 2011 Jul 23.
3
Discovering transcriptional modules by Bayesian data integration.

本文引用的文献

1
TREMOR--a tool for retrieving transcriptional modules by incorporating motif covariance.TREMOR——一种通过整合基序协方差来检索转录模块的工具。
Nucleic Acids Res. 2007;35(21):7360-71. doi: 10.1093/nar/gkm885. Epub 2007 Oct 25.
2
Automated discovery of functional generality of human gene expression programs.人类基因表达程序功能通用性的自动化发现
PLoS Comput Biol. 2007 Aug;3(8):e148. doi: 10.1371/journal.pcbi.0030148. Epub 2007 Jun 13.
3
Bayesian hierarchical model for transcriptional module discovery by jointly modeling gene expression and ChIP-chip data.
基于贝叶斯数据整合的转录模块发现。
Bioinformatics. 2010 Jun 15;26(12):i158-67. doi: 10.1093/bioinformatics/btq210.
通过联合建模基因表达和芯片数据发现转录模块的贝叶斯层次模型。
BMC Bioinformatics. 2007 Aug 3;8:283. doi: 10.1186/1471-2105-8-283.
4
oPOSSUM: integrated tools for analysis of regulatory motif over-representation.oPOSSUM:用于分析调控基序过度表达的综合工具。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W245-52. doi: 10.1093/nar/gkm427. Epub 2007 Jun 18.
5
Clustering of genes into regulons using integrated modeling-COGRIM.使用综合建模-COGRIM将基因聚类成调控子。
Genome Biol. 2007;8(1):R4. doi: 10.1186/gb-2007-8-1-r4.
6
Ensembl 2007.Ensembl 2007。
Nucleic Acids Res. 2007 Jan;35(Database issue):D610-7. doi: 10.1093/nar/gkl996. Epub 2006 Dec 5.
7
Computational reconstruction of transcriptional regulatory modules of the yeast cell cycle.酵母细胞周期转录调控模块的计算重建
BMC Bioinformatics. 2006 Sep 29;7:421. doi: 10.1186/1471-2105-7-421.
8
Inferring transcriptional modules from ChIP-chip, motif and microarray data.从芯片免疫沉淀、基序和微阵列数据推断转录模块。
Genome Biol. 2006;7(5):R37. doi: 10.1186/gb-2006-7-5-r37. Epub 2006 May 5.
9
Improved scoring of functional groups from gene expression data by decorrelating GO graph structure.通过去相关GO图结构从基因表达数据中改进功能组的评分。
Bioinformatics. 2006 Jul 1;22(13):1600-7. doi: 10.1093/bioinformatics/btl140. Epub 2006 Apr 10.
10
From genomics to chemical genomics: new developments in KEGG.从基因组学到化学基因组学:KEGG的新进展
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D354-7. doi: 10.1093/nar/gkj102.