Suppr超能文献

核苷酸序列中由位置特异性评分矩阵表示的基序簇的统计学意义。

Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences.

作者信息

Frith Martin C, Spouge John L, Hansen Ulla, Weng Zhiping

机构信息

Bioinformatics Program, Boston University, 44 Cummington Street, Boston MA 02215, USA.

出版信息

Nucleic Acids Res. 2002 Jul 15;30(14):3214-24. doi: 10.1093/nar/gkf438.

Abstract

The human genome encodes the transcriptional control of its genes in clusters of cis-elements that constitute enhancers, silencers and promoter signals. The sequence motifs of individual cis- elements are usually too short and degenerate for confident detection. In most cases, the requirements for organization of cis-elements within these clusters are poorly understood. Therefore, we have developed a general method to detect local concentrations of cis-element motifs, using predetermined matrix representations of the cis-elements, and calculate the statistical significance of these motif clusters. The statistical significance calculation is highly accurate not only for idealized, pseudorandom DNA, but also for real human DNA. We use our method 'cluster of motifs E-value tool' (COMET) to make novel predictions concerning the regulation of genes by transcription factors associated with muscle. COMET performs comparably with two alternative state-of-the-art techniques, which are more complex and lack E-value calculations. Our statistical method enables us to clarify the major bottleneck in the hard problem of detecting cis-regulatory regions, which is that many known enhancers do not contain very significant clusters of the motif types that we search for. Thus, discovery of additional signals that belong to these regulatory regions will be the key to future progress.

摘要

人类基因组通过构成增强子、沉默子和启动子信号的顺式元件簇对其基因进行转录控制。单个顺式元件的序列基序通常太短且具有简并性,难以可靠检测。在大多数情况下,对这些簇内顺式元件组织的要求了解甚少。因此,我们开发了一种通用方法,利用顺式元件的预定矩阵表示来检测顺式元件基序的局部浓度,并计算这些基序簇的统计显著性。这种统计显著性计算不仅对理想化的伪随机DNA高度准确,对真实的人类DNA也同样如此。我们使用我们的方法“基序簇E值工具”(COMET)对与肌肉相关的转录因子对基因的调控做出新的预测。COMET与另外两种更复杂且缺乏E值计算的先进技术表现相当。我们的统计方法使我们能够阐明检测顺式调控区域这一难题中的主要瓶颈,即许多已知的增强子并不包含我们所寻找的基序类型的非常显著的簇。因此,发现属于这些调控区域的其他信号将是未来进展的关键。

相似文献

4
The limits of de novo DNA motif discovery.从头开始的 DNA 基序发现的局限性。
PLoS One. 2012;7(11):e47836. doi: 10.1371/journal.pone.0047836. Epub 2012 Nov 7.
5
Regulatory motif discovery using a population clustering evolutionary algorithm.使用群体聚类进化算法进行调控基序发现。
IEEE/ACM Trans Comput Biol Bioinform. 2007 Jul-Sep;4(3):403-414. doi: 10.1109/tcbb.2007.1044.

引用本文的文献

6
7
Identification and computational analysis of gene regulatory elements.基因调控元件的鉴定与计算分析
Cold Spring Harb Protoc. 2015 Jan 5;2015(1):pdb.top083642. doi: 10.1101/pdb.top083642.

本文引用的文献

6
Detection of cis-element clusters in higher eukaryotic DNA.高等真核生物DNA中顺式元件簇的检测
Bioinformatics. 2001 Oct;17(10):878-89. doi: 10.1093/bioinformatics/17.10.878.
9
Nuclear hormone receptors and gene expression.核激素受体与基因表达。
Physiol Rev. 2001 Jul;81(3):1269-304. doi: 10.1152/physrev.2001.81.3.1269.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验