Rigorous pattern-recognition methods for DNA sequences. Analysis of promoter sequences from Escherichia coli.

Author information

Galas D J, Eggert M, Waterman M S

Publication information

J Mol Biol. 1985 Nov 5;186(1):117-28. doi: 10.1016/0022-2836(85)90262-1.

Abstract

The basic nature of the sequence features that define a promoter sequence for Escherichia coli RNA polymerase has been established by a variety of biochemical and genetic methods. We have developed rigorous analytical methods for finding unknown patterns that occur imperfectly in a set of several sequences, and have used them to examine a set of bacterial promoters. The algorithm easily discovers the "consensus" sequences for the -10 and -35 regions, which are essentially identical to the results of previous analyses, but requires no prior assumptions about the common patterns. By explicitly specifying the nature of the search for consensus sequences, we give a rigorous definition to this concept that should be widely applicable. We also have provided estimates for the statistical significance of common patterns discovered in sets of sequences. In addition to providing a rigorous basis for defining known consensus regions, we have found additional features in these promoters that may have functional significance. These added features were located on either side of the -35 region. The pattern 5', or upstream, from the -35 region was found using the standard alphabet (A, G, C and T), but the pattern between the -10 and the -35 regions was detectable only in a sub-alphabet. Recent results relating DNA sequence to helix conformation suggest that the former (upstream) pattern may have a functional significance. Possible roles in promoter function are discussed in this light, and an observation of altered promoter function involving the upstream region is reported that appears to support the suggestion of function in at least one case.
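
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of finding a word that occurs imperfectly in most sequences of a set. It is not the authors' method. The brute-force scoring rule, the function names, the toy sequences, and the choice of purine/pyrimidine (R/Y) as the sub-alphabet are all assumptions made for illustration.

```python
from itertools import product

def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))

def best_match(word, seq):
    """Fewest mismatches between word and any same-length window of seq."""
    k = len(word)
    return min(hamming(word, seq[i:i + k]) for i in range(len(seq) - k + 1))

def consensus_words(seqs, k, max_mismatch, alphabet="ACGT"):
    """Score every k-letter word by how many sequences contain it within
    max_mismatch mismatches; return the top score and the words reaching it.
    (Illustrative brute force, not the scoring used in the paper.)"""
    scored = []
    for letters in product(alphabet, repeat=k):
        word = "".join(letters)
        hits = sum(best_match(word, s) <= max_mismatch for s in seqs)
        scored.append((hits, word))
    top = max(h for h, _ in scored)
    return top, sorted(w for h, w in scored if h == top)

def to_purine_pyrimidine(seq):
    """Map DNA to a two-letter sub-alphabet: A/G -> R, C/T -> Y.
    (Assumed sub-alphabet; the abstract does not say which one was used.)"""
    return seq.translate(str.maketrans("AGCT", "RRYY"))

# Made-up toy sequences (not real promoters), each carrying a TATAAT-like word.
toy = ["GGTATAATCC", "CCTATGATGG", "AATACAATTC", "CGTATAATAG"]
hits, words = consensus_words(toy, k=6, max_mismatch=1)

# The same search can be repeated in the reduced alphabet.
ry_hits, ry_words = consensus_words(
    [to_purine_pyrimidine(s) for s in toy], k=6, max_mismatch=0, alphabet="RY"
)
```

Searching in a reduced alphabet merges letters before scoring, which is one way a pattern could be invisible in A, G, C and T yet still show up as a shared word.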
