Suppr超能文献

大肠杆菌的蛋白质大小为14 kDa的倍数:结构域概念及进化意义

Proteins of Escherichia coli come in sizes that are multiples of 14 kDa: domain concepts and evolutionary implications.

作者信息

Savageau M A

出版信息

Proc Natl Acad Sci U S A. 1986 Mar;83(5):1198-202. doi: 10.1073/pnas.83.5.1198.

Abstract

Initial attempts to correlate the distribution of gene density (number of gene loci per unit length on the linkage map) with the distribution of lengths of coding sequences have led to the observation that 46% of approximately 1000 sampled proteins in Escherichia coli have molecular masses of n X 14,000 +/- 2500 daltons (n = 1, 2, ...). This clustering around multiples of 14,000 contrasts with the 36% one would expect in these ranges if the sizes were uniformly distributed. The entire distribution is well fit by a sum of normal or lognormal distributions located at multiples of 14,000, which suggests that the percentage of E. coli proteins governed by the underlying sizing mechanism is much greater than 50%. Clustering of protein molecular sizes around multiples of a unit size also is suggested by the distribution of well-characterized HeLa cell proteins. The distribution of gene lengths for E. coli suggests regular clustering, which implies that the clustering of protein molecular masses is not an artifact of the molecular mass measurement by gel electrophoresis. These observations suggest the existence of a fundamental structural unit. The rather uniform size of this structural unit (without any apparent sequence homology) suggests that a general principle such as geometrical or physical optimization at the DNA or protein level is responsible. This suggestion is discussed in relation to experimental evidence for the domain structure of proteins and to existing hypotheses that attempt to account for these domains. Microevolution would appear to be accommodated by incremental changes within this fundamental unit, whereas macroevolution would appear to involve "quantum" changes to the next stable size of protein.

摘要

最初尝试将基因密度(连锁图谱上每单位长度的基因座数量)分布与编码序列长度分布关联起来,结果发现大肠杆菌中约1000个抽样蛋白质里有46%的分子量为n×14,000±2500道尔顿(n = 1、2……)。这种围绕14,000倍数的聚类现象,与如果大小均匀分布时在这些范围内预期的36%形成对比。整个分布可以很好地由位于14,000倍数处的正态分布或对数正态分布之和拟合,这表明受潜在大小确定机制控制的大肠杆菌蛋白质百分比远大于50%。特征明确的海拉细胞蛋白质的分布也表明蛋白质分子大小围绕单位大小的倍数聚类。大肠杆菌基因长度的分布表明存在规则聚类,这意味着蛋白质分子量的聚类不是凝胶电泳测量分子量的人为产物。这些观察结果表明存在一个基本结构单元。这个结构单元相当均匀的大小(没有任何明显的序列同源性)表明,诸如DNA或蛋白质水平的几何或物理优化等一般原则是其原因。结合蛋白质结构域结构的实验证据以及试图解释这些结构域的现有假说,对这一观点进行了讨论。微观进化似乎可以通过这个基本单元内的增量变化来实现,而宏观进化似乎涉及到蛋白质下一个稳定大小的“量子”变化。

相似文献

7
Sequences of the E. coli uvrC gene and protein.大肠杆菌uvrC基因及蛋白质序列。
Nucleic Acids Res. 1984 Jun 11;12(11):4593-608. doi: 10.1093/nar/12.11.4593.

引用本文的文献

6
Underlying order in protein sequence organization.蛋白质序列组织中的潜在秩序。
Proc Natl Acad Sci U S A. 1994 Apr 26;91(9):4044-7. doi: 10.1073/pnas.91.9.4044.
8
Periodic recurrence of methionines: fossil of gene fusion?甲硫氨酸的周期性重复:基因融合的化石?
Proc Natl Acad Sci U S A. 1995 Jan 17;92(2):557-60. doi: 10.1073/pnas.92.2.557.

本文引用的文献

1
The evolutionary origins of the immunoglobulins.免疫球蛋白的进化起源。
Proc Natl Acad Sci U S A. 1966 Dec;56(6):1762-9. doi: 10.1073/pnas.56.6.1762.
3
The anatomy and taxonomy of protein structure.蛋白质结构的解剖学与分类学。
Adv Protein Chem. 1981;34:167-339. doi: 10.1016/s0065-3233(08)60520-3.
5
Structure of chromatin and the linking number of DNA.染色质结构与DNA的连环数
Proc Natl Acad Sci U S A. 1981 Mar;78(3):1461-5. doi: 10.1073/pnas.78.3.1461.
7
Exons--present from the beginning?外显子——从一开始就存在吗?
Nature. 1983;306(5943):535-7. doi: 10.1038/306535a0.
10
Immunoglobulin genes.免疫球蛋白基因
Annu Rev Immunol. 1983;1:499-528. doi: 10.1146/annurev.iy.01.040183.002435.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验