Brenner S
MRC Molecular Genetics Unit, Cambridge, UK.
Nature. 1988 Aug 11;334(6182):528-30. doi: 10.1038/334528a0.
The advent of techniques for cloning and rapidly sequencing DNA has produced an explosive increase of sequence information for nucleic acids and their inferred proteins. Careful study of this large store of data might give us new insights into the relations between the linear sequences of genes and their functions embodied in the three-dimensional structure of proteins, and also illuminate the origin and evolution of the structural complexity of present-day proteins. Here I argue from such a study that the active site sequences of enzymes that have analogous essential serine residues lie in fact on two lines of descent from an ancient ancestral enzyme which had a cysteine instead of serine in its active site. This is based on the assumption that the two codon types which define the separate lines of descent and which have different bases in two positions could not interconvert by single mutations.
克隆和快速测序DNA技术的出现,使核酸及其推断出的蛋白质的序列信息呈爆炸式增长。仔细研究这一庞大的数据库,可能会让我们对基因的线性序列与其在蛋白质三维结构中所体现的功能之间的关系有新的认识,也有助于阐明当今蛋白质结构复杂性的起源和进化。在此,我通过这样一项研究认为,具有类似必需丝氨酸残基的酶的活性位点序列实际上源自一种古老的祖先酶的两条谱系,这种祖先酶的活性位点含有半胱氨酸而非丝氨酸。这是基于这样一种假设,即定义不同谱系且在两个位置具有不同碱基的两种密码子类型不会通过单个突变相互转换。