Chothia C
MRC Laboratory of Molecular Biology, Cambridge, UK.
Dev Suppl. 1994:27-33.
The evolution of development involves the development of new proteins. Estimates based on the initial results of the genome projects, and on the data banks of protein sequences and structures, suggest that the large majority of proteins come from no more than one thousand families. Members of a family are descended from a common ancestor. Protein families evolve by gene duplication and mutation. Mutations change the conformation of the peripheral regions of proteins; i.e. the regions that are involved, at least in part, in their function. If mutations proceed until only 20% of the residues in related proteins are identical, it is common for the conformational changes to affect half the structure. Most of the proteins involved in the interactions of cells, and in their assembly to form multicellular organisms, are mosaic proteins. These are large and have a modular structure, in that they are built of sets of homologous domains that are drawn from a relatively small number of protein families. Patthy's model for the evolution of mosaic proteins describes how they arose through the insertion of introns into genes, gene duplications and intronic recombination. The rates of progress in the genome sequencing projects, and in protein structure analyses, means that in a few years we will have a fairly complete outline description of the molecules responsible for the structure and function of organisms at several different levels of developmental complexity. This should make a major contribution to our understanding of the evolution of development.
发育的进化涉及新蛋白质的产生。基于基因组计划的初步结果以及蛋白质序列和结构数据库的估计表明,绝大多数蛋白质来自不超过一千个家族。一个家族的成员源自共同的祖先。蛋白质家族通过基因复制和突变而进化。突变会改变蛋白质外围区域的构象,即至少部分涉及其功能的区域。如果突变持续进行,直到相关蛋白质中只有20%的残基相同,那么构象变化通常会影响一半的结构。大多数参与细胞相互作用以及细胞组装形成多细胞生物的蛋白质都是镶嵌蛋白。这些蛋白质很大且具有模块化结构,因为它们由从相对较少的蛋白质家族中提取的同源结构域集合构成。帕蒂关于镶嵌蛋白进化的模型描述了它们是如何通过内含子插入基因、基因复制和内含子重组而产生的。基因组测序计划和蛋白质结构分析的进展速度意味着,再过几年,我们将对负责不同发育复杂程度生物体结构和功能的分子有一个相当完整的概述。这将对我们理解发育的进化做出重大贡献。