Thornton Joe
Columbia University Earth Institute, Columbia University, #2430, New York, NY 10027, USA.
EXS. 2002(92):191-207. doi: 10.1007/978-3-0348-8114-2_14.
How have proteins taken on the remarkable diversity of biochemical and physiological functions necessary to create and maintain complex organisms? The majority of proteins are organized hierarchically into families and superfamilies, reflecting an ancient and continuing process of gene duplication and divergence. The techniques of molecular phylogenetics, developed to recover the nested hierarchy of taxa from character information in their gene sequences, can also reconstruct the evolutionary relationships among genes and provide a conceptual foundation for comparative evolutionary analysis of proteins and their functions. In this review, I outline the application of phylogenetic approaches to issues in gene family studies, beginning with the inference of phylogeny and the assessment of the two types of homology by which genes in a family can be related: orthology (common descent from a cladogenetic event) and paralogy (common descent from a gene duplication event). I show how the phylogenetic approach makes possible novel kinds of comparative analysis, including detection of exon shuffling, reconstruction of the evolutionary diversification of gene families, tracing of evolutionary change in protein function at the amino acid level, and prediction of structure-function relationships. A marriage of the principles of phylogenetic systematics with the copious sequence data being generated by molecular biology and genomics promises unprecedented insights into the nature of biological organization and the historical processes that created it.
蛋白质是如何具备创造和维持复杂生物体所需的显著多样的生化及生理功能的呢?大多数蛋白质会分层组织成家族和超家族,这反映了一个古老且持续的基因复制和分化过程。分子系统发育学技术旨在从基因序列中的特征信息恢复分类单元的嵌套层次结构,它同样能够重建基因之间的进化关系,并为蛋白质及其功能的比较进化分析提供概念基础。在这篇综述中,我概述了系统发育方法在基因家族研究问题中的应用,首先是系统发育的推断以及对家族中基因可能存在关联的两种同源性的评估:直系同源(来自分支发生事件的共同祖先)和平行同源(来自基因复制事件的共同祖先)。我展示了系统发育方法如何使新型的比较分析成为可能,包括外显子重排的检测、基因家族进化多样化的重建、氨基酸水平上蛋白质功能进化变化的追踪以及结构 - 功能关系的预测。系统发育分类学原理与分子生物学和基因组学产生的大量序列数据相结合,有望为生物组织的本质以及创造它的历史过程带来前所未有的深刻见解。