Schaeffer R Dustin, Kinch Lisa N, Liao Yuxing, Grishin Nick V
Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.
Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.
Protein Sci. 2016 Jul;25(7):1188-203. doi: 10.1002/pro.2893. Epub 2016 Feb 21.
Proteins and their domains evolve by a set of events commonly including the duplication and divergence of small motifs. The presence of short repetitive regions in domains has generally constituted a difficult case for structural domain classifications and their hierarchies. We developed the Evolutionary Classification Of protein Domains (ECOD) in part to implement a new schema for the classification of these types of proteins. Here we document the ways in which ECOD classifies proteins with small internal repeats, widespread functional motifs, and assemblies of small domain-like fragments in its evolutionary schema. We illustrate the ways in which the structural genomics project impacted the classification and characterization of new structural domains and sequence families over the decade.
蛋白质及其结构域通过一系列通常包括小基序的复制和分化的事件而进化。结构域中短重复区域的存在通常给结构域分类及其层次结构带来难题。我们开发了蛋白质结构域进化分类法(ECOD),部分原因是为了实现对这类蛋白质进行分类的新方案。在此,我们记录了ECOD在其进化模式中对具有小内部重复序列、广泛分布的功能基序以及小结构域样片段组装体的蛋白质进行分类的方式。我们阐述了结构基因组学项目在过去十年中对新结构域和序列家族的分类与表征产生影响的方式。