Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA.
Proc Natl Acad Sci U S A. 2012 Jan 24;109(4):1170-5. doi: 10.1073/pnas.1119684109. Epub 2012 Jan 11.
Protein structure and function are closely related, especially in functional surfaces, which are local spatial regions that perform the biological functions. Also, protein structures tend to evolve more slowly than amino acid sequences. We have therefore developed a method to classify proteins using the structures of functional surfaces; we call it protein surface classification (PSC). PSC may reflect functional relationships among proteins and may detect evolutionary relationships among highly divergent sequences. We focused on the surfaces of ligand-bound regions because they represent well-defined structures. Specifically, we used structural attributes to measure similarities between binding surfaces and constructed a PSC library of ~2,000 binding surface types from the bound forms. Using flavin mononucleotide-binding proteins and glycosidases as examples, we show how the evolutionary position of an uncharacterized protein can be defined and its function inferred from the characterized members of the same surface subtype. We found that proteins with the same enzyme nomenclature may be divided into subtypes and that two proteins in the same CATH (Class, Architecture, Topology, Homologous superfamily) fold may belong to two different surface types. In conclusion, our approach complements the sequence-based and fold-domain classifications and has the advantage of associating the shape of a protein with its biological function. As an expandable library, PSC provides a resource of spatial patterns for studying the evolution of protein structure and function.
蛋白质的结构和功能密切相关,特别是在功能表面,这是执行生物功能的局部空间区域。此外,蛋白质结构的进化速度往往比氨基酸序列慢。因此,我们开发了一种使用功能表面结构对蛋白质进行分类的方法;我们称之为蛋白质表面分类(PSC)。PSC 可能反映了蛋白质之间的功能关系,并可能检测到高度差异序列之间的进化关系。我们专注于配体结合区域的表面,因为它们代表了明确的结构。具体来说,我们使用结构属性来测量结合表面之间的相似性,并从结合形式构建了一个包含约 2000 种结合表面类型的 PSC 库。我们以黄素单核苷酸结合蛋白和糖苷酶为例,展示了如何从同一表面亚类的特征成员中定义未表征蛋白质的进化位置并推断其功能。我们发现,具有相同酶命名法的蛋白质可能被分为亚类,并且在同一 CATH(类、结构、拓扑、同源超家族)折叠中的两个蛋白质可能属于两个不同的表面类型。总之,我们的方法补充了基于序列和折叠域的分类,并且具有将蛋白质的形状与其生物学功能相关联的优势。作为一个可扩展的库,PSC 为研究蛋白质结构和功能的进化提供了空间模式资源。