Srouji John R, Xu Anting, Park Annsea, Kirsch Jack F, Brenner Steven E
Plant and Microbial Biology Department, University of California, Berkeley, California, 94720.
Molecular and Cell Biology Department, University of California, Berkeley, California, 94720.
Proteins. 2017 May;85(5):775-811. doi: 10.1002/prot.25223. Epub 2017 Mar 16.
The Nudix homology clan encompasses over 80,000 protein domains from all three domains of life, defined by homology to each other. Proteins with a domain from this clan fall into four general functional classes: pyrophosphohydrolases, isopentenyl diphosphate isomerases (IDIs), adenine/guanine mismatch-specific adenine glycosylases (A/G-specific adenine glycosylases), and nonenzymatic activities such as protein/protein interaction and transcriptional regulation. The largest group, pyrophosphohydrolases, encompasses more than 100 distinct hydrolase specificities. To understand the evolution of this vast number of activities, we assembled and analyzed experimental and structural data for 205 Nudix proteins collected from the literature. For this family, we corrected erroneous functions or provided more appropriate descriptions for 53 annotations in the Gene Ontology Annotation database, and we propose 275 new experimentally based annotations. We manually constructed a structure-guided sequence alignment of 78 Nudix proteins. Using the structural alignment as a seed, we then made an alignment of 347 "select" Nudix homology domains, curated from structurally determined, functionally characterized, or phylogenetically important Nudix domains. Based on our review of Nudix pyrophosphohydrolase structures and specificities, we further analyzed a loop region downstream of the Nudix hydrolase motif that was previously shown to contact the substrate molecule and to possess known functional motifs. This loop region provides a potential structural basis for the functional radiation and evolution of substrate specificity within the hydrolase family. Finally, phylogenetic analyses of the 347 select protein domains and of the complete Nudix homology clan revealed general monophyly with regard to function and a few instances of probable homoplasy. © 2016 Wiley Periodicals, Inc.