Atkinson Holly J, Babbitt Patricia C
Graduate Program in Biological and Medical Informatics, University of California, San Francisco, California, United States of America.
PLoS Comput Biol. 2009 Oct;5(10):e1000541. doi: 10.1371/journal.pcbi.1000541. Epub 2009 Oct 23.
The group of proteins that contain a thioredoxin (Trx) fold is huge and diverse. Assessment of the variation in catalytic machinery of Trx fold proteins is essential in providing a foundation for understanding their functional diversity and predicting the function of the many uncharacterized members of the class. The proteins of the Trx fold class retain common features-including variations on a dithiol CxxC active site motif-that lead to delivery of function. We use protein similarity networks to guide an analysis of how structural and sequence motifs track with catalytic function and taxonomic categories for 4,082 representative sequences spanning the known superfamilies of the Trx fold. Domain structure in the fold class is varied and modular, with 2.8% of sequences containing more than one Trx fold domain. Most member proteins are bacterial. The fold class exhibits many modifications to the CxxC active site motif-only 56.8% of proteins have both cysteines, and no functional groupings have absolute conservation of the expected catalytic motif. Only a small fraction of Trx fold sequences have been functionally characterized. This work provides a global view of the complex distribution of domains and catalytic machinery throughout the fold class, showing that each superfamily contains remnants of the CxxC active site. The unifying context provided by this work can guide the comparison of members of different Trx fold superfamilies to gain insight about their structure-function relationships, illustrated here with the thioredoxins and peroxiredoxins.
包含硫氧还蛋白(Trx)折叠结构的蛋白质种类繁多且各不相同。评估Trx折叠蛋白催化机制的变化对于理解其功能多样性以及预测该类别中许多未表征成员的功能至关重要。Trx折叠类别的蛋白质保留了共同特征,包括二硫醇CxxC活性位点基序的变体,这些变体导致了功能的发挥。我们使用蛋白质相似性网络来指导对4082个跨越Trx折叠已知超家族的代表性序列的结构和序列基序如何与催化功能及分类类别相关联的分析。该折叠类别的结构域结构多样且模块化其中2.8%的序列包含多个Trx折叠结构域。大多数成员蛋白是细菌来源的。该折叠类别对CxxC活性位点基序有许多修饰,只有56.8% 的蛋白质含有两个半胱氨酸,并且没有功能分组对预期的催化基序具有绝对保守性。只有一小部分Trx折叠序列具有功能特征。这项工作提供了整个折叠类别中结构域和催化机制复杂分布的全局视图,表明每个超家族都包含CxxC活性位点的残余部分。这项工作提供的统一背景可以指导对不同Trx折叠超家族成员的比较,以深入了解它们的结构-功能关系,本文以硫氧还蛋白和过氧化物氧还蛋白为例进行了说明。