Center for Bioinformatics Tübingen, Eberhard-Karls-Universität Tübingen, Tübingen, Germany.
PLoS Comput Biol. 2010 Jan 8;6(1):e1000636. doi: 10.1371/journal.pcbi.1000636.
An important aspect of the functional annotation of enzymes is not only the type of reaction catalysed by an enzyme, but also the substrate specificity, which can vary widely within the same family. In many cases, prediction of family membership and even substrate specificity is possible from enzyme sequence alone, using a nearest neighbour classification rule. However, the combination of structural information and sequence information can improve the interpretability and accuracy of predictive models. The method presented here, Active Site Classification (ASC), automatically extracts the residues lining the active site from one representative three-dimensional structure and the corresponding residues from sequences of other members of the family. From a set of representatives with known substrate specificity, a Support Vector Machine (SVM) can then learn a model of substrate specificity. Applied to a sequence of unknown specificity, the SVM can then predict the most likely substrate. The models can also be analysed to reveal the underlying structural reasons determining substrate specificities and thus yield valuable insights into mechanisms of enzyme specificity. We illustrate the high prediction accuracy achieved on two benchmark data sets and the structural insights gained from ASC by a detailed analysis of the family of decarboxylating dehydrogenases. The ASC web service is available at http://asc.informatik.uni-tuebingen.de/.
酶的功能注释的一个重要方面不仅是酶催化的反应类型,还有底物特异性,而在同一酶家族中,其底物特异性可能差异很大。在许多情况下,仅通过酶序列使用最近邻分类规则,就可以预测家族成员甚至底物特异性。然而,结构信息和序列信息的结合可以提高预测模型的可解释性和准确性。本文提出的方法,活性位点分类(Active Site Classification,ASC),可以自动从一个代表性的三维结构中提取活性位点的残基,并从家族中其他成员的序列中提取相应的残基。然后,从一组具有已知底物特异性的代表结构中,支持向量机(Support Vector Machine,SVM)可以学习底物特异性的模型。对于未知特异性的序列,可以使用 SVM 预测最可能的底物。这些模型还可以进行分析,揭示决定底物特异性的潜在结构原因,从而深入了解酶特异性的机制。我们通过对脱羧脱氢酶家族的详细分析,说明了在两个基准数据集上实现的高预测准确性和从 ASC 获得的结构见解。ASC 网络服务可在 http://asc.informatik.uni-tuebingen.de/ 获得。