Yao Sen, Flight Robert M, Rouchka Eric C, Moseley Hunter N B
School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, Kentucky, 40292.
Department of Computer Engineering and Computer Science, University of Louisville, Louisville, Kentucky, 40292.
Proteins. 2015 Aug;83(8):1470-87. doi: 10.1002/prot.24834. Epub 2015 Jun 13.
Zinc metalloproteins are involved in many biological processes and play crucial biochemical roles across all domains of life. Local structure around the zinc ion, especially the coordination geometry (CG), is dictated by the protein sequence and is often directly related to the function of the protein. Current methodologies in characterizing zinc metalloproteins' CG consider only previously reported CG models based mainly on nonbiological chemical context. Exceptions to these canonical CG models are either misclassified or discarded as "outliers." Thus, we developed a less-biased method that directly handles potential exceptions without pre-assuming any CG model. Our study shows that numerous exceptions could actually be further classified and that new CG models are needed to characterize them. Also, these new CG models are cross-validated by strong correlation between independent structural and functional annotation distance metrics, which is partially lost if these new CGs models are ignored. Furthermore, these new CG models exhibit functional propensities distinct from the canonical CG models.
锌金属蛋白参与许多生物过程,并在生命的所有领域中发挥关键的生化作用。锌离子周围的局部结构,尤其是配位几何结构(CG),由蛋白质序列决定,并且通常与蛋白质的功能直接相关。目前表征锌金属蛋白CG的方法仅考虑先前报道的主要基于非生物化学背景的CG模型。这些典型CG模型的例外情况要么被错误分类,要么被作为“异常值”丢弃。因此,我们开发了一种偏差较小的方法,该方法无需预先假定任何CG模型即可直接处理潜在的例外情况。我们的研究表明,许多例外情况实际上可以进一步分类,并且需要新的CG模型来对它们进行表征。此外,这些新的CG模型通过独立的结构和功能注释距离度量之间的强相关性进行交叉验证,如果忽略这些新的CG模型,这种相关性会部分丧失。此外,这些新的CG模型表现出与典型CG模型不同的功能倾向。