Department of Mathematics and Statistics, Georgia State University, Atlanta, GA 30303, USA.
Proteins. 2012 Dec;80(12):2666-79. doi: 10.1002/prot.24149. Epub 2012 Jul 31.
Identifying Ca(2+) -binding sites in proteins is the first step toward understanding the molecular basis of diseases related to Ca(2+) -binding proteins. Currently, these sites are identified in structures either through X-ray crystallography or NMR analysis. However, Ca(2+) -binding sites are not always visible in X-ray structures due to flexibility in the binding region or low occupancy in a Ca(2+) -binding site. Similarly, both Ca(2+) and its ligand oxygens are not directly observed in NMR structures. To improve our ability to predict Ca(2+) -binding sites in both X-ray and NMR structures, we report a new graph theory algorithm (MUG(C) ) to predict Ca(2+) -binding sites. Using carbon atoms covalently bonded to the chelating oxygen atoms, and without explicit reference to side-chain oxygen ligand co-ordinates, MUG(C) is able to achieve 94% sensitivity with 76% selectivity on a dataset of X-ray structures composed of 43 Ca(2+) -binding proteins. Additionally, prediction of Ca(2+) -binding sites in NMR structures was obtained by MUG(C) using a different set of parameters, which were determined by the analysis of both Ca(2+) -constrained and unconstrained Ca(2+) -loaded structures derived from NMR data. MUG(C) identified 20 of 21 Ca(2+) -binding sites in NMR structures inferred without the use of Ca(2+) constraints. MUG(C) predictions are also highly selective for Ca(2+) -binding sites as analyses of binding sites for Mg(2+) , Zn(2+) , and Pb(2+) were not identified as Ca(2+) -binding sites. These results indicate that the geometric arrangement of the second-shell carbon cluster is sufficient not only for accurate identification of Ca(2+) -binding sites in NMR and X-ray structures but also for selective differentiation between Ca(2+) and other relevant divalent cations.
鉴定蛋白质中的 Ca(2+)结合位点是理解与 Ca(2+)结合蛋白相关疾病的分子基础的第一步。目前,这些位点是通过 X 射线晶体学或 NMR 分析在结构中确定的。然而,由于结合区域的灵活性或 Ca(2+)结合位点的低占有率,Ca(2+)结合位点并不总是在 X 射线结构中可见。同样,在 NMR 结构中也不能直接观察到 Ca(2+)和其配体氧。为了提高我们在 X 射线和 NMR 结构中预测 Ca(2+)结合位点的能力,我们报告了一种新的图论算法(MUG(C))来预测 Ca(2+)结合位点。MUG(C)使用与螯合氧原子共价键合的碳原子,并且不明确引用侧链氧配体坐标,在由 43 个 Ca(2+)结合蛋白组成的 X 射线结构数据集上实现了 94%的灵敏度和 76%的选择性。此外,通过使用不同的参数集,MUG(C)可以预测 NMR 结构中的 Ca(2+)结合位点,这些参数集是通过对源自 NMR 数据的 Ca(2+)约束和未约束 Ca(2+)加载结构的分析确定的。MUG(C)在没有使用 Ca(2+)约束的情况下,从 NMR 结构中推断出 21 个 Ca(2+)结合位点中的 20 个。MUG(C)的预测对 Ca(2+)结合位点也具有高度的选择性,因为对 Mg(2+)、Zn(2+)和 Pb(2+)结合位点的分析并未被识别为 Ca(2+)结合位点。这些结果表明,第二壳层碳原子簇的几何排列不仅足以准确识别 NMR 和 X 射线结构中的 Ca(2+)结合位点,而且足以区分 Ca(2+)和其他相关二价阳离子。