Kornhaber Gregory J, Snyder David, Moseley Hunter N B, Montelione Gaetano T
Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ 08854, USA.
J Biomol NMR. 2006 Apr;34(4):259-69. doi: 10.1007/s10858-006-0027-5.
Although a significant number of proteins include bound metals as part of their structure, the identification of amino acid residues coordinated to non-paramagnetic metals by NMR remains a challenge. Metal ligands can stabilize the native structure and/or play critical catalytic roles in the underlying biochemistry. An atom's chemical shift is exquisitely sensitive to its electronic environment. Chemical shift data can provide valuable insights into structural features, including metal ligation. In this study, we demonstrate that overlapped 13Cbeta chemical shift distributions of Zn-ligated and non-metal-ligated cysteine residues are largely resolved by the inclusion of the corresponding 13Calpha chemical shift information, together with secondary structural information. We demonstrate this with a bivariate distribution plot, and statistically with a multivariate analysis of variance (MANOVA) and hierarchical logistic regression analysis. Using 287 13Calpha/13Cbeta shift pairs from 79 proteins with known three-dimensional structures, including 86 13Calpha and 13Cbeta shifts for 43 Zn-ligated cysteine residues, along with corresponding oxidation state and secondary structure information, we have built a logistic regression model that distinguishes between oxidized cystines, reduced (non-metal ligated) cysteines, and Zn-ligated cysteines. Classifying cysteines/cystines with a statistical model incorporating all three phenomena resulted in a predictor of Zn ligation with a recall, precision and F-measure of 83.7%, and an accuracy of 95.1%. This model was applied in the analysis of Bacillus subtilis IscU, a protein involved in iron-sulfur cluster assembly. The model predicts that all three cysteines of IscU are metal ligands. We confirmed these results by (i) examining the effect of metal chelation on the NMR spectrum of IscU, and (ii) inductively coupled plasma mass spectrometry analysis. To gain further insight into the frequency of occurrence of non-cysteine Zn ligands, we analyzed the Protein Data Bank and found that 78% of the Zn ligands are histidine and cysteine (with nearly identical frequencies), and 18% are acidic residues aspartate and glutamate.
尽管大量蛋白质在其结构中包含结合金属,但通过核磁共振(NMR)鉴定与顺磁性金属配位的氨基酸残基仍然是一项挑战。金属配体可以稳定天然结构和/或在基础生物化学中发挥关键催化作用。原子的化学位移对其电子环境极为敏感。化学位移数据可以提供有关结构特征(包括金属配位)的有价值见解。在本研究中,我们证明,通过纳入相应的¹³Cα化学位移信息以及二级结构信息,锌配位和非金属配位的半胱氨酸残基重叠的¹³Cβ化学位移分布在很大程度上得以解析。我们通过双变量分布图进行了展示,并通过多变量方差分析(MANOVA)和分层逻辑回归分析进行了统计验证。利用来自79个已知三维结构蛋白质的287个¹³Cα/¹³Cβ位移对,包括43个锌配位半胱氨酸残基的86个¹³Cα和¹³Cβ位移,以及相应的氧化态和二级结构信息,我们建立了一个逻辑回归模型,用于区分氧化型胱氨酸、还原型(非金属配位)半胱氨酸和锌配位半胱氨酸。使用包含所有这三种现象的统计模型对半胱氨酸/胱氨酸进行分类,得到了一个锌配位预测器,其召回率、精确率和F值为83.7%,准确率为95.1%。该模型应用于枯草芽孢杆菌IscU(一种参与铁硫簇组装的蛋白质)的分析。该模型预测IscU的所有三个半胱氨酸都是金属配体。我们通过(i)检查金属螯合对IscU核磁共振谱的影响,以及(ii)电感耦合等离子体质谱分析证实了这些结果。为了进一步深入了解非半胱氨酸锌配体的出现频率,我们分析了蛋白质数据库,发现78%的锌配体是组氨酸和半胱氨酸(频率几乎相同),18%是酸性残基天冬氨酸和谷氨酸。