Louis-Jeune Caroline, Andrade-Navarro Miguel A, Perez-Iratxeta Carol
Sprott Center for Stem Cell Research, Ottawa Hospital Research Institute, Ottawa, Ontario K1H 8L6, Canada.
Proteins. 2012 Feb;80(2):374-81. doi: 10.1002/prot.23188. Epub 2011 Nov 17.
Circular dichroism (CD) is a spectroscopic technique commonly used to investigate the structure of proteins. Major secondary structure types, alpha-helices and beta-strands, produce distinctive CD spectra. Thus, by comparing the CD spectrum of a protein of interest to a reference set consisting of CD spectra of proteins of known structure, predictive methods can estimate the secondary structure of the protein. Currently available methods, including K2D2, use such experimental CD reference sets, which are very small in size when compared to the number of tertiary structures available in the Protein Data Bank (PDB). Conversely, given a PDB structure, it is possible to predict a theoretical CD spectrum from it. The methodological framework for this calculation was established long ago but only recently a convenient implementation called DichroCalc has been developed. In this study, we set to determine whether theoretically derived spectra could be used as reference set for accurate CD based predictions of secondary structure. We used DichroCalc to calculate the theoretical CD spectra of a nonredundant set of structures representing most proteins in the PDB, and applied a straightforward approach for predicting protein secondary structure content using these theoretical CD spectra as reference set. We show that this method improves the predictions, particularly for the wavelength interval between 200 and 240 nm and for beta-strand content. We have implemented this method, called K2D3, in a publicly accessible web server at http://www. ogic.ca/projects/k2d3.
圆二色性(CD)是一种常用于研究蛋白质结构的光谱技术。主要的二级结构类型,即α-螺旋和β-链,会产生独特的CD光谱。因此,通过将感兴趣蛋白质的CD光谱与由已知结构蛋白质的CD光谱组成的参考集进行比较,预测方法可以估计该蛋白质的二级结构。目前可用的方法,包括K2D2,使用这样的实验CD参考集,与蛋白质数据库(PDB)中可用的三级结构数量相比,其规模非常小。相反,给定一个PDB结构,就有可能从中预测理论CD光谱。这种计算的方法框架早就建立起来了,但直到最近才开发出一种名为DichroCalc的便捷实现方法。在本研究中,我们着手确定理论推导的光谱是否可以用作基于CD的二级结构准确预测的参考集。我们使用DichroCalc计算了一组代表PDB中大多数蛋白质的非冗余结构的理论CD光谱,并应用一种直接的方法,使用这些理论CD光谱作为参考集来预测蛋白质二级结构含量。我们表明,这种方法改进了预测,特别是对于200至240纳米的波长区间和β-链含量。我们已经在http://www.ogic.ca/projects/k2d3上的一个可公开访问的网络服务器中实现了这种称为K2D3的方法。