Pribić R
Faculty of Physics and Astronomy, Free University, Amsterdam, The Netherlands.
Anal Biochem. 1994 Nov 15;223(1):26-34. doi: 10.1006/abio.1994.1541.
Gaining information on the secondary structure of a protein from its spectra is presented as a calibration problem. The secondary structures known from X-ray studies and the spectra of 21 proteins are represented by a linear model. Fourier transform infrared (FTIR) spectra from 1700 to 1600 cm-1, circular dichroism (CD) spectra from 178 to 260 nm, and combined spectra are used; the secondary structure classes of interest are alpha-helices, antiparallel beta-sheets, parallel beta-sheets, beta-turns, and "other." The calibration is solved in two steps: (i) the dependencies between the structures and the spectra of reference proteins are found using the least-squares estimator, and (ii) the secondary structure of a protein is predicted from its spectra using the information gained in the first step and principal component analysis. The problem of information content of the reference spectra is analyzed using the linearly independent pieces of information, the so-called principal components, provided by singular value decomposition. Attention is paid to a number of the principal components sufficient for the prediction, which may be less than the total number. A relative estimable parameter is used to determine unambiguously the number of the components corresponding to the minimum mean square error of the predictor. The analysis gives the solutions to this linear calibration relevant to the underlying protein problem, thus reducing subjective assessments as well as computations.
从蛋白质光谱中获取其二级结构信息被视为一个校准问题。通过X射线研究已知的二级结构以及21种蛋白质的光谱由一个线性模型表示。使用了1700至1600 cm-1的傅里叶变换红外(FTIR)光谱、178至260 nm的圆二色性(CD)光谱以及组合光谱;感兴趣的二级结构类别包括α螺旋、反平行β折叠、平行β折叠、β转角和“其他”。校准分两步解决:(i)使用最小二乘估计器找出参考蛋白质的结构与光谱之间的相关性,(ii)利用第一步中获得的信息和主成分分析从蛋白质光谱预测其二级结构。使用奇异值分解提供的线性独立信息片段(即所谓的主成分)来分析参考光谱的信息含量问题。关注预测所需的主成分数量,其可能少于总数。使用一个相对可估计参数来明确确定对应于预测器最小均方误差的成分数量。该分析给出了与基础蛋白质问题相关的这种线性校准的解决方案,从而减少了主观评估以及计算量。