Giuliani A, Benigni R, Sirabella P, Zbilut J P, Colosimo A
TCE Laboratory, Istituto Superiore di Sanitá, 00161 Roma, Italy.
Biophys J. 2000 Jan;78(1):136-49. doi: 10.1016/S0006-3495(00)76580-5.
Two computational methods widely used in time series analysis were applied to protein sequences, and their ability to derive structural information not directly accessible through classical sequence comparisons methods was assessed. The primary structures of 19 rubredoxins of both mesophilic and thermophilic bacteria, coded with hydrophobicity values of amino acid residues, were considered as time series and were analyzed by 1) recurrence quantification analysis and 2) spectral analysis of the sequence major eigenfunctions. The results of the two methods agreed to a large extent and generated a classification consistent with known 3D structural characteristics of the studied proteins. This classification separated in a clearcut manner a thermophilic protein from mesophilic proteins. The classification of primary structures given by the two dynamical methods was demonstrated to be basically different from classification stemming from classical sequence homology metrics. Moreover, on a more detailed scale, the method was able to discriminate between thermophilic and mesophilic proteins from a set of chimeric sequences generated from the mixing of a mesophilic (Rubr Clopa) and a thermophilic (Rubr Pyrfu) protein. Overall, our results point to a new way of looking at protein sequence comparisons.
两种广泛应用于时间序列分析的计算方法被应用于蛋白质序列,并评估了它们推导通过经典序列比较方法无法直接获得的结构信息的能力。将编码有氨基酸残基疏水性值的嗜温和嗜热细菌的19种红素氧还蛋白的一级结构视为时间序列,并通过以下方法进行分析:1)递归定量分析和2)序列主要本征函数的频谱分析。两种方法的结果在很大程度上一致,并产生了与所研究蛋白质已知三维结构特征一致的分类。这种分类以明确的方式将一种嗜热蛋白与嗜温蛋白区分开来。结果表明,两种动力学方法给出的一级结构分类与基于经典序列同源性指标的分类基本不同。此外,在更详细的尺度上,该方法能够从由嗜温蛋白(Rubr Clopa)和嗜热蛋白(Rubr Pyrfu)混合产生的一组嵌合序列中区分嗜热蛋白和嗜温蛋白。总体而言,我们的结果指出了一种看待蛋白质序列比较的新方法。