Department of Biochemistry and Molecular Biophysics, Center for Computational Biology and Bioinformatics (C2B2), Columbia University, 1130 St Nicholas Ave., New York, NY 10032, USA.
Bioinformatics. 2010 Mar 1;26(5):625-31. doi: 10.1093/bioinformatics/btq012. Epub 2010 Jan 16.
The mutation of amino acids often impacts protein function and structure. Mutations without negative effect sustain evolutionary pressure. We study a particular aspect of structural robustness with respect to mutations: regular protein secondary structure and natively unstructured (intrinsically disordered) regions. Is the formation of regular secondary structure an intrinsic feature of amino acid sequences, or is it a feature that is lost upon mutation and is maintained by evolution against the odds? Similarly, is disorder an intrinsic sequence feature or is it difficult to maintain? To tackle these questions, we in silico mutated native protein sequences into random sequence-like ensembles and monitored the change in predicted secondary structure and disorder.
We established that by our coarse-grained measures for change, predictions and observations were similar, suggesting that our results were not biased by prediction mistakes. Changes in secondary structure and disorder predictions were linearly proportional to the change in sequence. Surprisingly, neither the content nor the length distribution for the predicted secondary structure changed substantially. Regions with long disorder behaved differently in that significantly fewer such regions were predicted after a few mutation steps. Our findings suggest that the formation of regular secondary structure is an intrinsic feature of random amino acid sequences, while the formation of long-disordered regions is not an intrinsic feature of proteins with disordered regions. Put differently, helices and strands appear to be maintained easily by evolution, whereas maintaining disordered regions appears difficult. Neutral mutations with respect to disorder are therefore very unlikely.
氨基酸的突变通常会影响蛋白质的功能和结构。没有负面影响的突变能承受进化压力。我们研究了结构稳健性的一个特定方面,即突变时的规则蛋白质二级结构和天然无结构(固有无序)区域。规则二级结构的形成是氨基酸序列的固有特征,还是在突变后丢失的特征,并且通过进化来维持?同样,无序是固有序列特征还是难以维持?为了解决这些问题,我们在计算机上将天然蛋白质序列突变为随机序列样集合,并监测预测二级结构和无序的变化。
我们确定,通过我们的粗粒度变化度量,预测和观察结果相似,这表明我们的结果不受预测错误的影响。二级结构和无序预测的变化与序列变化呈线性比例关系。令人惊讶的是,预测的二级结构的含量和长度分布都没有发生实质性变化。具有长无序的区域表现不同,在经过几个突变步骤后,预测到的这种区域数量显著减少。我们的发现表明,规则二级结构的形成是随机氨基酸序列的固有特征,而长无序区域的形成不是无序区域的蛋白质的固有特征。换句话说,螺旋和链似乎很容易被进化维持,而维持无序区域则很困难。因此,无序的中性突变是非常不可能的。