Key Laboratory of Bioinformatics, Ministry of Education, School of Life Sciences, Tsinghua University, Beijing, 100084, China.
Sci Rep. 2017 Apr 19;7:46463. doi: 10.1038/srep46463.
Protein thermostability has received growing attention in recent years. Little is known about the determinants of thermal resistance in individual protein families. However, it is known that the mechanism is family-dependent and not identical for all proteins. We present a multivariate statistical analysis to find the determinants of thermostability in one protein family, the serine hydroxymethyltransferase family. Based on principal component analysis, we identified three amino acid fragments as the potential determinants of thermostability. The correlation coefficients between all the putative fragments and the protein thermostability were significant according to multivariable linear regression. Within the fragments, four critical amino acid positions were identified, and they indicated the contributions of Leu, Val, Lys, Asp, Glu, and Phe to thermostability. Moreover, we analyzed the insertions/deletions of amino acids in the sequence, which showed that thermophilic SHMTs tend to insert or delete residues in the C-terminal domain rather than the N-terminal domain. Our study provided a promising approach to perform a preliminary search for the determinants of thermophilic proteins. It could be extended to other protein families to explore their own strategies for adapting to high temperature.
近年来,人们对蛋白质的热稳定性越来越关注。虽然人们对个别蛋白质家族的热抗性决定因素知之甚少,但已知该机制因家族而异,并非适用于所有蛋白质。我们采用多元统计分析方法,寻找丝氨酸羟甲基转移酶家族这一蛋白质家族的热稳定性决定因素。基于主成分分析,我们确定了三个氨基酸片段作为潜在的热稳定性决定因素。根据多元线性回归,所有假定片段与蛋白质热稳定性之间的相关系数均具有统计学意义。在这些片段中,确定了四个关键氨基酸位置,表明亮氨酸、缬氨酸、赖氨酸、天冬氨酸、谷氨酸和苯丙氨酸对热稳定性有贡献。此外,我们分析了序列中氨基酸的插入/缺失,结果表明嗜热 SHMT 倾向于在 C 末端结构域而不是 N 末端结构域插入或删除残基。本研究为寻找耐热蛋白质的决定因素提供了一种很有前途的方法。它可以扩展到其他蛋白质家族,以探索它们适应高温的自身策略。