College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, China.
BMC Bioinformatics. 2020 Feb 17;21(1):57. doi: 10.1186/s12859-020-3398-9.
The maintenance of protein structural stability requires the cooperativity among spatially neighboring residues. Previous studies have shown that conserved residues tend to occur clustered together within enzyme active sites and protein-protein/DNA interfaces. It is possible that conserved residues form one or more local clusters in protein tertiary structures as it can facilitate the formation of functional motifs. In this work, we systematically investigate the spatial distributions of conserved residues as well as hot spot ones within protein-RNA interfaces.
The analysis of 191 polypeptide chains from 160 complexes shows the polypeptides interacting with tRNAs evolve relatively rapidly. A statistical analysis of residues in different regions shows that the interface residues are often more conserved, while the most conserved ones are those occurring at protein interiors which maintain the stability of folded polypeptide chains. Additionally, we found that 77.8% of the interfaces have the conserved residues clustered within the entire interface regions. Appling the clustering characteristics to the identification of the real interface, there are 31.1% of cases where the real interfaces are ranked in top 10% of 1000 randomly generated surface patches. In the conserved clusters, the preferred residues are the hydrophobic (Leu, Ile, Met), aromatic (Tyr, Phe, Trp) and interestingly only one positively charged Arg residues. For the hot spot residues, 51.5% of them are situated in the conserved residue clusters, and they are largely consistent with the preferred residue types in the conserved clusters.
The protein-RNA interface residues are often more conserved than non-interface surface ones. The conserved interface residues occur more spatially clustered relative to the entire interface residues. The high consistence of hot spot residue types and the preferred residue types in the conserved clusters has important implications for the experimental alanine scanning mutagenesis study. This work deepens the understanding of the residual organization at protein-RNA interface and is of potential applications in the identification of binding site and hot spot residues.
蛋白质结构稳定性的维持需要空间上相邻残基的协同作用。先前的研究表明,保守残基倾向于在酶活性位点和蛋白质-蛋白质/DNA 界面内聚集在一起。在蛋白质三级结构中,保守残基可能形成一个或多个局部簇,因为这有助于形成功能基序。在这项工作中,我们系统地研究了蛋白质-RNA 界面中保守残基和热点残基的空间分布。
对来自 160 个复合物的 191 条多肽链的分析表明,与 tRNA 相互作用的多肽进化得相对较快。对不同区域残基的统计分析表明,界面残基通常更保守,而最保守的残基位于蛋白质内部,维持折叠多肽链的稳定性。此外,我们发现 77.8%的界面具有保守残基在整个界面区域内聚集。将聚类特征应用于真实界面的识别,在 1000 个随机生成的表面斑块中,有 31.1%的情况下,真实界面排在前 10%。在保守簇中,首选残基是疏水性(亮氨酸、异亮氨酸、甲硫氨酸)、芳香族(酪氨酸、苯丙氨酸、色氨酸),有趣的是只有一个带正电荷的精氨酸残基。对于热点残基,其中 51.5%位于保守残基簇中,它们与保守残基簇中的首选残基类型基本一致。
蛋白质-RNA 界面残基通常比非界面表面残基更保守。与整个界面残基相比,保守的界面残基发生更空间聚集。热点残基类型与保守簇中首选残基类型的高度一致性对实验性丙氨酸扫描诱变研究具有重要意义。这项工作加深了对蛋白质-RNA 界面残基组织的理解,并且在识别结合位点和热点残基方面具有潜在的应用。