Ahmad Shandar, Sarai Akinori
Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka, 820-8502, Japan.
BMC Struct Biol. 2011 Feb 1;11:8. doi: 10.1186/1472-6807-11-8.
Protein-RNA interactions play important role in many biological processes such as gene regulation, replication, protein synthesis and virus assembly. Although many structures of various types of protein-RNA complexes have been determined, the mechanism of protein-RNA recognition remains elusive. We have earlier shown that the simplest electrostatic properties viz. charge, dipole and quadrupole moments, calculated from backbone atomic coordinates of proteins are biased relative to other proteins, and these quantities can be used to identify DNA-binding proteins. Closely related, RNA-binding proteins are investigated in this study. In particular, discrimination between various types of RNA-binding proteins, evolutionary conservation of these bulk electrostatic features and effect of conformational changes by complex formation are investigated. Basic binding mechanism of a putative RNA-binding protein (HI1333 from Haemophilus influenza) is suggested as a potential application of this study.
We found that similar to DNA-binding proteins (DBPs), RNA-binding proteins (RBPs) also show significantly higher values of electric moments. However, higher moments in RBPs are found to strongly depend on their functional class: proteins binding to ribosomal RNA (rRNA) constitute the only class with all three of the properties (charge, dipole and quadrupole moments) being higher than control proteins. Neural networks were trained using leave-one-out cross-validation to predict RBPs from control data as well as pair-wise classification capacity between proteins binding to various RNA types. RBPs and control proteins reached up to 78% accuracy measured by the area under the ROC curve. Proteins binding to rRNA are found to be best distinguished (AUC = 79%). Changes in dipole and quadrupole moments between unbound and bound structures were small and these properties are found to be robust under complex formation.
Bulk electric moments of proteins considered here provide insights into target recognition by RNA-binding proteins, as well as ability to recognize one type of RBP from others. These results help in understanding the mechanism of protein-RNA recognition, and identifying RNA-binding proteins.
蛋白质 - RNA相互作用在许多生物过程中发挥着重要作用,如基因调控、复制、蛋白质合成和病毒组装。尽管已经确定了各种类型蛋白质 - RNA复合物的许多结构,但蛋白质 - RNA识别的机制仍然难以捉摸。我们之前已经表明,最简单的静电特性,即从蛋白质主链原子坐标计算得出的电荷、偶极矩和四极矩,相对于其他蛋白质存在偏差,并且这些量可用于识别DNA结合蛋白。与此密切相关的是,本研究对RNA结合蛋白进行了研究。特别是,研究了不同类型RNA结合蛋白之间的区分、这些整体静电特征的进化保守性以及复合物形成引起的构象变化的影响。作为本研究的潜在应用,提出了一种假定的RNA结合蛋白(来自流感嗜血杆菌的HI1333)的基本结合机制。
我们发现,与DNA结合蛋白(DBP)类似,RNA结合蛋白(RBP)也表现出明显更高的电矩值。然而,发现RBP中的高阶矩强烈依赖于它们的功能类别:与核糖体RNA(rRNA)结合的蛋白质是唯一一类所有三种特性(电荷、偶极矩和四极矩)都高于对照蛋白的类别。使用留一法交叉验证训练神经网络,以从对照数据预测RBP以及结合各种RNA类型的蛋白质之间的成对分类能力。通过ROC曲线下的面积测量,RBP和对照蛋白的准确率高达78%。发现与rRNA结合的蛋白质最容易区分(AUC = 79%)。未结合和结合结构之间的偶极矩和四极矩变化很小,并且发现这些特性在复合物形成过程中具有稳健性。
本文考虑的蛋白质整体电矩为RNA结合蛋白的靶标识别以及从其他蛋白中识别一种类型的RBP的能力提供了见解。这些结果有助于理解蛋白质 - RNA识别的机制,并识别RNA结合蛋白。