Roca-Martínez Joel, Kang Hyun-Seo, Sattler Michael, Vranken Wim
Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels 1050, Belgium.
Structural Biology Brussels, Vrije Universiteit Brussel, Brussels 1050, Belgium.
NAR Genom Bioinform. 2024 Jan 29;6(1):lqae002. doi: 10.1093/nargab/lqae002. eCollection 2024 Mar.
The RNA recognition motif (RRM) is the most prevalent RNA binding domain in eukaryotes and is involved in most RNA metabolism processes. Single RRM domains have a limited RNA specificity and affinity and tend to be accompanied by other RNA binding domains, frequently additional RRMs that contribute to an avidity effect. Within multi-RRM proteins, the most common arrangement are tandem RRMs, with two domains connected by a variable linker. Despite their prevalence, little is known about the features that lead to specific arrangements, and especially the role of the connecting linker. In this work, we present a novel and robust way to investigate the relative domain orientation in multi-domain proteins using inter-domain vectors referenced to a stable secondary structure element. We apply this method to tandem RRM domains and cluster experimental tandem RRM structures according to their inter-domain and linker-domain contacts, and report how this correlates with their orientation. By extending our analysis to AlphaFold2 predicted structures, with particular attention to the inter-domain predicted aligned error, we identify new orientations not reported experimentally. Our analysis provides novel insights across a range of tandem RRM orientations that may help for the design of proteins with a specific RNA binding mode.
RNA识别基序(RRM)是真核生物中最普遍的RNA结合结构域,参与大多数RNA代谢过程。单个RRM结构域的RNA特异性和亲和力有限,并且往往伴随着其他RNA结合结构域,通常是有助于亲和效应的额外RRM。在多RRM蛋白中,最常见的排列方式是串联RRM,两个结构域由可变连接子连接。尽管它们很普遍,但对于导致特定排列的特征,尤其是连接子的作用,人们知之甚少。在这项工作中,我们提出了一种新颖且强大的方法,利用参考稳定二级结构元件的结构域间向量来研究多结构域蛋白中结构域的相对方向。我们将此方法应用于串联RRM结构域,并根据它们的结构域间和连接子-结构域接触对实验性串联RRM结构进行聚类,并报告这与它们的方向如何相关。通过将我们的分析扩展到AlphaFold2预测结构,特别关注结构域间预测比对误差,我们识别出实验未报道的新方向。我们的分析为一系列串联RRM方向提供了新见解,这可能有助于设计具有特定RNA结合模式的蛋白质。