Zhu Weimin, Ding Xiaohan, Shen Hong-Bin, Pan Xiaoyong
Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
J Mol Biol. 2025 Apr 15;437(8):169010. doi: 10.1016/j.jmb.2025.169010. Epub 2025 Feb 15.
RNAs are emerging as promising therapeutic targets, yet identifying small molecules that bind to them remains a significant challenge in drug discovery. This underscores the crucial role of computational modeling in predicting RNA-small molecule binding sites. However, accurate and efficient computational methods for identifying these interactions are still lacking. Recently, advances in large language models (LLMs), previously successful in DNA and protein research, have spurred the development of RNA-specific LLMs. These models leverage vast unlabeled RNA sequences to autonomously learn semantic representations with the goal of enhancing downstream tasks, particularly those constrained by limited annotated data. Here, we develop RNABind, an embedding-informed geometric deep learning framework to detect RNA-small molecule binding sites from RNA structures. RNABind integrates RNA LLMs into advanced geometric deep learning networks, which encodes both RNA sequence and structure information. To evaluate RNABind, we first compile the largest RNA-small molecule interaction dataset from the entire multi-chain complex structure instead of single-chain RNAs. Extensive experiments demonstrate that RNABind outperforms existing state-of-the-art methods. Besides, we conduct an extensive experimental evaluation of eight pre-trained RNA LLMs, assessing their performance on the binding site prediction task within a unified experimental protocol. In summary, RNABind provides a powerful tool on exploring RNA-small molecule binding site prediction, which paves the way for future innovations in the RNA-targeted drug discovery.
RNA正成为有前景的治疗靶点,但在药物研发中,识别与RNA结合的小分子仍然是一项重大挑战。这凸显了计算建模在预测RNA-小分子结合位点方面的关键作用。然而,目前仍缺乏准确且高效的识别这些相互作用的计算方法。最近,此前在DNA和蛋白质研究中取得成功的大语言模型(LLM)取得的进展,推动了针对RNA的LLM的发展。这些模型利用大量未标记的RNA序列自主学习语义表示,以增强下游任务,特别是那些受有限注释数据限制的任务。在此,我们开发了RNABind,这是一种基于嵌入的几何深度学习框架,用于从RNA结构中检测RNA-小分子结合位点。RNABind将RNA的LLM集成到先进的几何深度学习网络中,该网络对RNA序列和结构信息进行编码。为了评估RNABind,我们首先从整个多链复合结构而非单链RNA中编译了最大的RNA-小分子相互作用数据集。大量实验表明,RNABind优于现有的最先进方法。此外,我们对八个预训练的RNA的LLM进行了广泛的实验评估,在统一的实验方案内评估它们在结合位点预测任务上的表现。总之,RNABind为探索RNA-小分子结合位点预测提供了一个强大的工具,为未来基于RNA的药物研发创新铺平了道路。