Shen Xilin, Hou Yayan, Wang Xueer, Zhang Chunyong, Liu Jilei, Shen Hongru, Wang Wei, Yang Yichen, Yang Meng, Li Yang, Zhang Jin, Sun Yan, Chen Kexin, Shi Lei, Li Xiangchun
Tianjin Cancer Institute, Tianjin's Clinical Research Center for Cancer, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China.
Department of Pathology, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, National Clinical Research Center for Cancer, Tianjin Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China.
Patterns (N Y). 2025 Jan 10;6(1):101150. doi: 10.1016/j.patter.2024.101150.
Protein-RNA interactions play pivotal roles in regulating transcription, translation, and RNA metabolism. Characterizing these interactions offers key insights into RNA dysregulation mechanisms. Here, we introduce Reformer, a deep learning model that predicts protein-RNA binding affinity from sequence data. Trained on 225 enhanced cross-linking and immunoprecipitation sequencing (eCLIP-seq) datasets encompassing 155 RNA-binding proteins across three cell lines, Reformer achieves high accuracy in predicting binding affinity at single-base resolution. The model uncovers binding motifs that are often undetectable through traditional eCLIP-seq methods. Notably, the motifs learned by Reformer are shown to correlate with RNA processing functions. Validation via electrophoretic mobility shift assays confirms the model's precision in quantifying the impact of mutations on RNA regulation. In summary, Reformer improves the resolution of RNA-protein interaction predictions and aids in prioritizing mutations that influence RNA regulation.
蛋白质-RNA相互作用在调节转录、翻译和RNA代谢中起着关键作用。表征这些相互作用有助于深入了解RNA失调机制。在此,我们介绍Reformer,一种从序列数据预测蛋白质-RNA结合亲和力的深度学习模型。该模型在包含三种细胞系中155种RNA结合蛋白的225个增强交联和免疫沉淀测序(eCLIP-seq)数据集上进行训练,在单碱基分辨率下预测结合亲和力方面具有高精度。该模型揭示了通过传统eCLIP-seq方法通常无法检测到的结合基序。值得注意的是,Reformer学习到的基序与RNA加工功能相关。通过电泳迁移率变动分析进行的验证证实了该模型在量化突变对RNA调节影响方面的准确性。总之,Reformer提高了RNA-蛋白质相互作用预测的分辨率,并有助于对影响RNA调节的突变进行优先级排序。