Computational Health Center, Helmholtz Center Munich, Munich, Germany.
Department of Biology, University of Copenhagen, Copenhagen, Denmark.
Genome Biol. 2023 Aug 4;24(1):180. doi: 10.1186/s13059-023-03015-7.
We present RBPNet, a novel deep learning method, which predicts CLIP-seq crosslink count distribution from RNA sequence at single-nucleotide resolution. By training on up to a million regions, RBPNet achieves high generalization on eCLIP, iCLIP and miCLIP assays, outperforming state-of-the-art classifiers. RBPNet performs bias correction by modeling the raw signal as a mixture of the protein-specific and background signal. Through model interrogation via Integrated Gradients, RBPNet identifies predictive sub-sequences that correspond to known and novel binding motifs and enables variant-impact scoring via in silico mutagenesis. Together, RBPNet improves imputation of protein-RNA interactions, as well as mechanistic interpretation of predictions.
我们提出了 RBPNet,这是一种新的深度学习方法,可以从单核苷酸分辨率的 RNA 序列预测 CLIP-seq 交联计数分布。通过在多达一百万的区域进行训练,RBPNet 在 eCLIP、iCLIP 和 miCLIP 测定中实现了高度的泛化能力,优于最先进的分类器。RBPNet 通过将原始信号建模为蛋白质特异性信号和背景信号的混合物来进行偏差校正。通过通过集成梯度进行模型询问,RBPNet 确定了与已知和新的结合基序相对应的预测子序列,并通过计算机诱变实现了变体影响评分。总的来说,RBPNet 提高了蛋白质-RNA 相互作用的推断,以及对预测的机制解释。