Li Shuya, Dong Fanghong, Wu Yuexin, Zhang Sai, Zhang Chen, Liu Xiao, Jiang Tao, Zeng Jianyang
School of Life Sciences, Tsinghua University, Beijing 100084, China.
Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China.
Nucleic Acids Res. 2017 Aug 21;45(14):e129. doi: 10.1093/nar/gkx492.
Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false negative issue. Here, we develop a deep boosting based machine learning approach, called DeBooster, to accurately model the binding sequence preferences and identify the corresponding binding targets of RBPs from CLIP-seq data. Comprehensive validation tests have shown that DeBooster can outperform other state-of-the-art approaches in RBP target prediction. In addition, we have demonstrated that DeBooster may provide new insights into understanding the regulatory functions of RBPs, including the binding effects of the RNA helicase MOV10 on mRNA degradation, the potentially different ADAR1 binding behaviors related to its editing activity, as well as the antagonizing effect of RBP binding on miRNA repression. Moreover, DeBooster may provide an effective index to investigate the effect of pathogenic mutations in RBP binding sites, especially those related to splicing events. We expect that DeBooster will be widely applied to analyze large-scale CLIP-seq experimental data and can provide a practically useful tool for novel biological discoveries in understanding the regulatory mechanisms of RBPs. The source code of DeBooster can be downloaded from http://github.com/dongfanghong/deepboost.
表征RNA结合蛋白(RBP)的结合行为对于理解它们在基因表达调控中的功能作用至关重要。然而,当前用于识别RBP靶标的高通量实验方法,如CLIP-seq和RNAcompete,通常存在假阴性问题。在此,我们开发了一种基于深度增强的机器学习方法,称为DeBooster,以准确模拟结合序列偏好并从CLIP-seq数据中识别RBP的相应结合靶标。全面的验证测试表明,DeBooster在RBP靶标预测方面优于其他现有最先进的方法。此外,我们已经证明DeBooster可能为理解RBP的调控功能提供新的见解,包括RNA解旋酶MOV10对mRNA降解的结合作用、与其编辑活性相关的潜在不同的ADAR1结合行为,以及RBP结合对miRNA抑制的拮抗作用。此外,DeBooster可能为研究RBP结合位点中致病突变的影响提供一个有效的指标,特别是那些与剪接事件相关的突变。我们期望DeBooster将被广泛应用于分析大规模的CLIP-seq实验数据,并能为理解RBP调控机制的新生物学发现提供一个实际有用的工具。DeBooster的源代码可从http://github.com/dongfanghong/deepboost下载。