Suppr超能文献

MCNN:用于RNA-蛋白质结合位点预测的多重卷积神经网络

MCNN: Multiple Convolutional Neural Networks for RNA-Protein Binding Sites Prediction.

作者信息

Pan Zhengsen, Zhou Shusen, Zou Hailin, Liu Chanjuan, Zang Mujun, Liu Tong, Wang Qingjun

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1180-1187. doi: 10.1109/TCBB.2022.3170367. Epub 2023 Apr 3.

Abstract

Computational prediction of the RBP bound sites using features learned from existing annotation knowledge is an effective method because high-throughput experiments are complex, expensive and time-consuming. Many methods have been proposed to predict RNA-protein binding sites. However, the partial information of RNA sequence is not fully used. In this study, we propose multiple convolutional neural networks (MCNN) method, which predicts RNA-protein binding sites by integrating multiple convolutional neural networks constructed by RNA sequence information extracted from windows with different lengths. First, MCNN trains multiple CNNs base on RNA sequences extracted by different window lengths. Second, MCNN can extract more binding patterns of RBPs by combining these trained multiple CNNs previously. Third, MCNN only uses RNA base sequence information for RNA-protein binding sites prediction, which extracts sequence binding features and predicts the result with same architecture. This avoids the information loss of feature extraction step. Our proposed MCNN demonstrates a competitive performance comparing with other methods on a large-scale dataset derived from CLIP-seq, which is an effective method for RNA-protein binding sites prediction. The source code of our proposed MCNN method can be found in https://github.com/biomg/MCNN.

摘要

利用从现有注释知识中学到的特征对RNA结合蛋白(RBP)结合位点进行计算预测是一种有效的方法,因为高通量实验复杂、昂贵且耗时。已经提出了许多方法来预测RNA-蛋白质结合位点。然而,RNA序列的部分信息并未得到充分利用。在本研究中,我们提出了多卷积神经网络(MCNN)方法,该方法通过整合由从不同长度窗口提取的RNA序列信息构建的多个卷积神经网络来预测RNA-蛋白质结合位点。首先,MCNN基于不同窗口长度提取的RNA序列训练多个卷积神经网络。其次,MCNN可以通过组合这些先前训练的多个卷积神经网络来提取更多RBP的结合模式。第三,MCNN仅使用RNA碱基序列信息进行RNA-蛋白质结合位点预测,它提取序列结合特征并使用相同的架构预测结果。这避免了特征提取步骤中的信息丢失。我们提出的MCNN在源自CLIP-seq的大规模数据集上与其他方法相比表现出有竞争力的性能,这是一种预测RNA-蛋白质结合位点的有效方法。我们提出的MCNN方法的源代码可在https://github.com/biomg/MCNN上找到。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验