• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。

IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.

作者信息

Pan Xiaoyong, Fan Yong-Xian, Yan Junchi, Shen Hong-Bin

机构信息

Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Dongchuan Road, Shanghai, China.

Present Address: Department of Veterinary Clinical and Animal Sciences, University of Copenhagen, Copenhagen, Denmark.

出版信息

BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.

DOI:10.1186/s12864-016-2931-8
PMID:27506469
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4979166/
Abstract

BACKGROUND

Non-coding RNAs (ncRNAs) play crucial roles in many biological processes, such as post-transcription of gene regulation. ncRNAs mainly function through interaction with RNA binding proteins (RBPs). To understand the function of a ncRNA, a fundamental step is to identify which protein is involved into its interaction. Therefore it is promising to computationally predict RBPs, where the major challenge is that the interaction pattern or motif is difficult to be found.

RESULTS

In this study, we propose a computational method IPMiner (Interaction Pattern Miner) to predict ncRNA-protein interactions from sequences, which makes use of deep learning and further improves its performance using stacked ensembling. One of the IPMiner's typical merits is that it is able to mine the hidden sequential interaction patterns from sequence composition features of protein and RNA sequences using stacked autoencoder, and then the learned hidden features are fed into random forest models. Finally, stacked ensembling is used to integrate different predictors to further improve the prediction performance. The experimental results indicate that IPMiner achieves superior performance on the tested lncRNA-protein interaction dataset with an accuracy of 0.891, sensitivity of 0.939, specificity of 0.831, precision of 0.945 and Matthews correlation coefficient of 0.784, respectively. We further comprehensively investigate IPMiner on other RNA-protein interaction datasets, which yields better performance than the state-of-the-art methods, and the performance has an increase of over 20 % on some tested benchmarked datasets. In addition, we further apply IPMiner for large-scale prediction of ncRNA-protein network, that achieves promising prediction performance.

CONCLUSION

By integrating deep neural network and stacked ensembling, from simple sequence composition features, IPMiner can automatically learn high-level abstraction features, which had strong discriminant ability for RNA-protein detection. IPMiner achieved high performance on our constructed lncRNA-protein benchmark dataset and other RNA-protein datasets. IPMiner tool is available at http://www.csbio.sjtu.edu.cn/bioinf/IPMiner .

摘要

背景

非编码RNA(ncRNAs)在许多生物学过程中发挥着关键作用,如基因调控的转录后过程。ncRNAs主要通过与RNA结合蛋白(RBPs)相互作用发挥功能。为了解ncRNA的功能,一个基本步骤是确定参与其相互作用的蛋白质。因此,通过计算预测RBPs具有很大的前景,其中主要挑战在于难以找到相互作用模式或基序。

结果

在本研究中,我们提出了一种计算方法IPMiner(相互作用模式挖掘器),用于从序列中预测ncRNA-蛋白质相互作用,该方法利用深度学习,并通过堆叠集成进一步提高其性能。IPMiner的一个典型优点是,它能够使用堆叠自动编码器从蛋白质和RNA序列的序列组成特征中挖掘隐藏的序列相互作用模式,然后将学习到的隐藏特征输入到随机森林模型中。最后,使用堆叠集成来整合不同的预测器,以进一步提高预测性能。实验结果表明,IPMiner在测试的lncRNA-蛋白质相互作用数据集上表现优异,准确率为0.891,灵敏度为0.939,特异性为0.831,精确率为0.945,马修斯相关系数为0.784。我们进一步在其他RNA-蛋白质相互作用数据集上全面研究了IPMiner,其性能优于现有方法,并且在一些测试的基准数据集上性能提高了20%以上。此外,我们进一步将IPMiner应用于ncRNA-蛋白质网络的大规模预测,取得了有前景的预测性能。

结论

通过整合深度神经网络和堆叠集成,IPMiner可以从简单的序列组成特征中自动学习具有强判别能力的高级抽象特征,用于RNA-蛋白质检测。IPMiner在我们构建的lncRNA-蛋白质基准数据集和其他RNA-蛋白质数据集上取得了高性能。IPMiner工具可在http://www.csbio.sjtu.edu.cn/bioinf/IPMiner获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/7904842cadb5/12864_2016_2931_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/4ea46e214aff/12864_2016_2931_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/ab64e1c84777/12864_2016_2931_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/fcbbc3992555/12864_2016_2931_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/7904842cadb5/12864_2016_2931_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/4ea46e214aff/12864_2016_2931_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/ab64e1c84777/12864_2016_2931_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/fcbbc3992555/12864_2016_2931_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/218e/4979166/7904842cadb5/12864_2016_2931_Fig4_HTML.jpg

相似文献

1
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
2
A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information.一种利用进化信息对非编码RNA-蛋白质相互作用进行稳健且准确预测的深度学习框架。
Mol Ther Nucleic Acids. 2018 Jun 1;11:337-344. doi: 10.1016/j.omtn.2018.03.001. Epub 2018 Mar 9.
3
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
4
BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information.BGFE:一种基于改进序列信息的 ncRNA-蛋白质相互作用预测深度学习模型。
Int J Mol Sci. 2019 Feb 23;20(4):978. doi: 10.3390/ijms20040978.
5
RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence.RPiRLS:定量预测与任何已知序列蛋白质相互作用的 RNA。
Molecules. 2018 Feb 28;23(3):540. doi: 10.3390/molecules23030540.
6
RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。
Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.
7
Advances in Computational Methodologies for Classification and Sub-Cellular Locality Prediction of Non-Coding RNAs.计算方法在非编码 RNA 分类和亚细胞定位预测中的研究进展。
Int J Mol Sci. 2021 Aug 13;22(16):8719. doi: 10.3390/ijms22168719.
8
DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy.DM-RPIs:基于堆叠集成策略的 ncRNA-蛋白质相互作用预测
Comput Biol Chem. 2019 Dec;83:107088. doi: 10.1016/j.compbiolchem.2019.107088. Epub 2019 Jul 6.
9
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。
BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.
10
Deep forest ensemble learning for classification of alignments of non-coding RNA sequences based on multi-view structure representations.基于多视图结构表示的非编码 RNA 序列比对分类的深度森林集成学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa354.

引用本文的文献

1
LncPTPred: predicting lncRNA-protein interaction based on crosslinking and immunoprecipitation (CLIP-Seq) data.LncPTPred:基于交联免疫沉淀(CLIP-Seq)数据预测长链非编码RNA与蛋白质的相互作用
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf432.
2
RPIPLM: Prediction of ncRNA-protein interaction by post-training a dual-tower pretrained biological model with supervised contrastive learning.RPIPLM:通过使用监督对比学习对双塔预训练生物模型进行训练后预测非编码RNA与蛋白质的相互作用
PLoS One. 2025 Aug 14;20(8):e0329174. doi: 10.1371/journal.pone.0329174. eCollection 2025.
3
Graph-RPI: predicting RNA-protein interactions via graph autoencoder and self-supervised learning strategies.

本文引用的文献

1
Predicting effects of noncoding variants with deep learning-based sequence model.使用基于深度学习的序列模型预测非编码变异的影响。
Nat Methods. 2015 Oct;12(10):931-4. doi: 10.1038/nmeth.3547. Epub 2015 Aug 24.
2
Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.通过深度学习预测 DNA 和 RNA 结合蛋白的序列特异性。
Nat Biotechnol. 2015 Aug;33(8):831-8. doi: 10.1038/nbt.3300. Epub 2015 Jul 27.
3
Revealing protein-lncRNA interaction.揭示蛋白质与长链非编码RNA的相互作用。
Graph-RPI:通过图自动编码器和自监督学习策略预测RNA-蛋白质相互作用
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf292.
4
LPItabformer: Enhancing generalization in predicting lncRNA-protein interactions via a tabular Transformer.LPItabformer:通过表格Transformer增强lncRNA-蛋白质相互作用预测中的泛化能力。
Comput Struct Biotechnol J. 2025 May 29;27:2323-2335. doi: 10.1016/j.csbj.2025.05.050. eCollection 2025.
5
RNA-protein interaction prediction using network-guided deep learning.利用网络引导的深度学习进行RNA-蛋白质相互作用预测。
Commun Biol. 2025 Feb 16;8(1):247. doi: 10.1038/s42003-025-07694-9.
6
Predicting lncRNA-protein interactions using a hybrid deep learning model with dinucleotide-codon fusion feature encoding.使用具有二核苷酸-密码子融合特征编码的混合深度学习模型预测长链非编码RNA-蛋白质相互作用。
BMC Genomics. 2024 Dec 28;25(1):1253. doi: 10.1186/s12864-024-11168-3.
7
Structure-Based Prediction of lncRNA-Protein Interactions by Deep Learning.基于深度学习的长链非编码RNA-蛋白质相互作用的结构预测
Methods Mol Biol. 2025;2883:363-376. doi: 10.1007/978-1-0716-4290-0_16.
8
CBIL-VHPLI: a model for predicting viral-host protein-lncRNA interactions based on machine learning and transfer learning.CBIL-VHPLI:一种基于机器学习和迁移学习的预测病毒-宿主蛋白-lncRNA 相互作用的模型。
Sci Rep. 2024 Jul 30;14(1):17549. doi: 10.1038/s41598-024-68750-8.
9
BioPrediction-RPI: Democratizing the prediction of interaction between non-coding RNA and protein with end-to-end machine learning.生物预测-RPI:通过端到端机器学习实现非编码RNA与蛋白质相互作用预测的普及。
Comput Struct Biotechnol J. 2024 May 22;23:2267-2276. doi: 10.1016/j.csbj.2024.05.031. eCollection 2024 Dec.
10
Prediction of LncRNA-protein Interactions Using Auto-Encoder, SE-ResNet Models and Transfer Learning.使用自动编码器、SE-ResNet模型和迁移学习预测长链非编码RNA与蛋白质的相互作用
Microrna. 2024;13(2):155-165. doi: 10.2174/0122115366288068240322064431.
Brief Bioinform. 2016 Jan;17(1):106-16. doi: 10.1093/bib/bbv031. Epub 2015 Jun 2.
4
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
5
Transcriptomic landscape of lncRNAs in inflammatory bowel disease.lncRNAs 在炎症性肠病中的转录组景观。
Genome Med. 2015 May 13;7(1):39. doi: 10.1186/s13073-015-0162-2. eCollection 2015.
6
Prediction of nucleic acid binding probability in proteins: a neighboring residue network based score.蛋白质中核酸结合概率的预测:基于相邻残基网络的评分
Nucleic Acids Res. 2015 Jun 23;43(11):5340-51. doi: 10.1093/nar/gkv446. Epub 2015 May 4.
7
A comprehensive comparative review of sequence-based predictors of DNA- and RNA-binding residues.基于序列的DNA和RNA结合残基预测因子的全面比较综述。
Brief Bioinform. 2016 Jan;17(1):88-105. doi: 10.1093/bib/bbv023. Epub 2015 May 1.
8
Computationally predicting protein-RNA interactions using only positive and unlabeled examples.仅使用正例和未标记示例进行蛋白质-RNA相互作用的计算预测。
J Bioinform Comput Biol. 2015 Jun;13(3):1541005. doi: 10.1142/S021972001541005X. Epub 2015 Feb 8.
9
RPI-Pred: predicting ncRNA-protein interaction using sequence and structural information.RPI-Pred:利用序列和结构信息预测非编码RNA与蛋白质的相互作用
Nucleic Acids Res. 2015 Feb 18;43(3):1370-9. doi: 10.1093/nar/gkv020. Epub 2015 Jan 21.
10
High-throughput characterization of protein-RNA interactions.蛋白质-RNA相互作用的高通量表征
Brief Funct Genomics. 2015 Jan;14(1):74-89. doi: 10.1093/bfgp/elu047. Epub 2014 Dec 13.