• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。

EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.

机构信息

Department of Biomedical Engineering, Faculty of Environment and Life, Beijing International Science and Technology Cooperation Base for Intelligent Physiological Measurement and Clinical Transformation, Beijing University of Technology, Beijing, 100124, China.

出版信息

BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.

DOI:10.1186/s12859-021-04069-9
PMID:33740884
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7980572/
Abstract

BACKGROUND

Non-coding RNA (ncRNA) and protein interactions play essential roles in various physiological and pathological processes. The experimental methods used for predicting ncRNA-protein interactions are time-consuming and labor-intensive. Therefore, there is an increasing demand for computational methods to accurately and efficiently predict ncRNA-protein interactions.

RESULTS

In this work, we presented an ensemble deep learning-based method, EDLMFC, to predict ncRNA-protein interactions using the combination of multi-scale features, including primary sequence features, secondary structure sequence features, and tertiary structure features. Conjoint k-mer was used to extract protein/ncRNA sequence features, integrating tertiary structure features, then fed into an ensemble deep learning model, which combined convolutional neural network (CNN) to learn dominating biological information with bi-directional long short-term memory network (BLSTM) to capture long-range dependencies among the features identified by the CNN. Compared with other state-of-the-art methods under five-fold cross-validation, EDLMFC shows the best performance with accuracy of 93.8%, 89.7%, and 86.1% on RPI1807, NPInter v2.0, and RPI488 datasets, respectively. The results of the independent test demonstrated that EDLMFC can effectively predict potential ncRNA-protein interactions from different organisms. Furtherly, EDLMFC is also shown to predict hub ncRNAs and proteins presented in ncRNA-protein networks of Mus musculus successfully.

CONCLUSIONS

In general, our proposed method EDLMFC improved the accuracy of ncRNA-protein interaction predictions and anticipated providing some helpful guidance on ncRNA functions research. The source code of EDLMFC and the datasets used in this work are available at https://github.com/JingjingWang-87/EDLMFC .

摘要

背景

非编码 RNA(ncRNA)与蛋白质相互作用在多种生理和病理过程中起着至关重要的作用。用于预测 ncRNA-蛋白质相互作用的实验方法既耗时又费力。因此,越来越需要计算方法来准确、有效地预测 ncRNA-蛋白质相互作用。

结果

在这项工作中,我们提出了一种基于集成深度学习的方法 EDLMFC,使用多尺度特征(包括一级序列特征、二级结构序列特征和三级结构特征)的组合来预测 ncRNA-蛋白质相互作用。联合 k-mer 用于提取蛋白质/ncRNA 序列特征,整合三级结构特征,然后将其输入到一个集成深度学习模型中,该模型结合卷积神经网络(CNN)来学习主导生物信息,以及双向长短期记忆网络(BLSTM)来捕获由 CNN 识别的特征之间的长程依赖关系。在 5 倍交叉验证下,与其他最先进的方法相比,EDLMFC 在 RPI1807、NPInter v2.0 和 RPI488 数据集上的准确率分别为 93.8%、89.7%和 86.1%,表现出最佳性能。独立测试的结果表明,EDLMFC 可以有效地从不同的生物体中预测潜在的 ncRNA-蛋白质相互作用。此外,EDLMFC 还成功地预测了 Mus musculus 中 ncRNA-蛋白质网络中的 hub ncRNA 和蛋白质。

结论

总的来说,我们提出的方法 EDLMFC 提高了 ncRNA-蛋白质相互作用预测的准确性,并为 ncRNA 功能研究提供了一些有价值的指导。EDLMFC 的源代码和本工作中使用的数据集可在 https://github.com/JingjingWang-87/EDLMFC 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/8d54debb6902/12859_2021_4069_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/57de1e975cab/12859_2021_4069_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/cb872225bd1d/12859_2021_4069_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/656c6ce7187d/12859_2021_4069_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/9415cf2c9adc/12859_2021_4069_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/8d54debb6902/12859_2021_4069_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/57de1e975cab/12859_2021_4069_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/cb872225bd1d/12859_2021_4069_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/656c6ce7187d/12859_2021_4069_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/9415cf2c9adc/12859_2021_4069_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77e/7980572/8d54debb6902/12859_2021_4069_Fig5_HTML.jpg

相似文献

1
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。
BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.
2
RPI-EDLCN: An Ensemble Deep Learning Framework Based on Capsule Network for ncRNA-Protein Interaction Prediction.RPI-EDLCN:一种基于胶囊网络的 ncRNA-蛋白质相互作用预测的集成深度学习框架。
J Chem Inf Model. 2024 Apr 8;64(7):2221-2235. doi: 10.1021/acs.jcim.3c00377. Epub 2023 May 9.
3
RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。
Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.
4
BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information.BGFE:一种基于改进序列信息的 ncRNA-蛋白质相互作用预测深度学习模型。
Int J Mol Sci. 2019 Feb 23;20(4):978. doi: 10.3390/ijms20040978.
5
A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information.一种利用进化信息对非编码RNA-蛋白质相互作用进行稳健且准确预测的深度学习框架。
Mol Ther Nucleic Acids. 2018 Jun 1;11:337-344. doi: 10.1016/j.omtn.2018.03.001. Epub 2018 Mar 9.
6
RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information.RPI-SE:一种使用序列信息进行 ncRNA-蛋白质相互作用预测的堆叠集成学习框架。
BMC Bioinformatics. 2020 Feb 18;21(1):60. doi: 10.1186/s12859-020-3406-0.
7
Deep forest ensemble learning for classification of alignments of non-coding RNA sequences based on multi-view structure representations.基于多视图结构表示的非编码 RNA 序列比对分类的深度森林集成学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa354.
8
NPI-GNN: Predicting ncRNA-protein interactions with deep graph neural networks.NPI-GNN:利用深度图神经网络预测 ncRNA-蛋白质相互作用。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab051.
9
BioDeepfuse: a hybrid deep learning approach with integrated feature extraction techniques for enhanced non-coding RNA classification.BioDeepfuse:一种混合深度学习方法,结合了集成特征提取技术,用于增强非编码 RNA 分类。
RNA Biol. 2024 Jan;21(1):1-12. doi: 10.1080/15476286.2024.2329451. Epub 2024 Mar 25.
10
GATLGEMF: A graph attention model with line graph embedding multi-complex features for ncRNA-protein interactions prediction.GATLGEMF:一种具有线图嵌入多复杂特征的图注意力模型,用于非编码RNA-蛋白质相互作用预测。
Comput Biol Chem. 2024 Feb;108:108000. doi: 10.1016/j.compbiolchem.2023.108000. Epub 2023 Dec 6.

引用本文的文献

1
RPIPLM: Prediction of ncRNA-protein interaction by post-training a dual-tower pretrained biological model with supervised contrastive learning.RPIPLM:通过使用监督对比学习对双塔预训练生物模型进行训练后预测非编码RNA与蛋白质的相互作用
PLoS One. 2025 Aug 14;20(8):e0329174. doi: 10.1371/journal.pone.0329174. eCollection 2025.
2
BioPrediction-RPI: Democratizing the prediction of interaction between non-coding RNA and protein with end-to-end machine learning.生物预测-RPI:通过端到端机器学习实现非编码RNA与蛋白质相互作用预测的普及。
Comput Struct Biotechnol J. 2024 May 22;23:2267-2276. doi: 10.1016/j.csbj.2024.05.031. eCollection 2024 Dec.
3

本文引用的文献

1
iDRBP_MMC: Identifying DNA-Binding Proteins and RNA-Binding Proteins Based on Multi-Label Learning Model and Motif-Based Convolutional Neural Network.iDRBP_MMC:基于多标签学习模型和基于模体的卷积神经网络的 DNA 结合蛋白和 RNA 结合蛋白的鉴定。
J Mol Biol. 2020 Nov 6;432(22):5860-5875. doi: 10.1016/j.jmb.2020.09.008. Epub 2020 Sep 11.
2
aPRBind: protein-RNA interface prediction by combining sequence and I-TASSER model-based structural features learned with convolutional neural networks.aPRBind:通过结合序列和 I-TASSER 模型的基于结构特征与卷积神经网络学习来预测蛋白质-RNA 界面。
Bioinformatics. 2021 May 17;37(7):937-942. doi: 10.1093/bioinformatics/btaa747.
3
Predicting lncRNA-protein interactions through deep learning framework employing multiple features and random forest algorithm.
通过深度学习框架利用多种特征和随机森林算法预测 lncRNA-蛋白质相互作用。
BMC Bioinformatics. 2024 Mar 12;25(1):108. doi: 10.1186/s12859-024-05727-4.
4
Intelligent Protein Design and Molecular Characterization Techniques: A Comprehensive Review.智能蛋白质设计与分子特征技术:全面综述。
Molecules. 2023 Nov 30;28(23):7865. doi: 10.3390/molecules28237865.
5
EnsembleDL-ATG: Identifying autophagy proteins by integrating their sequence and evolutionary information using an ensemble deep learning framework.集成深度学习自噬蛋白识别方法(EnsembleDL-ATG):使用集成深度学习框架整合自噬蛋白的序列和进化信息来识别自噬蛋白。
Comput Struct Biotechnol J. 2023 Sep 29;21:4836-4848. doi: 10.1016/j.csbj.2023.09.036. eCollection 2023.
6
Exploring the landscape of tools and resources for the analysis of long non-coding RNAs.探索用于长链非编码RNA分析的工具和资源全景。
Comput Struct Biotechnol J. 2023 Sep 29;21:4706-4716. doi: 10.1016/j.csbj.2023.09.041. eCollection 2023.
7
DCiPatho: deep cross-fusion networks for genome scale identification of pathogens.DCiPatho:用于大规模病原体基因组识别的深度交叉融合网络。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad194.
8
ncRPI-LGAT: Prediction of ncRNA-protein interactions with line graph attention network framework.ncRPI-LGAT:基于线图注意力网络框架的非编码RNA-蛋白质相互作用预测
Comput Struct Biotechnol J. 2023 Mar 17;21:2286-2295. doi: 10.1016/j.csbj.2023.03.027. eCollection 2023.
9
Insights into the role of long non-coding RNAs in DNA methylation mediated transcriptional regulation.长链非编码RNA在DNA甲基化介导的转录调控中的作用研究进展
Front Mol Biosci. 2022 Dec 2;9:1067406. doi: 10.3389/fmolb.2022.1067406. eCollection 2022.
10
Recent advances in machine learning methods for predicting LncRNA and disease associations.机器学习方法在预测 LncRNA 和疾病关联中的最新进展。
Front Cell Infect Microbiol. 2022 Nov 30;12:1071972. doi: 10.3389/fcimb.2022.1071972. eCollection 2022.
LPI-CNNCP: Prediction of lncRNA-protein interactions by using convolutional neural network with the copy-padding trick.
LPI-CNNCP:利用卷积神经网络和复制填充技术预测 lncRNA-蛋白质相互作用。
Anal Biochem. 2020 Jul 15;601:113767. doi: 10.1016/j.ab.2020.113767. Epub 2020 May 23.
4
RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning.使用二维深度神经网络集成和迁移学习进行 RNA 二级结构预测。
Nat Commun. 2019 Nov 27;10(1):5407. doi: 10.1038/s41467-019-13395-9.
5
NPInter v4.0: an integrated database of ncRNA interactions.NPInter v4.0:一个 ncRNA 相互作用的综合数据库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D160-D165. doi: 10.1093/nar/gkz969.
6
Cytoscape Automation: empowering workflow-based network analysis.Cytoscape 自动化:赋能基于工作流的网络分析。
Genome Biol. 2019 Sep 2;20(1):185. doi: 10.1186/s13059-019-1758-4.
7
DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy.DM-RPIs:基于堆叠集成策略的 ncRNA-蛋白质相互作用预测
Comput Biol Chem. 2019 Dec;83:107088. doi: 10.1016/j.compbiolchem.2019.107088. Epub 2019 Jul 6.
8
DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction.DeepACLSTM:用于蛋白质二级结构预测的深度非对称卷积长短时记忆神经模型。
BMC Bioinformatics. 2019 Jun 17;20(1):341. doi: 10.1186/s12859-019-2940-0.
9
Serine/Arginine-Rich Splicing Factor 3 Modulates the Alternative Splicing of Cytoplasmic Polyadenylation Element Binding Protein 2.丝氨酸/精氨酸丰富剪接因子 3 调节细胞质多聚腺苷酸化元件结合蛋白 2 的可变剪接。
Mol Cancer Res. 2019 Sep;17(9):1920-1930. doi: 10.1158/1541-7786.MCR-18-1291. Epub 2019 May 28.
10
Prediction of Long Non-Coding RNAs Based on Deep Learning.基于深度学习的长非编码 RNA 预测。
Genes (Basel). 2019 Apr 3;10(4):273. doi: 10.3390/genes10040273.