• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SAWRPI:一种基于序列信息预测非编码RNA-蛋白质相互作用的具有自适应权重的堆叠集成框架。

SAWRPI: A Stacking Ensemble Framework With Adaptive Weight for Predicting ncRNA-Protein Interactions Using Sequence Information.

作者信息

Ren Zhong-Hao, Yu Chang-Qing, Li Li-Ping, You Zhu-Hong, Guan Yong-Jian, Li Yue-Chao, Pan Jie

机构信息

School of Information Engineering, Xijing University, Xi'an, China.

School of Computer Science, Northwestern Polytechnical University, Xi'an, China.

出版信息

Front Genet. 2022 Feb 28;13:839540. doi: 10.3389/fgene.2022.839540. eCollection 2022.

DOI:10.3389/fgene.2022.839540
PMID:35360836
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8963817/
Abstract

Non-coding RNAs (ncRNAs) take essential effects on biological processes, like gene regulation. One critical way of ncRNA executing biological functions is interactions between ncRNA and RNA binding proteins (RBPs). Identifying proteins, involving ncRNA-protein interactions, can well understand the function ncRNA. Many high-throughput experiment have been applied to recognize the interactions. As a consequence of these approaches are time- and labor-consuming, currently, a great number of computational methods have been developed to improve and advance the ncRNA-protein interactions research. However, these methods may be not available to all RNAs and proteins, particularly processing new RNAs and proteins. Additionally, most of them cannot process well with long sequence. In this work, a computational method SAWRPI is proposed to make prediction of ncRNA-protein through sequence information. More specifically, the raw features of protein and ncRNA are firstly extracted through the k-mer sparse matrix with SVD reduction and learning nucleic acid symbols by natural language processing with local fusion strategy, respectively. Then, to classify easily, Hilbert Transformation is exploited to transform raw feature data to the new feature space. Finally, stacking ensemble strategy is adopted to learn high-level abstraction features automatically and generate final prediction results. To confirm the robustness and stability, three different datasets containing two kinds of interactions are utilized. In comparison with state-of-the-art methods and other results classifying or feature extracting strategies, SAWRPI achieved high performance on three datasets, containing two kinds of lncRNA-protein interactions. Upon our finding, SAWRPI is a trustworthy, robust, yet simple and can be used as a beneficial supplement to the task of predicting ncRNA-protein interactions.

摘要

非编码RNA(ncRNAs)在生物过程中发挥着重要作用,如基因调控。ncRNA执行生物学功能的一种关键方式是与RNA结合蛋白(RBPs)相互作用。识别参与ncRNA-蛋白质相互作用的蛋白质,有助于深入了解ncRNA的功能。许多高通量实验已被用于识别这些相互作用。由于这些方法既耗时又费力,目前已开发出大量计算方法来改进和推进ncRNA-蛋白质相互作用的研究。然而,这些方法可能并非适用于所有的RNA和蛋白质,尤其是处理新的RNA和蛋白质时。此外,它们中的大多数在处理长序列时效果不佳。在这项工作中,我们提出了一种计算方法SAWRPI,通过序列信息来预测ncRNA-蛋白质相互作用。具体而言,首先分别通过具有奇异值分解降维的k-mer稀疏矩阵提取蛋白质和ncRNA的原始特征,并通过采用局部融合策略的自然语言处理来学习核酸符号。然后,为了便于分类,利用希尔伯特变换将原始特征数据转换到新的特征空间。最后,采用堆叠集成策略自动学习高级抽象特征并生成最终预测结果。为了验证其稳健性和稳定性,我们使用了包含两种相互作用的三个不同数据集。与现有方法以及其他分类或特征提取策略的结果相比,SAWRPI在包含两种lncRNA-蛋白质相互作用的三个数据集上均取得了高性能。根据我们的发现,SAWRPI是一种可靠、稳健且简单的方法,可作为预测ncRNA-蛋白质相互作用任务的有益补充。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/883aa1da764a/fgene-13-839540-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/85c310629ae2/fgene-13-839540-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/9c3714f80a1f/fgene-13-839540-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/065133300ec2/fgene-13-839540-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/2421be124ec4/fgene-13-839540-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/c52c0b34a1c8/fgene-13-839540-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/8f59624c1bc4/fgene-13-839540-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/a1c6cc9b441c/fgene-13-839540-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/883aa1da764a/fgene-13-839540-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/85c310629ae2/fgene-13-839540-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/9c3714f80a1f/fgene-13-839540-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/065133300ec2/fgene-13-839540-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/2421be124ec4/fgene-13-839540-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/c52c0b34a1c8/fgene-13-839540-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/8f59624c1bc4/fgene-13-839540-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/a1c6cc9b441c/fgene-13-839540-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eaf/8963817/883aa1da764a/fgene-13-839540-g008.jpg

相似文献

1
SAWRPI: A Stacking Ensemble Framework With Adaptive Weight for Predicting ncRNA-Protein Interactions Using Sequence Information.SAWRPI:一种基于序列信息预测非编码RNA-蛋白质相互作用的具有自适应权重的堆叠集成框架。
Front Genet. 2022 Feb 28;13:839540. doi: 10.3389/fgene.2022.839540. eCollection 2022.
2
RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information.RPI-SE:一种使用序列信息进行 ncRNA-蛋白质相互作用预测的堆叠集成学习框架。
BMC Bioinformatics. 2020 Feb 18;21(1):60. doi: 10.1186/s12859-020-3406-0.
3
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。
BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.
4
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
5
RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。
Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.
6
RPI-EDLCN: An Ensemble Deep Learning Framework Based on Capsule Network for ncRNA-Protein Interaction Prediction.RPI-EDLCN:一种基于胶囊网络的 ncRNA-蛋白质相互作用预测的集成深度学习框架。
J Chem Inf Model. 2024 Apr 8;64(7):2221-2235. doi: 10.1021/acs.jcim.3c00377. Epub 2023 May 9.
7
Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction.用于计算预测非编码RNA-蛋白质相互作用的复杂特征构建
Front Genet. 2019 Feb 1;10:18. doi: 10.3389/fgene.2019.00018. eCollection 2019.
8
MFPred: prediction of ncRNA families based on multi-feature fusion.MFPred:基于多特征融合的 ncRNA 家族预测。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad303.
9
Identify ncRNA Subcellular Localization via Graph Regularized k-Local Hyperplane Distance Nearest Neighbor Model on Multi-Kernel Learning.基于多核学习的图正则化k-局部超平面距离最近邻模型识别非编码RNA亚细胞定位
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3517-3529. doi: 10.1109/TCBB.2021.3107621. Epub 2022 Dec 8.
10
BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information.BGFE:一种基于改进序列信息的 ncRNA-蛋白质相互作用预测深度学习模型。
Int J Mol Sci. 2019 Feb 23;20(4):978. doi: 10.3390/ijms20040978.

引用本文的文献

1
Model ensembling as a tool to form interpretable multi-omic predictors of cancer pharmacosensitivity.模型集成作为一种工具,用于构建癌症药物敏感性的可解释多组学预测因子。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae567.
2
A spatial hierarchical network learning framework for drug repositioning allowing interpretation from macro to micro scale.一种用于药物重定位的空间分层网络学习框架,允许从宏观到微观尺度进行解释。
Commun Biol. 2024 Oct 30;7(1):1413. doi: 10.1038/s42003-024-07107-3.
3
An Ensemble Classifiers for Improved Prediction of Native-Non-Native Protein-Protein Interaction.

本文引用的文献

1
NPI-RGCNAE: Fast Predicting ncRNA-Protein Interactions Using the Relational Graph Convolutional Network Auto-Encoder.NPI-RGCNAE:基于关系图卷积网络自动编码器的 ncRNA-蛋白质相互作用快速预测
IEEE J Biomed Health Inform. 2022 Apr;26(4):1861-1871. doi: 10.1109/JBHI.2021.3122527. Epub 2022 Apr 14.
2
Prediction of Protein-Protein Interactions in , Maize, and Rice by Combining Deep Neural Network With Discrete Hilbert Transform.通过结合深度神经网络与离散希尔伯特变换预测玉米和水稻中的蛋白质-蛋白质相互作用
Front Genet. 2021 Sep 20;12:745228. doi: 10.3389/fgene.2021.745228. eCollection 2021.
3
DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding.
用于改进天然-非天然蛋白质-蛋白质相互作用预测的集成分类器。
Int J Mol Sci. 2024 May 29;25(11):5957. doi: 10.3390/ijms25115957.
4
Predicting lncRNA-protein interactions through deep learning framework employing multiple features and random forest algorithm.通过深度学习框架利用多种特征和随机森林算法预测 lncRNA-蛋白质相互作用。
BMC Bioinformatics. 2024 Mar 12;25(1):108. doi: 10.1186/s12859-024-05727-4.
5
DeepMPF: deep learning framework for predicting drug-target interactions based on multi-modal representation with meta-path semantic analysis.DeepMPF:基于多模态表示和元路径语义分析的深度学习框架,用于预测药物-靶标相互作用。
J Transl Med. 2023 Jan 25;21(1):48. doi: 10.1186/s12967-023-03876-3.
DeepLncLoc:一种基于子序列嵌入的深度学习框架,用于长非编码 RNA 亚细胞定位预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab360.
4
iDeepSubMito: identification of protein submitochondrial localization with deep learning.iDeepSubMito:基于深度学习的蛋白质亚线粒体定位鉴定。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab288.
5
iCircRBP-DHN: identification of circRNA-RBP interaction sites using deep hierarchical network.iCircRBP-DHN:使用深度层次网络识别 circRNA-RBP 相互作用位点。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa274.
6
Denoising Protein-Protein interaction network via variational graph auto-encoder for protein complex detection.通过变分图自动编码器去噪蛋白质-蛋白质相互作用网络以进行蛋白质复合物检测
J Bioinform Comput Biol. 2020 Jun;18(3):2040010. doi: 10.1142/S0219720020400107.
7
RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information.RPI-SE:一种使用序列信息进行 ncRNA-蛋白质相互作用预测的堆叠集成学习框架。
BMC Bioinformatics. 2020 Feb 18;21(1):60. doi: 10.1186/s12859-020-3406-0.
8
Learning distributed representations of RNA and protein sequences and its application for predicting lncRNA-protein interactions.学习RNA和蛋白质序列的分布式表示及其在预测长链非编码RNA-蛋白质相互作用中的应用。
Comput Struct Biotechnol J. 2019 Nov 30;18:20-26. doi: 10.1016/j.csbj.2019.11.004. eCollection 2020.
9
Construction and Analysis of Molecular Association Network by Combining Behavior Representation and Node Attributes.结合行为表征与节点属性构建和分析分子关联网络
Front Genet. 2019 Nov 7;10:1106. doi: 10.3389/fgene.2019.01106. eCollection 2019.
10
DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy.DM-RPIs:基于堆叠集成策略的 ncRNA-蛋白质相互作用预测
Comput Biol Chem. 2019 Dec;83:107088. doi: 10.1016/j.compbiolchem.2019.107088. Epub 2019 Jul 6.