• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于序列和进化信息整合对非编码RNA-蛋白质相互作用进行准确预测

Accurate Prediction of ncRNA-Protein Interactions From the Integration of Sequence and Evolutionary Information.

作者信息

Zhan Zhao-Hui, You Zhu-Hong, Li Li-Ping, Zhou Yong, Yi Hai-Cheng

机构信息

School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China.

Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China.

出版信息

Front Genet. 2018 Oct 8;9:458. doi: 10.3389/fgene.2018.00458. eCollection 2018.

DOI:10.3389/fgene.2018.00458
PMID:30349558
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6186793/
Abstract

Non-coding RNA (ncRNA) plays a crucial role in numerous biological processes including gene expression and post-transcriptional gene regulation. The biological function of ncRNA is mostly realized by binding with related proteins. Therefore, an accurate understanding of interactions between ncRNA and protein has a significant impact on current biological research. The major challenge at this stage is the waste of a great deal of redundant time and resource consumed on classification in traditional interaction pattern prediction methods. Fortunately, an efficient classifier named LightGBM can solve this difficulty of long time consumption. In this study, we employed LightGBM as the integrated classifier and proposed a novel computational model for predicting ncRNA and protein interactions. More specifically, the pseudo-Zernike Moments and singular value decomposition algorithm are employed to extract the discriminative features from protein and ncRNA sequences. On four widely used datasets RPI369, RPI488, RPI1807, and RPI2241, we evaluated the performance of LGBM and obtained an superior performance with AUC of 0.799, 0.914, 0.989, and 0.762, respectively. The experimental results of 10-fold cross-validation shown that the proposed method performs much better than existing methods in predicting ncRNA-protein interaction patterns, which could be used as a useful tool in proteomics research.

摘要

非编码RNA(ncRNA)在包括基因表达和转录后基因调控在内的众多生物过程中发挥着关键作用。ncRNA的生物学功能大多通过与相关蛋白质结合来实现。因此,准确理解ncRNA与蛋白质之间的相互作用对当前的生物学研究具有重大影响。现阶段的主要挑战在于传统相互作用模式预测方法在分类过程中浪费了大量冗余的时间和资源。幸运的是,一种名为LightGBM的高效分类器可以解决这个耗时的难题。在本研究中,我们将LightGBM用作集成分类器,并提出了一种预测ncRNA与蛋白质相互作用的新型计算模型。更具体地说,采用伪泽尼克矩和奇异值分解算法从蛋白质和ncRNA序列中提取判别特征。在四个广泛使用的数据集RPI369、RPI488、RPI1807和RPI2241上,我们评估了LGBM的性能,分别获得了0.799、0.914、0.989和0.762的AUC优异性能。十折交叉验证的实验结果表明,所提出的方法在预测ncRNA - 蛋白质相互作用模式方面比现有方法表现得好得多,可作为蛋白质组学研究中的有用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/81c15b26e2cf/fgene-09-00458-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/ff73e6040780/fgene-09-00458-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/9e247f5f11a9/fgene-09-00458-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/cb714a0b308c/fgene-09-00458-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/bc33576f9c95/fgene-09-00458-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/8499a0c66cce/fgene-09-00458-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/81c15b26e2cf/fgene-09-00458-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/ff73e6040780/fgene-09-00458-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/9e247f5f11a9/fgene-09-00458-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/cb714a0b308c/fgene-09-00458-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/bc33576f9c95/fgene-09-00458-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/8499a0c66cce/fgene-09-00458-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f070/6186793/81c15b26e2cf/fgene-09-00458-g0006.jpg

相似文献

1
Accurate Prediction of ncRNA-Protein Interactions From the Integration of Sequence and Evolutionary Information.基于序列和进化信息整合对非编码RNA-蛋白质相互作用进行准确预测
Front Genet. 2018 Oct 8;9:458. doi: 10.3389/fgene.2018.00458. eCollection 2018.
2
A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information.一种利用进化信息对非编码RNA-蛋白质相互作用进行稳健且准确预测的深度学习框架。
Mol Ther Nucleic Acids. 2018 Jun 1;11:337-344. doi: 10.1016/j.omtn.2018.03.001. Epub 2018 Mar 9.
3
RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。
Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.
4
RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information.RPI-SE:一种使用序列信息进行 ncRNA-蛋白质相互作用预测的堆叠集成学习框架。
BMC Bioinformatics. 2020 Feb 18;21(1):60. doi: 10.1186/s12859-020-3406-0.
5
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。
BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.
6
BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information.BGFE:一种基于改进序列信息的 ncRNA-蛋白质相互作用预测深度学习模型。
Int J Mol Sci. 2019 Feb 23;20(4):978. doi: 10.3390/ijms20040978.
7
Learning distributed representations of RNA and protein sequences and its application for predicting lncRNA-protein interactions.学习RNA和蛋白质序列的分布式表示及其在预测长链非编码RNA-蛋白质相互作用中的应用。
Comput Struct Biotechnol J. 2019 Nov 30;18:20-26. doi: 10.1016/j.csbj.2019.11.004. eCollection 2020.
8
Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction.用于计算预测非编码RNA-蛋白质相互作用的复杂特征构建
Front Genet. 2019 Feb 1;10:18. doi: 10.3389/fgene.2019.00018. eCollection 2019.
9
Combining High Speed ELM Learning with a Deep Convolutional Neural Network Feature Encoding for Predicting Protein-RNA Interactions.将高速 ELM 学习与深度卷积神经网络特征编码相结合,用于预测蛋白质-RNA 相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):972-980. doi: 10.1109/TCBB.2018.2874267. Epub 2018 Oct 5.
10
HeadTailTransfer: An efficient sampling method to improve the performance of graph neural network method in predicting sparse ncRNA-protein interactions.头尾转移:一种提高图神经网络方法预测稀疏 ncRNA-蛋白质相互作用性能的有效采样方法。
Comput Biol Med. 2023 May;157:106783. doi: 10.1016/j.compbiomed.2023.106783. Epub 2023 Mar 15.

引用本文的文献

1
Advancing non-alcoholic fatty liver disease prediction: a comprehensive machine learning approach integrating SHAP interpretability and multi-cohort validation.推进非酒精性脂肪性肝病预测:一种综合机器学习方法,整合 SHAP 可解释性和多队列验证。
Front Endocrinol (Lausanne). 2024 Oct 8;15:1450317. doi: 10.3389/fendo.2024.1450317. eCollection 2024.
2
Development and application of a machine learning-based predictive model for obstructive sleep apnea screening.基于机器学习的阻塞性睡眠呼吸暂停筛查预测模型的开发与应用。
Front Big Data. 2024 May 16;7:1353469. doi: 10.3389/fdata.2024.1353469. eCollection 2024.
3

本文引用的文献

1
Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。
BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.
2
A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information.一种利用进化信息对非编码RNA-蛋白质相互作用进行稳健且准确预测的深度学习框架。
Mol Ther Nucleic Acids. 2018 Jun 1;11:337-344. doi: 10.1016/j.omtn.2018.03.001. Epub 2018 Mar 9.
3
The interplay between noncoding RNAs and insulin in diabetes.
Machine Learning Model in Predicting Sarcopenia in Crohn's Disease Based on Simple Clinical and Anthropometric Measures.
基于简单临床和人体测量学指标的机器学习模型预测克罗恩病肌少症。
Int J Environ Res Public Health. 2022 Dec 30;20(1):656. doi: 10.3390/ijerph20010656.
4
Artificial intelligence methods enhance the discovery of RNA interactions.人工智能方法促进了RNA相互作用的发现。
Front Mol Biosci. 2022 Oct 7;9:1000205. doi: 10.3389/fmolb.2022.1000205. eCollection 2022.
5
A Presurgical Unfavorable Prediction Scale of Endovascular Treatment for Acute Ischemic Stroke.急性缺血性卒中血管内治疗的术前不良预测量表
Front Aging Neurosci. 2022 Jun 30;14:942285. doi: 10.3389/fnagi.2022.942285. eCollection 2022.
6
Circulating non-coding RNAs in recurrent and metastatic ovarian cancer.复发性和转移性卵巢癌中的循环非编码RNA
Cancer Drug Resist. 2019 Sep 19;2(3):399-418. doi: 10.20517/cdr.2019.51. eCollection 2019.
7
Opportunities and Challenges of Predictive Approaches for the Non-coding RNA in Plants.植物非编码RNA预测方法的机遇与挑战
Front Plant Sci. 2022 Apr 14;13:890663. doi: 10.3389/fpls.2022.890663. eCollection 2022.
8
DeepStack-DTIs: Predicting Drug-Target Interactions Using LightGBM Feature Selection and Deep-Stacked Ensemble Classifier.DeepStack-DTIs:使用 LightGBM 特征选择和深度堆叠集成分类器预测药物-靶标相互作用。
Interdiscip Sci. 2022 Jun;14(2):311-330. doi: 10.1007/s12539-021-00488-7. Epub 2021 Nov 3.
9
Development of machine learning model for diagnostic disease prediction based on laboratory tests.基于实验室检查的机器学习模型在疾病诊断预测中的开发。
Sci Rep. 2021 Apr 7;11(1):7567. doi: 10.1038/s41598-021-87171-5.
10
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。
BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.
非编码 RNA 与胰岛素在糖尿病中的相互作用。
Cancer Lett. 2018 Apr 10;419:53-63. doi: 10.1016/j.canlet.2018.01.038.
4
RNAcompete-S: Combined RNA sequence/structure preferences for RNA binding proteins derived from a single-step in vitro selection.RNAcompete-S:从单个体外选择步骤中获得的 RNA 结合蛋白的 RNA 序列/结构综合偏好。
Methods. 2017 Aug 15;126:18-28. doi: 10.1016/j.ymeth.2017.06.024. Epub 2017 Jun 24.
5
PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.PCVMZM:使用概率分类向量机模型结合泽尼克矩描述符从蛋白质序列预测蛋白质-蛋白质相互作用
Int J Mol Sci. 2017 May 11;18(5):1029. doi: 10.3390/ijms18051029.
6
RPI-Bind: a structure-based method for accurate identification of RNA-protein binding sites.RPI-Bind:一种基于结构的方法,用于准确识别 RNA-蛋白质结合位点。
Sci Rep. 2017 Apr 4;7(1):614. doi: 10.1038/s41598-017-00795-4.
7
Selecting high-quality negative samples for effectively predicting protein-RNA interactions.选择高质量的阴性样本以有效预测蛋白质-RNA相互作用。
BMC Syst Biol. 2017 Mar 14;11(Suppl 2):9. doi: 10.1186/s12918-017-0390-8.
8
Highly Efficient Framework for Predicting Interactions Between Proteins.高效蛋白质相互作用预测框架。
IEEE Trans Cybern. 2017 Mar;47(3):731-743. doi: 10.1109/TCYB.2016.2524994. Epub 2016 Mar 30.
9
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
10
rpiCOOL: A tool for In Silico RNA-protein interaction detection using random forest.rpiCOOL:一种使用随机森林进行计算机模拟RNA-蛋白质相互作用检测的工具。
J Theor Biol. 2016 Aug 7;402:1-8. doi: 10.1016/j.jtbi.2016.04.025. Epub 2016 Apr 28.