• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于梯度提升机的 RNA-蛋白质相互作用预测数据驱动模型。

A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine.

机构信息

Department of Computer Science and Information Systems, Birla Institute of Technology and Science Pilani, K K Birla Goa campus, Zuarinagar, South Goa, Goa, India.

Faculty of Electronics and Information Technology, Warsaw University of Technology, Warsaw, Poland.

出版信息

Sci Rep. 2018 Jun 22;8(1):9552. doi: 10.1038/s41598-018-27814-2.

DOI:10.1038/s41598-018-27814-2
PMID:29934510
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6015049/
Abstract

RNA protein interactions (RPI) play a pivotal role in the regulation of various biological processes. Experimental validation of RPI has been time-consuming, paving the way for computational prediction methods. The major limiting factor of these methods has been the accuracy and confidence of the predictions, and our in-house experiments show that they fail to accurately predict RPI involving short RNA sequences such as TERRA RNA. Here, we present a data-driven model for RPI prediction using a gradient boosting classifier. Amino acids and nucleotides are classified based on the high-resolution structural data of RNA protein complexes. The minimum structural unit consisting of five residues is used as the descriptor. Comparative analysis of existing methods shows the consistently higher performance of our method irrespective of the length of RNA present in the RPI. The method has been successfully applied to map RPI networks involving both long noncoding RNA as well as TERRA RNA. The method is also shown to successfully predict RNA and protein hubs present in RPI networks of four different organisms. The robustness of this method will provide a way for predicting RPI networks of yet unknown interactions for both long noncoding RNA and microRNA.

摘要

RNA 与蛋白质的相互作用(RPI)在调节各种生物过程中起着关键作用。RPI 的实验验证既耗时又费力,因此为计算预测方法铺平了道路。这些方法的主要限制因素一直是预测的准确性和置信度,我们的内部实验表明,它们无法准确预测涉及 TERRA RNA 等短 RNA 序列的 RPI。在这里,我们使用梯度提升分类器为 RPI 预测提供了一种数据驱动的模型。基于 RNA 蛋白复合物的高分辨率结构数据对氨基酸和核苷酸进行分类。使用由五个残基组成的最小结构单元作为描述符。与现有方法的比较分析表明,无论 RPI 中存在的 RNA 长度如何,我们的方法始终表现出更高的性能。该方法已成功应用于映射涉及长非编码 RNA 和 TERRA RNA 的 RPI 网络。该方法还成功预测了四个不同生物体的 RPI 网络中存在的 RNA 和蛋白质枢纽。该方法的稳健性将为预测未知的长非编码 RNA 和 miRNA 的 RPI 网络提供一种方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2fd/6015049/55acab6629ed/41598_2018_27814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2fd/6015049/34f2fd2bd3df/41598_2018_27814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2fd/6015049/55acab6629ed/41598_2018_27814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2fd/6015049/34f2fd2bd3df/41598_2018_27814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2fd/6015049/55acab6629ed/41598_2018_27814_Fig2_HTML.jpg

相似文献

1
A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine.基于梯度提升机的 RNA-蛋白质相互作用预测数据驱动模型。
Sci Rep. 2018 Jun 22;8(1):9552. doi: 10.1038/s41598-018-27814-2.
2
RPI-Pred: predicting ncRNA-protein interaction using sequence and structural information.RPI-Pred:利用序列和结构信息预测非编码RNA与蛋白质的相互作用
Nucleic Acids Res. 2015 Feb 18;43(3):1370-9. doi: 10.1093/nar/gkv020. Epub 2015 Jan 21.
3
Prediction of RNA-protein interactions by combining deep convolutional neural network with feature selection ensemble method.通过结合深度卷积神经网络和特征选择集成方法预测 RNA-蛋白质相互作用。
J Theor Biol. 2019 Jan 14;461:230-238. doi: 10.1016/j.jtbi.2018.10.029. Epub 2018 Oct 12.
4
A boosting approach for prediction of protein-RNA binding residues.一种用于预测蛋白质-RNA结合残基的增强方法。
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):465. doi: 10.1186/s12859-017-1879-2.
5
RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information.RPI-SE:一种使用序列信息进行 ncRNA-蛋白质相互作用预测的堆叠集成学习框架。
BMC Bioinformatics. 2020 Feb 18;21(1):60. doi: 10.1186/s12859-020-3406-0.
6
RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence.RPiRLS:定量预测与任何已知序列蛋白质相互作用的 RNA。
Molecules. 2018 Feb 28;23(3):540. doi: 10.3390/molecules23030540.
7
Computational Prediction of RNA-Protein Interactions.RNA-蛋白质相互作用的计算预测
Methods Mol Biol. 2017;1543:169-185. doi: 10.1007/978-1-4939-6716-2_8.
8
Predicting RNA-binding sites in proteins using the interaction propensity of amino acid triplets.利用氨基酸三联体的相互作用倾向预测蛋白质中的RNA结合位点。
Protein Pept Lett. 2010 Sep;17(9):1102-10. doi: 10.2174/092986610791760388.
9
Computational Prediction of RNA-Binding Proteins and Binding Sites.RNA结合蛋白及结合位点的计算预测
Int J Mol Sci. 2015 Nov 3;16(11):26303-17. doi: 10.3390/ijms161125952.
10
Predicting protein-binding RNA nucleotides using the feature-based removal of data redundancy and the interaction propensity of nucleotide triplets.利用基于特征的数据冗余消除和核苷酸三联体的相互作用倾向预测与蛋白质结合的 RNA 核苷酸。
Comput Biol Med. 2013 Nov;43(11):1687-97. doi: 10.1016/j.compbiomed.2013.08.011. Epub 2013 Aug 21.

引用本文的文献

1
VirVACPRED: A Web Server for Prediction of Protective Viral Antigens.VirVACPRED:一个用于预测保护性病毒抗原的网络服务器。
Int J Pept Res Ther. 2022;28(1):35. doi: 10.1007/s10989-021-10345-2. Epub 2021 Dec 17.
2
The Identification of Metal Ion Ligand-Binding Residues by Adding the Reclassified Relative Solvent Accessibility.通过添加重新分类的相对溶剂可及性来鉴定金属离子配体结合残基。
Front Genet. 2020 Mar 19;11:214. doi: 10.3389/fgene.2020.00214. eCollection 2020.
3
Genomic analysis of variability in Delta-toxin levels between strains.

本文引用的文献

1
Identifying N-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine.利用多区间核苷酸对位置特异性和支持向量机鉴定 N6-甲基腺苷位点。
Sci Rep. 2017 Apr 25;7:46757. doi: 10.1038/srep46757.
2
CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency.CPPred-RF:一种基于序列的用于识别细胞穿透肽及其摄取效率的预测工具。
J Proteome Res. 2017 May 5;16(5):2044-2053. doi: 10.1021/acs.jproteome.7b00019. Epub 2017 Apr 26.
3
RAID v2.0: an updated resource of RNA-associated interactions across organisms.
菌株间δ毒素水平变异性的基因组分析。
PeerJ. 2020 Mar 24;8:e8717. doi: 10.7717/peerj.8717. eCollection 2020.
4
Solvation Free Energy Calculations with Quantum Mechanics/Molecular Mechanics and Machine Learning Models.溶剂化自由能的量子力学/分子力学和机器学习模型计算。
J Phys Chem B. 2019 Jan 31;123(4):901-908. doi: 10.1021/acs.jpcb.8b11905. Epub 2019 Jan 15.
5
Identification of miR-200c and miR141-Mediated lncRNA-mRNA Crosstalks in Muscle-Invasive Bladder Cancer Subtypes.肌层浸润性膀胱癌亚型中miR-200c和miR141介导的lncRNA-mRNA相互作用的鉴定
Front Genet. 2018 Sep 28;9:422. doi: 10.3389/fgene.2018.00422. eCollection 2018.
RAID v2.0:跨生物体的RNA相关相互作用的更新资源。
Nucleic Acids Res. 2017 Jan 4;45(D1):D115-D118. doi: 10.1093/nar/gkw1052. Epub 2016 Nov 28.
4
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
5
Bioactive Molecule Prediction Using Extreme Gradient Boosting.使用极端梯度提升法进行生物活性分子预测
Molecules. 2016 Jul 28;21(8):983. doi: 10.3390/molecules21080983.
6
NPInter v3.0: an upgraded database of noncoding RNA-associated interactions.NPInter v3.0:一个非编码RNA相关相互作用的升级数据库。
Database (Oxford). 2016 Apr 17;2016. doi: 10.1093/database/baw057. Print 2016.
7
Emerging Roles of RNA-Binding Proteins in Plant Growth, Development, and Stress Responses.RNA结合蛋白在植物生长、发育及应激反应中的新作用
Mol Cells. 2016 Mar;39(3):179-85. doi: 10.14348/molcells.2016.2359. Epub 2016 Feb 2.
8
AMBER Force Field Parameters for the Naturally Occurring Modified Nucleosides in RNA.RNA 中天然存在的修饰核苷的 AMBER 力场参数。
J Chem Theory Comput. 2007 Jul;3(4):1464-75. doi: 10.1021/ct600329w.
9
PAR-CLIP: A Method for Transcriptome-Wide Identification of RNA Binding Protein Interaction Sites.PAR-CLIP:一种用于全转录组范围鉴定RNA结合蛋白相互作用位点的方法。
Methods Mol Biol. 2016;1358:153-73. doi: 10.1007/978-1-4939-3067-8_10.
10
Enhanced Protein Fold Prediction Method Through a Novel Feature Extraction Technique.通过一种新型特征提取技术增强蛋白质折叠预测方法
IEEE Trans Nanobioscience. 2015 Sep;14(6):649-59. doi: 10.1109/TNB.2015.2450233.