• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用集成方法计算蛋白质 - RNA 复合物中结合能热点。

Computational identification of binding energy hot spots in protein-RNA complexes using an ensemble approach.

机构信息

School of Software, Central South University, Changsha 410075, China.

School of Electronics and Computer Science, Zhejiang Wanli University, Ningbo 315100, China.

出版信息

Bioinformatics. 2018 May 1;34(9):1473-1480. doi: 10.1093/bioinformatics/btx822.

DOI:10.1093/bioinformatics/btx822
PMID:29281004
Abstract

MOTIVATION

Identifying RNA-binding residues, especially energetically favored hot spots, can provide valuable clues for understanding the mechanisms and functional importance of protein-RNA interactions. Yet, limited availability of experimentally recognized energy hot spots in protein-RNA crystal structures leads to the difficulties in developing empirical identification approaches. Computational prediction of RNA-binding hot spot residues is still in its infant stage.

RESULTS

Here, we describe a computational method, PrabHot (Prediction of protein-RNA binding hot spots), that can effectively detect hot spot residues on protein-RNA binding interfaces using an ensemble of conceptually different machine learning classifiers. Residue interaction network features and new solvent exposure characteristics are combined together and selected for classification with the Boruta algorithm. In particular, two new reference datasets (benchmark and independent) have been generated containing 107 hot spots from 47 known protein-RNA complex structures. In 10-fold cross-validation on the training dataset, PrabHot achieves promising performances with an AUC score of 0.86 and a sensitivity of 0.78, which are significantly better than that of the pioneer RNA-binding hot spot prediction method HotSPRing. We also demonstrate the capability of our proposed method on the independent test dataset and gain a competitive advantage as a result.

AVAILABILITY AND IMPLEMENTATION

The PrabHot webserver is freely available at http://denglab.org/PrabHot/.

CONTACT

leideng@csu.edu.cn.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

确定 RNA 结合残基,特别是能量有利的热点,可以为理解蛋白质-RNA 相互作用的机制和功能重要性提供有价值的线索。然而,蛋白质-RNA 晶体结构中实验识别的能量热点的有限可用性导致开发经验识别方法的困难。RNA 结合热点残基的计算预测仍处于起步阶段。

结果

在这里,我们描述了一种计算方法 PrabHot(蛋白质-RNA 结合热点预测),该方法可以使用概念上不同的机器学习分类器的集合,有效地检测蛋白质-RNA 结合界面上的热点残基。残基相互作用网络特征和新的溶剂暴露特征结合在一起,并使用 Boruta 算法进行分类选择。特别是,我们生成了两个新的参考数据集(基准数据集和独立数据集),其中包含 47 个已知蛋白质-RNA 复合物结构中的 107 个热点。在训练数据集上的 10 倍交叉验证中,PrabHot 取得了有希望的性能,AUC 得分为 0.86,灵敏度为 0.78,明显优于先驱的 RNA 结合热点预测方法 HotSPRing。我们还在独立测试数据集上展示了我们提出的方法的能力,并因此获得了竞争优势。

可用性和实现

PrabHot 网络服务器可免费在 http://denglab.org/PrabHot/ 获得。

联系方式

leideng@csu.edu.cn。

补充信息

补充数据可在“Bioinformatics”在线获取。

相似文献

1
Computational identification of binding energy hot spots in protein-RNA complexes using an ensemble approach.使用集成方法计算蛋白质 - RNA 复合物中结合能热点。
Bioinformatics. 2018 May 1;34(9):1473-1480. doi: 10.1093/bioinformatics/btx822.
2
Computationally identifying hot spots in protein-DNA binding interfaces using an ensemble approach.使用集成方法计算蛋白质-DNA 结合界面中的热点。
BMC Bioinformatics. 2020 Sep 17;21(Suppl 13):384. doi: 10.1186/s12859-020-03675-3.
3
Identification of computational hot spots in protein interfaces: combining solvent accessibility and inter-residue potentials improves the accuracy.蛋白质界面中计算热点的识别:结合溶剂可及性和残基间势能可提高准确性。
Bioinformatics. 2009 Jun 15;25(12):1513-20. doi: 10.1093/bioinformatics/btp240. Epub 2009 Apr 8.
4
XGBPRH: Prediction of Binding Hot Spots at Protein⁻RNA Interfaces Utilizing Extreme Gradient Boosting.XGBPRH:利用极端梯度提升预测蛋白质⁻RNA 界面的结合热点。
Genes (Basel). 2019 Mar 21;10(3):242. doi: 10.3390/genes10030242.
5
APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility.APIS:通过结合突出指数和溶剂可及性来准确预测蛋白质界面热点。
BMC Bioinformatics. 2010 Apr 8;11:174. doi: 10.1186/1471-2105-11-174.
6
A feature-based approach to predict hot spots in protein-DNA binding interfaces.基于特征的方法预测蛋白质-DNA 结合界面热点。
Brief Bioinform. 2020 May 21;21(3):1038-1046. doi: 10.1093/bib/bbz037.
7
Hot spot prediction in protein-protein interactions by an ensemble system.通过集成系统预测蛋白质-蛋白质相互作用中的热点
BMC Syst Biol. 2018 Dec 31;12(Suppl 9):132. doi: 10.1186/s12918-018-0665-8.
8
Structure-based prediction of protein- peptide binding regions using Random Forest.基于结构的随机森林预测蛋白肽结合区域。
Bioinformatics. 2018 Feb 1;34(3):477-484. doi: 10.1093/bioinformatics/btx614.
9
Protein binding hot spots prediction from sequence only by a new ensemble learning method.仅通过一种新的集成学习方法从序列预测蛋白质结合热点
Amino Acids. 2017 Oct;49(10):1773-1785. doi: 10.1007/s00726-017-2474-6. Epub 2017 Aug 1.
10
Machine Learning Approaches for Protein⁻Protein Interaction Hot Spot Prediction: Progress and Comparative Assessment.机器学习方法在蛋白质-蛋白质相互作用热点预测中的应用:进展与比较评估。
Molecules. 2018 Oct 4;23(10):2535. doi: 10.3390/molecules23102535.

引用本文的文献

1
Precise prediction of hotspot residues in protein-RNA complexes using graph attention networks and pretrained protein language models.使用图注意力网络和预训练蛋白质语言模型精确预测蛋白质-RNA复合物中的热点残基。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i466-i474. doi: 10.1093/bioinformatics/btaf197.
2
PNBACE: an ensemble algorithm to predict the effects of mutations on protein-nucleic acid binding affinity.PNBACE:一种用于预测突变对蛋白质-核酸结合亲和力影响的集成算法。
BMC Biol. 2024 Sep 11;22(1):203. doi: 10.1186/s12915-024-02006-9.
3
PepCNN deep learning tool for predicting peptide binding residues in proteins using sequence, structural, and language model features.
PepCNN 深度学习工具,用于使用序列、结构和语言模型特征预测蛋白质中的肽结合残基。
Sci Rep. 2023 Nov 28;13(1):20882. doi: 10.1038/s41598-023-47624-5.
4
HPC-Atlas: Computationally Constructing A Comprehensive Atlas of Human Protein Complexes.HPC图谱:通过计算构建人类蛋白质复合物综合图谱
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):976-990. doi: 10.1016/j.gpb.2023.05.001. Epub 2023 Sep 18.
5
Beyond sequence: Structure-based machine learning.超越序列:基于结构的机器学习。
Comput Struct Biotechnol J. 2022 Dec 29;21:630-643. doi: 10.1016/j.csbj.2022.12.039. eCollection 2023.
6
Deep mutational scanning and machine learning reveal structural and molecular rules governing allosteric hotspots in homologous proteins.深度突变扫描和机器学习揭示了同源蛋白中变构热点的结构和分子规则。
Elife. 2022 Oct 13;11:e79932. doi: 10.7554/eLife.79932.
7
Molecular Modeling of Subtype-Specific Tat Protein Signatures to Predict Tat-TAR Interactions That May Be Involved in HIV-Associated Neurocognitive Disorders.亚型特异性Tat蛋白特征的分子建模,以预测可能与HIV相关神经认知障碍有关的Tat-TAR相互作用。
Front Microbiol. 2022 Apr 7;13:866611. doi: 10.3389/fmicb.2022.866611. eCollection 2022.
8
mmCSM-NA: accurately predicting effects of single and multiple mutations on protein-nucleic acid binding affinity.mmCSM-NA:准确预测单突变和多突变对蛋白质-核酸结合亲和力的影响。
NAR Genom Bioinform. 2021 Nov 17;3(4):lqab109. doi: 10.1093/nargab/lqab109. eCollection 2021 Dec.
9
Nabe: an energetic database of amino acid mutations in protein-nucleic acid binding interfaces.Nabe:蛋白质-核酸结合界面中氨基酸突变的能量数据库。
Database (Oxford). 2021 Aug 14;2021. doi: 10.1093/database/baab050.
10
An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties.一种通过探索界面邻居性质来改进 DNA 结合热点残基预测方法。
BMC Bioinformatics. 2021 May 17;22(Suppl 3):253. doi: 10.1186/s12859-020-03871-1.