• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PROBselect:通过动态预测器选择从蛋白质序列中准确预测蛋白质结合残基。

PROBselect: accurate prediction of protein-binding residues from proteins sequences via dynamic predictor selection.

机构信息

Hunan Provincial Key Laboratory on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China.

School of Computer and Information Technology, Xinyang Normal University, Xinyang 464000, China.

出版信息

Bioinformatics. 2020 Dec 30;36(Suppl_2):i735-i744. doi: 10.1093/bioinformatics/btaa806.

DOI:10.1093/bioinformatics/btaa806
PMID:33381815
Abstract

MOTIVATION

Knowledge of protein-binding residues (PBRs) improves our understanding of protein-protein interactions, contributes to the prediction of protein functions and facilitates protein-protein docking calculations. While many sequence-based predictors of PBRs were published, they offer modest levels of predictive performance and most of them cross-predict residues that interact with other partners. One unexplored option to improve the predictive quality is to design consensus predictors that combine results produced by multiple methods.

RESULTS

We empirically investigate predictive performance of a representative set of nine predictors of PBRs. We report substantial differences in predictive quality when these methods are used to predict individual proteins, which contrast with the dataset-level benchmarks that are currently used to assess and compare these methods. Our analysis provides new insights for the cross-prediction concern, dissects complementarity between predictors and demonstrates that predictive performance of the top methods depends on unique characteristics of the input protein sequence. Using these insights, we developed PROBselect, first-of-its-kind consensus predictor of PBRs. Our design is based on the dynamic predictor selection at the protein level, where the selection relies on regression-based models that accurately estimate predictive performance of selected predictors directly from the sequence. Empirical assessment using a low-similarity test dataset shows that PROBselect provides significantly improved predictive quality when compared with the current predictors and conventional consensuses that combine residue-level predictions. Moreover, PROBselect informs the users about the expected predictive quality for the prediction generated from a given input protein.

AVAILABILITY AND IMPLEMENTATION

PROBselect is available at http://bioinformatics.csu.edu.cn/PROBselect/home/index.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

了解蛋白质结合残基 (PBR) 可以帮助我们更好地理解蛋白质-蛋白质相互作用,有助于预测蛋白质的功能,并促进蛋白质-蛋白质对接计算。虽然已经发表了许多基于序列的 PBR 预测器,但它们的预测性能都不高,而且大多数预测器都可以预测与其他伴侣相互作用的残基。提高预测质量的一种未被探索的选择是设计结合多种方法结果的共识预测器。

结果

我们实证研究了一组 9 个 PBR 预测器的代表性集合的预测性能。当这些方法用于预测单个蛋白质时,我们报告了预测质量的显著差异,这与目前用于评估和比较这些方法的数据集级别基准形成对比。我们的分析为交叉预测问题提供了新的见解,剖析了预测器之间的互补性,并证明了顶级方法的预测性能取决于输入蛋白质序列的独特特征。利用这些见解,我们开发了 PROBselect,这是首个 PBR 共识预测器。我们的设计基于蛋白质水平上的动态预测器选择,其中选择依赖于回归模型,该模型可以直接从序列中准确估计所选预测器的预测性能。使用低相似度测试数据集进行的实证评估表明,与当前的预测器和结合残基预测的传统共识相比,PROBselect 提供了显著提高的预测质量。此外,PROBselect 可以为用户提供有关从给定输入蛋白质生成的预测的预期预测质量的信息。

可用性和实现

PROBselect 可在 http://bioinformatics.csu.edu.cn/PROBselect/home/index 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
PROBselect: accurate prediction of protein-binding residues from proteins sequences via dynamic predictor selection.PROBselect:通过动态预测器选择从蛋白质序列中准确预测蛋白质结合残基。
Bioinformatics. 2020 Dec 30;36(Suppl_2):i735-i744. doi: 10.1093/bioinformatics/btaa806.
2
SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences.SCRIBER:从蛋白质序列中准确预测与伴侣类型特异性相关的蛋白质结合残基。
Bioinformatics. 2019 Jul 15;35(14):i343-i353. doi: 10.1093/bioinformatics/btz324.
3
Prediction of protein-binding residues: dichotomy of sequence-based methods developed using structured complexes versus disordered proteins.预测蛋白质结合残基:基于序列的方法在使用结构复合物与无规则蛋白质时的二分法。
Bioinformatics. 2020 Sep 15;36(18):4729-4738. doi: 10.1093/bioinformatics/btaa573.
4
A comprehensive comparative review of sequence-based predictors of DNA- and RNA-binding residues.基于序列的DNA和RNA结合残基预测因子的全面比较综述。
Brief Bioinform. 2016 Jan;17(1):88-105. doi: 10.1093/bib/bbv023. Epub 2015 May 1.
5
On the complementarity of the consensus-based disorder prediction.基于共识的无序预测的互补性
Pac Symp Biocomput. 2012:176-87.
6
DeepPRObind: Modular Deep Learner that Accurately Predicts Structure and Disorder-Annotated Protein Binding Residues.DeepPRObind:精确预测具有结构和无序注释的蛋白质结合残基的模块化深度学习模型
J Mol Biol. 2023 Jul 15;435(14):167945. doi: 10.1016/j.jmb.2023.167945. Epub 2023 Jan 5.
7
Review and comparative assessment of sequence-based predictors of protein-binding residues.基于序列的蛋白质结合残基预测因子的回顾与比较评估。
Brief Bioinform. 2018 Sep 28;19(5):821-837. doi: 10.1093/bib/bbx022.
8
DISOselect: Disorder predictor selection at the protein level.DISOselect:蛋白质水平的紊乱预测因子选择。
Protein Sci. 2020 Jan;29(1):184-200. doi: 10.1002/pro.3756. Epub 2019 Nov 7.
9
DFLpred: High-throughput prediction of disordered flexible linker regions in protein sequences.DFLpred:蛋白质序列中无序柔性连接区的高通量预测
Bioinformatics. 2016 Jun 15;32(12):i341-i350. doi: 10.1093/bioinformatics/btw280.
10
DNAgenie: accurate prediction of DNA-type-specific binding residues in protein sequences.DNAgenie:准确预测蛋白质序列中 DNA 型特异性结合残基
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab336.

引用本文的文献

1
Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences.蛋白质序列中核酸结合残基预测二十年进展
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf016.
2
Accurate Prediction of Protein-Binding Residues in Protein Sequences Using SCRIBER.使用 SCRIBER 准确预测蛋白质序列中的蛋白质结合残基。
Methods Mol Biol. 2025;2867:247-260. doi: 10.1007/978-1-0716-4196-5_15.
3
PMSFF: Improved Protein Binding Residues Prediction through Multi-Scale Sequence-Based Feature Fusion Strategy.PMSFF:通过多尺度序列的基于特征融合策略来提高蛋白质结合残基预测。
Biomolecules. 2024 Sep 27;14(10):1220. doi: 10.3390/biom14101220.
4
HybridDBRpred: improved sequence-based prediction of DNA-binding amino acids using annotations from structured complexes and disordered proteins.HybridDBRpred:利用结构复合物和无序蛋白的注释改进基于序列的 DNA 结合氨基酸预测。
Nucleic Acids Res. 2024 Jan 25;52(2):e10. doi: 10.1093/nar/gkad1131.
5
HybridRNAbind: prediction of RNA interacting residues across structure-annotated and disorder-annotated proteins.HybridRNAbind:跨结构注释和无序注释蛋白质预测 RNA 相互作用残基。
Nucleic Acids Res. 2023 Mar 21;51(5):e25. doi: 10.1093/nar/gkac1253.
6
Protein-protein interaction prediction with deep learning: A comprehensive review.基于深度学习的蛋白质-蛋白质相互作用预测:综述
Comput Struct Biotechnol J. 2022 Sep 19;20:5316-5341. doi: 10.1016/j.csbj.2022.08.070. eCollection 2022.
7
ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.ATGPred-FL:基于特征表示学习的自噬蛋白序列预测。
Amino Acids. 2022 May;54(5):799-809. doi: 10.1007/s00726-022-03145-5. Epub 2022 Mar 14.
8
Identification of Membrane Proteins Using Sequence-Based Features.基于序列特征的膜蛋白鉴定。
Comput Math Methods Med. 2022 Jan 12;2022:7493834. doi: 10.1155/2022/7493834. eCollection 2022.
9
IHEC_RAAC: a online platform for identifying human enzyme classes via reduced amino acid cluster strategy.IHEC\_RAAC:一种通过简化氨基酸簇策略来鉴定人类酶类的在线平台。
Amino Acids. 2021 Feb;53(2):239-251. doi: 10.1007/s00726-021-02941-9. Epub 2021 Jan 23.