CATH functional families predict functional sites in proteins.

Affiliations

PrecisionLife Ltd., Long Hanborough, OX29 8LJ Oxford, UK.

Institute of Structural and Molecular Biology, University College London, WC1E 6BT, London, UK.

Publication information

Bioinformatics. 2021 May 23;37(8):1099-1106. doi: 10.1093/bioinformatics/btaa937.

Abstract

MOTIVATION

Identification of functional sites in proteins is essential for functional characterization, variant interpretation and drug design. Several methods are available for predicting either a generic functional site, or specific types of functional site. Here, we present FunSite, a machine learning predictor that identifies catalytic, ligand-binding and protein-protein interaction functional sites using features derived from protein sequence and structure, and evolutionary data from CATH functional families (FunFams).

RESULTS

FunSite's prediction performance was rigorously benchmarked using cross-validation and a holdout dataset. FunSite outperformed other publicly available functional site prediction methods. We show that conserved residues in FunFams are enriched in functional sites. We found FunSite's performance depends greatly on the quality of functional site annotations and the information content of FunFams in the training data. Finally, we analyze which structural and evolutionary features are most predictive for functional sites.
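To make the notion of "conserved residues" concrete, the following is a minimal sketch of one common way to score per-column conservation in a multiple sequence alignment, using normalised Shannon entropy. It is an illustration only and is not taken from the FunSite code base: the function name `column_conservation`, the toy alignment, and the choice of entropy as the conservation measure are all assumptions, and the conservation score actually used for FunFams may differ.

```python
import math
from collections import Counter


def column_conservation(alignment):
    """Return one conservation score in [0, 1] per alignment column.

    Hypothetical helper: 1.0 means a fully conserved column, lower values
    mean more residue diversity. Gap characters ('-') are ignored.
    """
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    max_entropy = math.log2(20)  # upper bound for 20 standard amino acids
    scores = []
    for col in range(length):
        residues = [seq[col] for seq in alignment if seq[col] != "-"]
        if not residues:
            scores.append(0.0)  # all-gap column: treat as uninformative
            continue
        total = len(residues)
        counts = Counter(residues)
        entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
        scores.append(1.0 - entropy / max_entropy)
    return scores


# Toy alignment (made-up sequences, for illustration only).
toy_alignment = ["ACDH", "ACEH", "AC-H", "GCKH"]
scores = column_conservation(toy_alignment)
for position, score in enumerate(scores, start=1):
    print(f"column {position}: conservation = {score:.2f}")
```

In a sketch like this, columns with scores near 1.0 would be candidate conserved positions; such a score could be one of several per-residue features supplied to a classifier, alongside structural features.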

AVAILABILITY AND IMPLEMENTATION

https://github.com/UCL/cath-funsite-predictor.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
