Practical limits of function prediction.

Authors

Devos D, Valencia A

Affiliation

Protein Design Group, CNB-CSIC, Madrid, Spain.

Publication

Proteins. 2000 Oct 1;41(1):98-107.

PMID: 10944397

Abstract

The widening gap between known protein sequences and their functions has led to the practice of assigning a potential function to a protein on the basis of sequence similarity to proteins whose function has been experimentally investigated. We present here a critical view of the theoretical and practical bases for this approach. The results obtained by analyzing a significant number of true sequence similarities, derived directly from structural alignments, point to the complexity of function prediction. Different aspects of protein function, including (i) enzymatic function classification, (ii) functional annotations in the form of key words, (iii) classes of cellular function, and (iv) conservation of binding sites can only be reliably transferred between similar sequences to a modest degree. The reason for this difficulty is a combination of the unavoidable database inaccuracies and the plasticity of protein function. In addition, analysis of the relationship between sequence and functional descriptions defines an empirical limit for pairwise-based functional annotations, namely, the three first digits of the six numbers used as descriptors of protein folds in the FSSP database can be predicted at an average level as low as 7.5% sequence identity, two of the four EC digits at 15% identity, half of the SWISS-PROT key words related to protein function would require 20% identity, and the prediction of half of the residues in the binding site can be made at the 30% sequence identity level.
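The empirical limits quoted at the end of the abstract can be read as a lookup table. The following is a minimal Python sketch, not the authors' method: the function names `percent_identity` and `transferable_aspects` are hypothetical, and the identity definition used here (matches divided by columns where both sequences have a residue) is an assumption, since the abstract does not state how identity was computed over the structural alignments. The four cutoffs are the average values reported in the abstract and should not be treated as hard guarantees for any single pair.

```python
# Average sequence identity (%) at which the abstract reports each aspect
# to be predictable to the stated degree. Values are taken from the abstract.
THRESHOLDS = [
    (7.5, "first three of the six FSSP fold digits"),
    (15.0, "two of the four EC digits"),
    (20.0, "half of the function-related SWISS-PROT keywords"),
    (30.0, "half of the binding-site residues"),
]


def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences.

    Assumption: only columns where both sequences have a residue are counted.
    Other denominators (e.g. shorter sequence length) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def transferable_aspects(identity: float) -> list[str]:
    """Aspects whose reported average cutoff is at or below `identity`."""
    return [label for cutoff, label in THRESHOLDS if identity >= cutoff]


if __name__ == "__main__":
    # Toy alignment, invented for illustration only.
    ident = percent_identity("ACDE-G", "ACEEFG")
    print(f"{ident:.1f}% identity")
    for aspect in transferable_aspects(ident):
        print(" -", aspect)
```

In this reading, a pair at 18% identity would clear only the fold and EC cutoffs, which illustrates the abstract's point that finer-grained functional descriptions require higher similarity before they can be transferred.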

Similar articles

1. Practical limits of function prediction.
   Proteins. 2000 Oct 1;41(1):98-107.
2. Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases.
   J Mol Biol. 1998 Sep 4;281(5):949-68. doi: 10.1006/jmbi.1998.1993.
3. Detection of 3D atomic similarities and their use in the discrimination of small molecule protein-binding sites.
   Bioinformatics. 2008 Aug 15;24(16):i105-11. doi: 10.1093/bioinformatics/btn263.
4. Evolution of function in protein superfamilies, from a structural perspective.
   J Mol Biol. 2001 Apr 6;307(4):1113-43. doi: 10.1006/jmbi.2001.4513.
5. Analysis and prediction of functional sub-types from protein sequence alignments.
   J Mol Biol. 2000 Oct 13;303(1):61-76. doi: 10.1006/jmbi.2000.4036.
6. Prediction of protein subcellular localization.
   Proteins. 2006 Aug 15;64(3):643-51. doi: 10.1002/prot.21018.
7. Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space.
   Proteins. 2009 May 15;75(3):682-705. doi: 10.1002/prot.22280.
8. Theoretical model of restriction endonuclease HpaI in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis.
   Proteins. 2006 Jun 1;63(4):1059-68. doi: 10.1002/prot.20920.
9. Expanding the nitrogen regulatory protein superfamily: Homology detection at below random sequence identity.
   Proteins. 2002 Jul 1;48(1):75-84. doi: 10.1002/prot.10110.
10. Profile hidden Markov models for analyzing similarities and dissimilarities in the bacterial reaction center and photosystem II.
    Biochemistry. 2009 Feb 17;48(6):1230-43. doi: 10.1021/bi802033k.

Cited by

1. Gene Surfing: An efficient and versatile tool for targeted enzyme mining in metagenomics.
   Synth Syst Biotechnol. 2025 Jul 21;10(4):1377-1387. doi: 10.1016/j.synbio.2025.07.006. eCollection 2025 Dec.
2. SaGP: identifying plant saline-alkali tolerance genes based on machine learning techniques.
   Front Plant Sci. 2025 Jul 16;16:1629794. doi: 10.3389/fpls.2025.1629794. eCollection 2025.
3. Biological databases in the age of generative artificial intelligence.
   Bioinform Adv. 2025 Mar 20;5(1):vbaf044. doi: 10.1093/bioadv/vbaf044. eCollection 2025.
4. Evaluating the advancements in protein language models for encoding strategies in protein function prediction: a comprehensive review.
   Front Bioeng Biotechnol. 2025 Jan 21;13:1506508. doi: 10.3389/fbioe.2025.1506508. eCollection 2025.
5. Assessing the role of evolutionary information for enhancing protein language model embeddings.
   Sci Rep. 2024 Sep 5;14(1):20692. doi: 10.1038/s41598-024-71783-8.
6. A large-scale assessment of sequence database search tools for homology-based protein function prediction.
   Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae349.
7. Homology Modeling in the Twilight Zone: Improved Accuracy by Sequence Space Analysis.
   Methods Mol Biol. 2023;2627:1-23. doi: 10.1007/978-1-0716-2974-1_1.
8. Gut microbial metabolism of 5-ASA diminishes its clinical efficacy in inflammatory bowel disease.
   Nat Med. 2023 Mar;29(3):700-709. doi: 10.1038/s41591-023-02217-7. Epub 2023 Feb 23.
9. Evolutionary Relationships Between Dysregulated Genes in Oral Squamous Cell Carcinoma and Oral Microbiota.
   Front Cell Infect Microbiol. 2022 Jul 13;12:931011. doi: 10.3389/fcimb.2022.931011. eCollection 2022.
10. mebipred: identifying metal-binding potential in protein sequence.
    Bioinformatics. 2022 Jul 11;38(14):3532-3540. doi: 10.1093/bioinformatics/btac358.