Suppr超能文献

实验和计算研究酶功能注释揭示了 EC 1.1.3.15 酶类中的错误注释。

Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the EC 1.1.3.15 enzyme class.

机构信息

Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.

出版信息

PLoS Comput Biol. 2021 Sep 23;17(9):e1009446. doi: 10.1371/journal.pcbi.1009446. eCollection 2021 Sep.

Abstract

Only a small fraction of genes deposited to databases have been experimentally characterised. The majority of proteins have their function assigned automatically, which can result in erroneous annotations. The reliability of current annotations in public databases is largely unknown; experimental attempts to validate the accuracy within individual enzyme classes are lacking. In this study we performed an overview of functional annotations to the BRENDA enzyme database. We first applied a high-throughput experimental platform to verify functional annotations to an enzyme class of S-2-hydroxyacid oxidases (EC 1.1.3.15). We chose 122 representative sequences of the class and screened them for their predicted function. Based on the experimental results, predicted domain architecture and similarity to previously characterised S-2-hydroxyacid oxidases, we inferred that at least 78% of sequences in the enzyme class are misannotated. We experimentally confirmed four alternative activities among the misannotated sequences and showed that misannotation in the enzyme class increased over time. Finally, we performed a computational analysis of annotations to all enzyme classes in the BRENDA database, and showed that nearly 18% of all sequences are annotated to an enzyme class while sharing no similarity or domain architecture to experimentally characterised representatives. We showed that even well-studied enzyme classes of industrial relevance are affected by the problem of functional misannotation.

摘要

仅有一小部分存入数据库的基因经过了实验鉴定。大多数蛋白质的功能是自动分配的,这可能导致注释错误。目前公共数据库中注释的可靠性在很大程度上是未知的;缺乏针对个别酶类进行准确性验证的实验尝试。在本研究中,我们对 BRENDA 酶数据库的功能注释进行了概述。我们首先应用高通量实验平台来验证 S-2-羟基酸氧化酶(EC 1.1.3.15)这一酶类的功能注释。我们选择了该酶类的 122 个代表性序列,并对其预测功能进行了筛选。基于实验结果、预测的结构域架构以及与先前鉴定的 S-2-羟基酸氧化酶的相似性,我们推断该酶类中至少有 78%的序列被错误注释。我们在被错误注释的序列中实验确认了四种替代活性,并表明该酶类中的错误注释随着时间的推移而增加。最后,我们对 BRENDA 数据库中所有酶类的注释进行了计算分析,并表明近 18%的序列被注释到一个酶类,而与经过实验鉴定的代表序列没有相似性或结构域架构。我们表明,即使是具有工业相关性的研究充分的酶类也受到功能错误注释问题的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9cc/8491902/ee46488d5486/pcbi.1009446.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验