Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand.
Department of Biology, The College of Wooster, Wooster, Ohio, USA.
Protein Sci. 2021 Apr;30(4):914-921. doi: 10.1002/pro.4041. Epub 2021 Feb 25.
The ongoing biotechnological revolution is rooted in our knowledge of enzymes. However, metagenomics is showing how little we know about Earth's enzyme repertoire. Deep sequencing has revolutionized our view of the tree of life. The genomes of newly-discovered organisms are replete with novel sequences, emphasizing the trove of enzyme structures and functions waiting to be explored by biochemists. Here, we sought to draw attention to the vastness of the "enzymatic dark matter" within the tree of life by placing enzymological knowledge in the context of phylogeny. We used kinetic parameters from the BRaunschweig ENzyme DAtabase (BRENDA) as our proxy for enzymological knowledge. Mapping 12,677 BRENDA entries onto the phylogenetic tree revealed that 55% of these data were from eukaryotes, even though they are the least diverse part of the tree. At the next taxonomic level, only four of 18 archaeal phyla and 24 of 111 bacterial phyla are represented in the BRENDA dataset. One phylum, the Proteobacteria, accounts for over half of all bacterial entries. Similarly, the supergroup Amorphea, which includes animals and fungi, contains over half the data on eukaryotes. Many major taxonomic groups are notable for their complete absence from BRENDA, including the ultra-diverse bacterial Candidate Phyla Radiation. At the species level, five mammals (including human) contribute 15% of BRENDA entries. The taxonomic bias in enzymology is strong, but in the era of gene synthesis we now have the tools to address it. Doing so promises to enrich our biochemical understanding of life and uncover powerful new biocatalysts.
正在进行的生物技术革命植根于我们对酶的认识。然而,宏基因组学表明,我们对地球酶库的了解是多么有限。深度测序彻底改变了我们对生命之树的看法。新发现的生物体的基因组充满了新的序列,强调了等待生物化学家探索的酶结构和功能宝库。在这里,我们试图通过将酶学知识置于系统发育背景下来引起人们对生命之树中“酶学暗物质”的广阔性的关注。我们使用 BRaunschweig ENzyme DAtabase (BRENDA) 的动力学参数作为我们酶学知识的代理。将 12677 个 BRENDA 条目映射到系统发育树上,结果表明,尽管它们是树中最具多样性的部分,但其中 55%的数据来自真核生物。在下一个分类单元中,只有 18 个古菌门中的 4 个和 111 个细菌门中的 24 个在 BRENDA 数据集中得到了体现。一个门,即变形菌门,占所有细菌条目数的一半以上。同样,包括动物和真菌在内的超群变形菌门包含了超过一半的真核生物数据。许多主要的分类群由于完全不在 BRENDA 中而引人注目,包括超多样的细菌候选门辐射。在物种水平上,五种哺乳动物(包括人类)贡献了 BRENDA 条目数的 15%。酶学中的分类学偏见很强烈,但在基因合成的时代,我们现在有了解决它的工具。这样做有望丰富我们对生命的生化理解,并发现强大的新生物催化剂。