Plant Biology Section, School of Integrative Plant Sciences, Cornell University, Ithaca, New York, 14853, USA.
Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA.
Plant J. 2022 Sep;111(5):1453-1468. doi: 10.1111/tpj.15902. Epub 2022 Jul 27.
Large enzyme families catalyze metabolic diversification by virtue of their ability to use diverse chemical scaffolds. How enzyme families attain such functional diversity is not clear. Furthermore, duplication and promiscuity in such enzyme families limits their functional prediction, which has produced a burgeoning set of incompletely annotated genes in plant genomes. Here, we address these challenges using BAHD acyltransferases as a model. This fast-evolving family expanded drastically in land plants, increasing from one to five copies in algae to approximately 100 copies in diploid angiosperm genomes. Compilation of >160 published activities helped visualize the chemical space occupied by this family and define eight different classes based on structural similarities between acceptor substrates. Using orthologous groups (OGs) across 52 sequenced plant genomes, we developed a method to predict BAHD acceptor substrate class utilization as well as origins of individual BAHD OGs in plant evolution. This method was validated using six novel and 28 previously characterized enzymes and helped improve putative substrate class predictions for BAHDs in the tomato genome. Our results also revealed that while cuticular wax and lignin biosynthetic activities were more ancient, anthocyanin acylation activity was fixed in BAHDs later near the origin of angiosperms. The OG-based analysis enabled identification of signature motifs in anthocyanin-acylating BAHDs, whose importance was validated via molecular dynamic simulations, site-directed mutagenesis and kinetic assays. Our results not only describe how BAHDs contributed to evolution of multiple chemical phenotypes in the plant world but also propose a biocuration-enabled approach for improved functional annotation of plant enzyme families.
庞大的酶家族通过利用多样化的化学支架来催化代谢多样化。酶家族如何获得这种功能多样性尚不清楚。此外,这些酶家族中的重复和多功能性限制了它们的功能预测,这导致植物基因组中产生了大量未完全注释的基因。在这里,我们以 BAHD 酰基转移酶为模型来解决这些挑战。这个快速进化的家族在陆地植物中急剧扩张,从藻类中的一个拷贝增加到二倍体被子植物基因组中的大约 100 个拷贝。编译了>160 项已发表的活性,有助于可视化该家族占据的化学空间,并根据受体底物之间的结构相似性定义了八个不同的类别。使用 52 个已测序植物基因组中的直系同源物(OG),我们开发了一种方法来预测 BAHD 受体底物类别的利用以及植物进化中各个 BAHD OG 的起源。该方法通过使用六个新的和 28 个以前表征的酶进行了验证,并有助于改善番茄基因组中 BAHD 的假定底物类预测。我们的结果还表明,虽然角质层蜡和木质素生物合成活性更为古老,但在被子植物起源附近,花色苷酰化活性在 BAHD 中被固定。基于 OG 的分析能够识别花色苷酰化 BAHD 中的特征基序,通过分子动力学模拟、定点突变和动力学测定验证了其重要性。我们的研究结果不仅描述了 BAHD 如何为植物界中多种化学表型的进化做出贡献,还提出了一种基于生物信息学的方法,用于改进植物酶家族的功能注释。