Program in Biomolecular Pharmacology, Boston University School of Medicine, 72 East Concord Street, Boston, MA 02118, USA.
Program in Biomolecular Pharmacology, Boston University School of Medicine, 72 East Concord Street, Boston, MA 02118, USA; Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, MA 02215, USA.
J Mol Biol. 2020 May 1;432(10):3269-3288. doi: 10.1016/j.jmb.2020.03.007. Epub 2020 Mar 19.
The flavin-dependent amine oxidase (FAO) superfamily consists of over 9000 nonredundant sequences represented in all domains of life. Of the thousands of members identified, only 214 have been functionally annotated to date, and 40 unique structures are represented in the Protein Data Bank. The few functionally characterized members share a catalytic mechanism involving the oxidation of an amine substrate through transfer of a hydride to the FAD cofactor, with differences observed in substrate specificities. Previous studies have focused on comparing a subset of superfamily members. Here, we present a comprehensive analysis of the FAO superfamily based on reaction mechanism and substrate recognition. Using a dataset of 9192 sequences, a sequence similarity network, and subsequently, a genome neighborhood network were constructed, organizing the superfamily into eight subgroups that accord with substrate type. Likewise, through phylogenetic analysis, the evolutionary relationship of subgroups was determined, delineating the divergence between enzymes based on organism, substrate, and mechanism. In addition, using sequences and atomic coordinates of 22 structures from the Protein Data Bank to perform sequence and structural alignments, active-site elements were identified, showing divergence from the canonical aromatic-cage residues to accommodate large substrates. These specificity determinants are held in a structural framework comprising a core domain catalyzing the oxidation of amines with an auxiliary domain for substrate recognition. Overall, analysis of the FAO superfamily reveals a modular fold with cofactor and substrate-binding domains allowing for diversity of recognition via insertion/deletions. This flexibility allows facile evolution of new activities, as shown by reinvention of function between subfamilies.
黄素依赖胺氧化酶(FAO)超家族由 9000 多个非冗余序列组成,存在于所有生命领域。在已鉴定的数千个成员中,迄今为止只有 214 个具有功能注释,并且在蛋白质数据库中代表了 40 个独特的结构。少数具有功能特征的成员具有相似的催化机制,涉及通过向 FAD 辅因子转移氢化物来氧化胺底物,观察到底物特异性的差异。先前的研究集中在比较超家族的一部分成员。在这里,我们根据反应机制和底物识别对 FAO 超家族进行了全面分析。使用 9192 个序列的数据集、序列相似性网络,随后构建了基因组邻近网络,将超家族组织成 8 个亚组,与底物类型一致。同样,通过系统发育分析,确定了亚组的进化关系,根据生物体、底物和机制划定了酶的分歧。此外,使用蛋白质数据库中 22 个结构的序列和原子坐标进行序列和结构比对,鉴定了活性位点元素,显示出与经典芳香笼残基的差异,以适应较大的底物。这些特异性决定因素包含在一个结构框架中,该框架由催化胺氧化的核心结构域和用于底物识别的辅助结构域组成。总之,FAO 超家族的分析揭示了一个具有辅因子和底物结合域的模块化折叠,允许通过插入/缺失进行多样性识别。这种灵活性允许新功能的轻松进化,正如亚家族之间功能的重新发明所显示的那样。