CEITEC - Central European Institute of Technology, Masaryk University, Brno, 625 00, Czech Republic.
National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, 625 00, Czech Republic.
Sci Rep. 2021 Jun 11;11(1):12345. doi: 10.1038/s41598-021-91494-8.
Protein structural families are groups of homologous proteins defined by the organization of their secondary structure elements (SSEs). Many families now contain vast numbers of structures, and the SSEs can help researchers navigate them. Communities around specific protein families have even developed specialized SSE annotations, which always assign the same name to equivalent SSEs in homologous proteins. A detailed analysis of the groups of equivalent SSEs provides an overview of the studied family and enriches the analysis of any particular protein at hand. We developed a workflow for analyzing the secondary structure anatomy of a protein family. We applied this analysis to the model family of cytochromes P450 (CYPs), a family of important biotransformation enzymes with an SSE annotation used community-wide. We report the occurrence, typical length, and amino acid sequence of the equivalent SSE groups, the conservation and variability of these properties, and their relationship to the substrate recognition sites. We also suggest a generic residue numbering scheme for the CYP family. Comparing the bacterial and eukaryotic parts of the family highlights significant differences between them and reveals a well-known anomalous group of bacterial CYPs with some typically eukaryotic features. Our workflow for SSE annotation of CYPs and other families is freely available at https://sestra.ncbr.muni.cz.
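As a rough illustration of the per-group statistics mentioned in the abstract (occurrence and typical length of equivalent SSEs), the following Python sketch summarizes a small set of annotations. It is not the authors' workflow: the input layout, the protein IDs, the residue ranges, and the function name `summarize_sse_groups` are all hypothetical, and the median is only one possible reading of "typical length".

```python
from statistics import median

# Hypothetical annotations: protein ID -> {SSE label: (start, end) residue numbers}.
# The IDs, labels, and ranges are made up for illustration only.
annotations = {
    "protA": {"A": (10, 21), "B": (30, 37), "I": (100, 131)},
    "protB": {"A": (12, 22), "I": (95, 127)},
    "protC": {"A": (8, 20), "B": (28, 36), "I": (102, 132)},
}

def summarize_sse_groups(annotations):
    """Return {label: (occurrence fraction, median length)} across the family."""
    lengths = {}
    for sses in annotations.values():
        for label, (start, end) in sses.items():
            # Length in residues, counting both end points.
            lengths.setdefault(label, []).append(end - start + 1)
    n_proteins = len(annotations)
    return {
        label: (len(values) / n_proteins, median(values))
        for label, values in lengths.items()
    }

summary = summarize_sse_groups(annotations)
for label, (occurrence, length) in sorted(summary.items()):
    print(f"{label}: occurrence={occurrence:.2f}, median length={length}")
```

Grouping by label relies on the property described above, that equivalent SSEs carry the same name in every homologous protein; without such a consistent annotation the per-label statistics would not be meaningful.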