Institute of Biotechnology, Helsinki Institute of Life Sciences, and Organismal and Evolutionary Biology Research Program, Faculty of Biosciences, University of Helsinki, Finland.
Nucleic Acids Res. 2022 Jul 5;50(W1):W210-W215. doi: 10.1093/nar/gkac387.
Protein structure is key to understanding biological function. Structure comparison deciphers deep phylogenies, providing insight into functional conservation and functional shifts during evolution. Until recently, structural coverage of the protein universe was limited by the cost and labour involved in experimental structure determination. Recent breakthroughs in deep learning have revolutionized structural bioinformatics by providing accurate structural models of numerous protein families for which no structural information existed. The Dali server for 3D protein structure comparison is widely used by crystallographers to relate new structures to pre-existing ones. Here, we report the two most recent upgrades to the web server: (i) the foldomes of key organisms in the AlphaFold Database (version 1) are searchable by Dali, and (ii) structural alignments are annotated with protein families. Using these new features, we discovered a novel, functionally diverse subgroup within the WRKY/GCM1 clan. This was accomplished by linking the structurally characterized SWI/SNF and NAM families, as well as the structural models of the CG-1 family and of uncharacterized proteins, to the structure of Gti1/Pac2, a previously known member of the WRKY/GCM1 clan. The Dali server is available at http://ekhidna2.biocenter.helsinki.fi/dali. The website is free and open to all users, and there is no login requirement.