University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States.
Graduate School of Biomedical Sciences, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, United States.
J Mol Biol. 2021 May 28;433(11):166828. doi: 10.1016/j.jmb.2021.166828. Epub 2021 Jan 23.
There is a wide, and continuously widening, gap between the number of proteins known only by their amino acid sequence versus those structurally characterized by direct experiment. To close this gap, we mostly rely on homology-based inference and modeling to reason about the structures of the uncharacterized proteins by using structures of homologous proteins as templates. With the rapidly growing size of the Protein Data Bank, there are often multiple choices of templates, including multiple sets of coordinates from the same protein. The substantial conformational differences observed between different experimental structures of the same protein often reflect function related structural flexibility. Thus, depending on the questions being asked, using distant homologs, or coordinate sets with lower resolution but solved in the appropriate functional form, as templates may be more informative. The ModFlex server (https://modflex.org/) addresses this seldom mentioned gap in the standard homology modeling approach by providing the user with an interface with multiple options and tools to select the most relevant template and explore the range of structural diversity in the available templates. ModFlex is closely integrated with a range of other programs and servers developed in our group for the analysis and visualization of protein structural flexibility and divergence.
已知的蛋白质中,只有其氨基酸序列已知的蛋白质数量与通过直接实验结构特征化的蛋白质数量之间存在着广泛的差距,而且这个差距还在不断扩大。为了缩小这个差距,我们主要依靠基于同源性的推理和建模,通过使用同源蛋白质的结构作为模板来推断未特征化蛋白质的结构。随着蛋白质数据库规模的快速增长,通常有多种模板可供选择,包括同一蛋白质的多个坐标集。同一蛋白质的不同实验结构之间观察到的显著构象差异通常反映了与功能相关的结构灵活性。因此,根据所提出的问题,使用远源同源物或分辨率较低但以适当功能形式解决的坐标集作为模板可能更具信息量。ModFlex 服务器(https://modflex.org/)通过为用户提供一个具有多种选项和工具的界面来解决标准同源建模方法中很少提到的这个差距,使用户可以选择最相关的模板,并探索可用模板中的结构多样性范围。ModFlex 与我们小组开发的用于分析和可视化蛋白质结构灵活性和差异的一系列其他程序和服务器紧密集成。