Department of Pharmacology, Department of Chemistry, and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America.
Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC, Germany.
PLoS Comput Biol. 2020 Oct 28;16(10):e1007597. doi: 10.1371/journal.pcbi.1007597. eCollection 2020 Oct.
As sequencing methodologies continue to advance, the availability of protein sequences far outpaces the ability of structure determination. Homology modeling is used to bridge this gap but relies on high-identity templates for accurate model building. G-protein coupled receptors (GPCRs) represent a significant target class for pharmaceutical therapies in which homology modeling could fill the knowledge gap for structure-based drug design. To date, only about 17% of druggable GPCRs have had their structures characterized at atomic resolution. However, modeling of the remaining 83% is hindered by the low sequence identity between receptors. Here we test key inputs in the model building process using GPCRs as a focus to improve the pipeline in two critical ways: Firstly, we use a blended sequence- and structure-based alignment that accounts for structure conservation in loop regions. Secondly, by merging multiple template structures into one comparative model, the best possible template for every region of a target can be used expanding the conformational space sampled in a meaningful way. This optimization allows for accurate modeling of receptors using templates as low as 20% sequence identity, which accounts for nearly the entire druggable space of GPCRs. A model database of all non-odorant GPCRs is made available at www.rosettagpcr.org. Additionally, all protocols are made available with insights into modifications that may improve accuracy at new targets.
随着测序方法的不断进步,蛋白质序列的可用性远远超过了结构确定的能力。同源建模用于弥合这一差距,但依赖于高同源性模板来进行准确的模型构建。G 蛋白偶联受体 (GPCR) 是药物治疗的一个重要靶点,同源建模可以为基于结构的药物设计填补知识空白。迄今为止,只有大约 17%的可成药 GPCR 的结构具有原子分辨率。然而,建模的其余 83%受到受体之间低序列同一性的阻碍。在这里,我们使用 GPCR 作为重点来测试模型构建过程中的关键输入,以两种关键方式改进该流水线:首先,我们使用基于序列和结构的混合比对方法,考虑到环区的结构保守性。其次,通过将多个模板结构合并到一个比较模型中,可以为目标的每个区域使用最佳的模板,以有意义的方式扩展采样的构象空间。这种优化允许使用低至 20%序列同一性的模板进行受体的准确建模,这几乎涵盖了 GPCR 可成药的整个空间。所有非气味 GPCR 的模型数据库可在 www.rosettagpcr.org 获得。此外,所有协议都提供了有关可能提高新目标准确性的修改的见解。