dos Santos Ricardo N, Morcos Faruck, Jana Biman, Andricopulo Adriano D, Onuchic José N
Center for Theoretical Biological Physics, Rice University, Houston, TX 77005-1827.
Laboratório de Química Medicinal e Computacional, Instituto de Física de São Carlos, Universidade de São Paulo, São Paulo, São Carlos, 13563-120, Brazil.
Sci Rep. 2015 Sep 4;5:13652. doi: 10.1038/srep13652.
We develop a procedure to characterize the association of protein structures into homodimers using coevolutionary couplings extracted from Direct Coupling Analysis (DCA) in combination with Structure Based Models (SBM). Identification of dimerization contacts using DCA is more challenging than intradomain contacts since direct couplings are mixed with monomeric contacts. Therefore a systematic way to extract dimerization signals has been elusive. We provide evidence that the prediction of homodimeric complexes is possible with high accuracy for all the cases we studied which have rich sequence information. For the most accurate conformations of the structurally diverse dimeric complexes studied the mean and interfacial RMSDs are 1.95Å and 1.44Å, respectively. This methodology is also able to identify distinct dimerization conformations as for the case of the family of response regulators, which dimerize upon activation. The identification of dimeric complexes can provide interesting molecular insights in the construction of large oligomeric complexes and be useful in the study of aggregation related diseases like Alzheimer's or Parkinson's.
我们开发了一种程序,通过将从直接耦合分析(DCA)中提取的协同进化耦合与基于结构的模型(SBM)相结合,来表征蛋白质结构形成同型二聚体的关联。使用DCA识别二聚化接触比识别结构域内接触更具挑战性,因为直接耦合与单体接触混合在一起。因此,一种提取二聚化信号的系统方法一直难以捉摸。我们提供的证据表明,对于我们研究的所有具有丰富序列信息的案例,都可以高精度地预测同型二聚体复合物。对于所研究的结构多样的二聚体复合物的最精确构象,平均RMSD和界面RMSD分别为1.95Å和1.44Å。这种方法还能够识别不同的二聚化构象,如响应调节因子家族的情况,它们在激活时会二聚化。二聚体复合物的识别可以为大型寡聚体复合物的构建提供有趣的分子见解,并有助于研究与聚集相关的疾病,如阿尔茨海默病或帕金森病。