Demerdash Omar, Shrestha Utsab R, Petridis Loukas, Smith Jeremy C, Mitchell Julie C, Ramanathan Arvind
Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States.
University of Tennessee/Oak Ridge National Laboratory Center for Molecular Biophysics, Oak Ridge, TN, United States.
Front Mol Biosci. 2019 Aug 13;6:64. doi: 10.3389/fmolb.2019.00064. eCollection 2019.
Intrinsically disordered proteins (IDPs) and proteins with intrinsically disordered regions (IDRs) play important roles in many aspects of normal cell physiology, such as signal transduction and transcription, as well as pathological states, including Alzheimer's, Parkinson's, and Huntington's disease. Unlike their globular counterparts that are defined by a few structures and free energy minima, IDP/IDR comprise a large ensemble of rapidly interconverting structures and a corresponding free energy landscape characterized by multiple minima. This aspect has precluded the use of structural biological techniques, such as X-ray crystallography and nuclear magnetic resonance (NMR) for resolving their structures. Instead, low-resolution techniques, such as small-angle X-ray or neutron scattering (SAXS/SANS), have become a mainstay in characterizing coarse features of the ensemble of structures. These are typically complemented with NMR data if possible or computational techniques, such as atomistic molecular dynamics, to further resolve the underlying ensemble of structures. However, over the past 10-15 years, it has become evident that the classical, pairwise-additive force fields that have enjoyed a high degree of success for globular proteins have been somewhat limited in modeling IDP/IDR structures that agree with experiment. There has thus been a significant effort to rehabilitate these models to obtain better agreement with experiment, typically done by optimizing parameters in a piecewise fashion. In this work, we take a different approach by optimizing a set of force field parameters simultaneously, using machine learning to adapt force field parameters to experimental SAXS scattering profiles. We demonstrate our approach in modeling three biologically IDP ensembles based on experimental SAXS profiles and show that our optimization approach significantly improve force field parameters that generate ensembles in better agreement with experiment.
内在无序蛋白(IDP)和含有内在无序区域(IDR)的蛋白在正常细胞生理学的许多方面发挥着重要作用,如信号转导和转录,以及在包括阿尔茨海默病、帕金森病和亨廷顿舞蹈症在内的病理状态中。与由少数结构和自由能最小值定义的球状蛋白不同,IDP/IDR包含大量快速相互转换的结构以及具有多个最小值的相应自由能景观。这一方面使得诸如X射线晶体学和核磁共振(NMR)等结构生物学技术无法用于解析它们的结构。相反,低分辨率技术,如小角X射线或中子散射(SAXS/SANS),已成为表征结构集合粗粒度特征的主要手段。如果可能的话,这些通常会辅以NMR数据或计算技术,如原子分子动力学,以进一步解析潜在的结构集合。然而,在过去的10 - 15年里,很明显,在球状蛋白方面取得高度成功的经典成对加和力场在模拟与实验相符的IDP/IDR结构时受到了一定限制。因此,人们做出了重大努力来改进这些模型以获得与实验更好的一致性,通常是通过逐段优化参数来实现。在这项工作中,我们采用了一种不同的方法,即同时优化一组力场参数,利用机器学习使力场参数适应实验SAXS散射曲线。我们基于实验SAXS曲线对三个生物学IDP集合进行建模,展示了我们的方法,并表明我们的优化方法显著改进了能生成与实验更好吻合集合的力场参数。