Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru, India.
Hum Mutat. 2019 Sep;40(9):1424-1435. doi: 10.1002/humu.23800. Epub 2019 Jul 12.
With the advent of rapid sequencing technologies, making sense of all the genomic variations that we see among us has been a major challenge. A plethora of algorithms and methods exist that try to address genome interpretation through genotype-phenotype linkage analysis or evaluating the loss of function/stability mutations in protein. Critical Assessment of Genome Interpretation (CAGI) offers an exceptional platform to blind-test all such algorithms and methods to assess their true ability. We take advantage of this opportunity to explore the use of molecular dynamics simulation as a tool to assess alteration of phenotype, loss of protein function, interaction, and stability. The results show that coarse-grained dynamics based protein flexibility analysis on 34 CHEK2 and 1719 CALM1 single mutants perform reasonably well for class-based predictions for phenotype alteration and two-thirds of the predicted scores return a correlation coefficient of 0.6 or more. When all-atom dynamics is used to predict altered stability due to mutations for Frataxin protein (8 cases), the predictions are comparable to the state-of-the-art methods. The competitive performance of our straightforward approach to phenotype interpretation contrasts with heavily trained machine learning approaches, and open new avenues to rationally improve genome interpretation.
随着快速测序技术的出现,理解我们在人类中看到的所有基因组变异一直是一个主要挑战。存在大量的算法和方法试图通过基因型-表型连锁分析或评估蛋白质功能/稳定性突变的丧失来解决基因组解释问题。基因组解释的关键评估 (CAGI) 提供了一个极好的平台,可以对所有这些算法和方法进行盲测,以评估它们的真正能力。我们利用这个机会探索使用分子动力学模拟作为一种工具来评估表型改变、蛋白质功能丧失、相互作用和稳定性。结果表明,基于粗粒度动力学的 34 个 CHEK2 和 1719 个 CALM1 单突变体的蛋白质柔性分析在基于类别的表型改变预测方面表现相当不错,三分之二的预测得分返回的相关系数为 0.6 或更高。当使用全原子动力学预测 Frataxin 蛋白(8 个案例)突变引起的稳定性改变时,预测结果可与最先进的方法相媲美。我们这种简单的表型解释方法的竞争性能与经过大量训练的机器学习方法形成对比,为合理改善基因组解释开辟了新途径。