School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China.
National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA.
Int J Mol Sci. 2018 Jul 20;19(7):2113. doi: 10.3390/ijms19072113.
Cancer is a complex disease that is driven by genetic alterations. There has been a rapid development of genome-wide techniques during the last decade along with a significant lowering of the cost of gene sequencing, which has generated widely available cancer genomic data. However, the interpretation of genomic data and the prediction of the association of genetic variations with cancer and disease phenotypes still requires significant improvement. Missense mutations, which can render proteins non-functional and provide a selective growth advantage to cancer cells, are frequently detected in cancer. Effects caused by missense mutations can be pinpointed by in silico modeling, which makes it more feasible to find a treatment and reverse the effect. Specific human phenotypes are largely determined by stability, activity, and interactions between proteins and other biomolecules that work together to execute specific cellular functions. Therefore, analysis of missense mutations' effects on proteins and their complexes would provide important clues for identifying functionally important missense mutations, understanding the molecular mechanisms of cancer progression and facilitating treatment and prevention. Herein, we summarize the major computational approaches and tools that provide not only the classification of missense mutations as cancer drivers or passengers but also the molecular mechanisms induced by driver mutations. This review focuses on the discussion of annotation and prediction methods based on structural and biophysical data, analysis of somatic cancer missense mutations in 3D structures of proteins and their complexes, predictions of the effects of missense mutations on protein stability, protein-protein and protein-nucleic acid interactions, and assessment of conformational changes in protein conformations induced by mutations.
癌症是一种由基因改变驱动的复杂疾病。在过去十年中,随着全基因组技术的快速发展和基因测序成本的显著降低,产生了广泛可用的癌症基因组数据。然而,基因组数据的解释和遗传变异与癌症及疾病表型关联的预测仍然需要显著改进。错义突变可以使蛋白质失去功能,并为癌细胞提供选择性生长优势,因此在癌症中经常被检测到。通过计算机建模可以精确定位错义突变的影响,这使得找到治疗方法和逆转效果变得更加可行。特定的人类表型在很大程度上取决于蛋白质及其复合物之间的稳定性、活性和相互作用,这些蛋白质和复合物共同执行特定的细胞功能。因此,分析错义突变对蛋白质及其复合物的影响将为识别功能重要的错义突变、理解癌症进展的分子机制以及促进治疗和预防提供重要线索。本文总结了主要的计算方法和工具,这些方法和工具不仅可以对癌症驱动突变和乘客突变进行分类,还可以揭示驱动突变所诱导的分子机制。本文重点讨论了基于结构和生物物理数据的注释和预测方法、蛋白质及其复合物的三维结构中体细胞癌症错义突变的分析、错义突变对蛋白质稳定性、蛋白质-蛋白质和蛋白质-核酸相互作用的影响的预测,以及突变诱导的蛋白质构象的构象变化评估。