利用蛋白质结构鉴定非随机体细胞突变。

Utilizing protein structure to identify non-random somatic mutations.

机构信息

Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA.

出版信息

BMC Bioinformatics. 2013 Jun 13;14:190. doi: 10.1186/1471-2105-14-190.

DOI:10.1186/1471-2105-14-190

PMID:23758891

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3691676/

Abstract

BACKGROUND

Human cancer is caused by the accumulation of somatic mutations in tumor suppressors and oncogenes within the genome. In the case of oncogenes, recent theory suggests that there are only a few key "driver" mutations responsible for tumorigenesis. As there have been significant pharmacological successes in developing drugs that treat cancers that carry these driver mutations, several methods that rely on mutational clustering have been developed to identify them. However, these methods consider proteins as a single strand without taking their spatial structures into account. We propose an extension to current methodology that incorporates protein tertiary structure in order to increase our power when identifying mutation clustering.

RESULTS

We have developed iPAC (identification of Protein Amino acid Clustering), an algorithm that identifies non-random somatic mutations in proteins while taking into account the three dimensional protein structure. By using the tertiary information, we are able to detect both novel clusters in proteins that are known to exhibit mutation clustering as well as identify clusters in proteins without evidence of clustering based on existing methods. For example, by combining the data in the Protein Data Bank (PDB) and the Catalogue of Somatic Mutations in Cancer, our algorithm identifies new mutational clusters in well known cancer proteins such as KRAS and PI3KC α. Further, by utilizing the tertiary structure, our algorithm also identifies clusters in EGFR, EIF2AK2, and other proteins that are not identified by current methodology. The R package is available at: http://www.bioconductor.org/packages/2.12/bioc/html/iPAC.html.

CONCLUSION

Our algorithm extends the current methodology to identify oncogenic activating driver mutations by utilizing tertiary protein structure when identifying nonrandom somatic residue mutation clusters.

摘要

背景

人类癌症是由基因组中肿瘤抑制基因和癌基因的体细胞突变积累引起的。在癌基因的情况下，最近的理论表明，只有少数几个关键的“驱动”突变负责肿瘤发生。由于在开发治疗携带这些驱动突变的癌症的药物方面取得了重大的药理学成功，因此已经开发了几种依赖于突变聚类的方法来识别它们。然而，这些方法将蛋白质视为单链，而不考虑其空间结构。我们提出了一种对现有方法的扩展，该方法将蛋白质的三级结构纳入其中，以提高识别突变聚类的能力。

结果

我们开发了 iPAC（蛋白质氨基酸聚类识别）算法，该算法在考虑三维蛋白质结构的同时识别蛋白质中的非随机体细胞突变。通过使用三级信息，我们能够检测到已知存在突变聚类的蛋白质中的新聚类，以及根据现有方法没有聚类证据的蛋白质中的聚类。例如，通过将蛋白质数据库（PDB）和癌症体细胞突变目录中的数据结合起来，我们的算法在 KRAS 和 PI3KCα 等著名的癌症蛋白中识别出新的突变聚类。此外，通过利用三级结构，我们的算法还在 EGFR、EIF2AK2 和其他当前方法无法识别的蛋白质中识别出聚类。R 包可在：http://www.bioconductor.org/packages/2.12/bioc/html/iPAC.html 获得。

结论

我们的算法通过在识别非随机体细胞残基突变聚类时利用三级蛋白质结构，扩展了当前的方法，以识别致癌激活驱动突变。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb2f/3691676/6c751fad79b5/1471-2105-14-190-2.jpg

相似文献

Utilizing protein structure to identify non-random somatic mutations.

BMC Bioinformatics. 2013 Jun 13;14:190. doi: 10.1186/1471-2105-14-190.

A graph theoretic approach to utilizing protein structure to identify non-random somatic mutations.

BMC Bioinformatics. 2014 Mar 26;15:86. doi: 10.1186/1471-2105-15-86.

A spatial simulation approach to account for protein structure when identifying non-random somatic mutations.

BMC Bioinformatics. 2014 Jul 3;15:231. doi: 10.1186/1471-2105-15-231.

Leveraging protein quaternary structure to identify oncogenic driver mutations.

BMC Bioinformatics. 2016 Mar 22;17:137. doi: 10.1186/s12859-016-0963-3.

Statistical method on nonrandom clustering with application to somatic mutations in cancer.

BMC Bioinformatics. 2010 Jan 7;11:11. doi: 10.1186/1471-2105-11-11.

Multiplicity: an organizing principle for cancers and somatic mutations.

BMC Med Genomics. 2011 Jun 29;4:52. doi: 10.1186/1755-8794-4-52.

mutation3D: Cancer Gene Prediction Through Atomic Clustering of Coding Variants in the Structural Proteome.

Hum Mutat. 2016 May;37(5):447-56. doi: 10.1002/humu.22963. Epub 2016 Feb 18.

Protein domain-level landscape of cancer-type-specific somatic mutations.

PLoS Comput Biol. 2015 Mar 20;11(3):e1004147. doi: 10.1371/journal.pcbi.1004147. eCollection 2015 Mar.

LRT-CLUSTER: A New Clustering Algorithm Based on Likelihood Ratio Test to Identify Driving Genes.

Interdiscip Sci. 2023 Jun;15(2):217-230. doi: 10.1007/s12539-023-00554-2. Epub 2023 Feb 27.

3D clusters of somatic mutations in cancer reveal numerous rare mutations as functional targets.

Genome Med. 2017 Jan 23;9(1):4. doi: 10.1186/s13073-016-0393-x.

引用本文的文献

Adaptive genetics reveals constraints on protein structure/function by evolving E. coli under constant nutrient limitation.

BMC Biol. 2025 Aug 20;23(1):261. doi: 10.1186/s12915-025-02331-7.

Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact.

BMC Bioinformatics. 2024 Mar 6;25(1):99. doi: 10.1186/s12859-024-05707-8.

MaxCLK: discovery of cancer driver genes via maximal clique and information entropy of modules.

Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad737.

Computational Methods Summarizing Mutational Patterns in Cancer: Promise and Limitations for Clinical Applications.

Cancers (Basel). 2023 Mar 24;15(7):1958. doi: 10.3390/cancers15071958.

WMDS.net: a network control framework for identifying key players in transcriptome programs.

Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad071.

SWEET: a single-sample network inference method for deciphering individual features in disease.

Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad032.

Integrated analysis of genes encoding ATP-dependent chromatin remodellers identifies CHD7 as a potential target for colorectal cancer therapy.

Clin Transl Med. 2022 Jul;12(7):e953. doi: 10.1002/ctm2.953.

Allostery: Allosteric Cancer Drivers and Innovative Allosteric Drugs.

J Mol Biol. 2022 Sep 15;434(17):167569. doi: 10.1016/j.jmb.2022.167569. Epub 2022 Apr 1.

Cancer-Associated circRNA-miRNA-mRNA Regulatory Networks: A Meta-Analysis.

Front Mol Biosci. 2021 May 12;8:671309. doi: 10.3389/fmolb.2021.671309. eCollection 2021.

Computational methods for detecting cancer hotspots.

Comput Struct Biotechnol J. 2020 Nov 19;18:3567-3576. doi: 10.1016/j.csbj.2020.11.020. eCollection 2020.

本文引用的文献

Pyrazolopyridine Inhibitors of B-Raf(V600E). Part 1: The Development of Selective, Orally Bioavailable, and Efficacious Inhibitors.

ACS Med Chem Lett. 2011 Mar 8;2(5):342-7. doi: 10.1021/ml200025q. eCollection 2011 May 12.

Reorganizing the protein space at the Universal Protein Resource (UniProt).

Nucleic Acids Res. 2012 Jan;40(Database issue):D71-5. doi: 10.1093/nar/gkr981. Epub 2011 Nov 18.

Protein arginine methyltransferase 5 regulates ERK1/2 signal transduction amplitude and cell fate through CRAF.

Sci Signal. 2011 Sep 13;4(190):ra58. doi: 10.1126/scisignal.2001936.

Predicting the functional impact of protein mutations: application to cancer genomics.

Nucleic Acids Res. 2011 Sep 1;39(17):e118. doi: 10.1093/nar/gkr407. Epub 2011 Jul 3.

Initial genome sequencing and analysis of multiple myeloma.

Nature. 2011 Mar 24;471(7339):467-72. doi: 10.1038/nature09837.

Epidermal Growth Factor Receptor (EGFR) mutation analysis, gene expression profiling and EGFR protein expression in primary prostate cancer.

BMC Cancer. 2011 Jan 25;11:31. doi: 10.1186/1471-2407-11-31.

Good clinical response to gefitinib in a non-small cell lung cancer patient harboring a rare somatic epidermal growth factor gene point mutation; codon 768 AGC > ATC in exon 20 (S768I).

Jpn J Clin Oncol. 2010 Nov;40(11):1105-9. doi: 10.1093/jjco/hyq087. Epub 2010 Jun 3.

A method and server for predicting damaging missense mutations.

Nat Methods. 2010 Apr;7(4):248-9. doi: 10.1038/nmeth0410-248.

Statistical method on nonrandom clustering with application to somatic mutations in cancer.

BMC Bioinformatics. 2010 Jan 7;11:11. doi: 10.1186/1471-2105-11-11.

Growth factor receptor expression in anal squamous lesions: modifications associated with oncogenic human papillomavirus and human immunodeficiency virus.

Hum Pathol. 2009 Nov;40(11):1517-27. doi: 10.1016/j.humpath.2009.05.010. Epub 2009 Aug 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用蛋白质结构鉴定非随机体细胞突变。

Utilizing protein structure to identify non-random somatic mutations.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献