Association analysis using somatic mutations.

Affiliations

Department of Mathematics and Statistics, Wright State University, Dayton, Ohio, United States of America.

Biostatistics Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America.

Publication information

PLoS Genet. 2018 Nov 2;14(11):e1007746. doi: 10.1371/journal.pgen.1007746. eCollection 2018 Nov.

DOI:10.1371/journal.pgen.1007746

PMID:30388102

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6235399/

Abstract

Somatic mutations drive the growth of tumor cells and are pivotal biomarkers for many cancer treatments. Genetic association analysis using somatic mutations is an effective approach to studying their functional impact. However, standard regression methods are not appropriate for somatic mutation association studies because somatic mutation calls often have non-ignorable false positive and/or false negative rates. Large-scale association analysis using somatic mutations has recently become feasible, thanks to improvements in sequencing techniques and reductions in sequencing cost, so there is an urgent need for a new statistical method designed for somatic mutation association analysis. We propose such a method, with a computationally efficient software implementation: Somatic mutation Association test with Measurement Errors (SAME). SAME accounts for somatic mutation calling uncertainty using a likelihood-based approach. It can be used to assess the associations between continuous or dichotomous outcomes and individual mutations or gene-level mutations. Through simulation studies across a wide range of realistic scenarios, we show that SAME can significantly improve statistical power compared with the naive generalized linear model that ignores mutation calling uncertainty. Finally, using data collected from The Cancer Genome Atlas (TCGA) project, we apply SAME to study the associations between somatic mutations and gene expression in 12 cancer types, as well as the associations between somatic mutations and colon cancer subtype defined by DNA methylation data. SAME recovered some interesting findings that were missed by the generalized linear model. In addition, we demonstrate that mutation-level and gene-level analyses are often more appropriate for oncogenes and tumor-suppressor genes, respectively.
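
To make the idea of a likelihood-based treatment of calling uncertainty concrete, here is a minimal sketch in Python. It is not the SAME software and does not reproduce the authors' model. It assumes a continuous outcome, a single mutation, and a hypothetical per-sample probability `p[i]` that the mutation is truly present (in practice such a probability would have to be derived from the calling data). The true mutation status is summed out of the likelihood, the model is fitted with a simple EM algorithm, and the association is tested with a likelihood ratio test. All function names are illustrative.

```python
import math
import random


def normal_pdf(y, mu, sigma):
    """Density of N(mu, sigma^2) at y."""
    z = (y - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def marginal_loglik(y, p, mu0, mu1, sigma):
    """Log-likelihood with the unobserved true mutation status summed out.

    p[i] is the assumed probability that sample i truly carries the mutation;
    mu0 and mu1 are the outcome means without and with the mutation.
    """
    return sum(
        math.log(pi * normal_pdf(yi, mu1, sigma)
                 + (1.0 - pi) * normal_pdf(yi, mu0, sigma))
        for yi, pi in zip(y, p)
    )


def fit_em(y, p, n_iter=200):
    """EM estimates of (mu0, mu1, sigma), started from the null fit."""
    n = len(y)
    mu0 = mu1 = sum(y) / n
    sigma = math.sqrt(sum((yi - mu0) ** 2 for yi in y) / n)
    for _ in range(n_iter):
        # E-step: probability of true mutation given the outcome and p[i].
        r = []
        for yi, pi in zip(y, p):
            a = pi * normal_pdf(yi, mu1, sigma)
            b = (1.0 - pi) * normal_pdf(yi, mu0, sigma)
            r.append(a / (a + b))
        # M-step: weighted means and a pooled variance.
        s1 = sum(r)
        mu1 = sum(ri * yi for ri, yi in zip(r, y)) / s1
        mu0 = sum((1.0 - ri) * yi for ri, yi in zip(r, y)) / (n - s1)
        sigma = math.sqrt(sum(
            ri * (yi - mu1) ** 2 + (1.0 - ri) * (yi - mu0) ** 2
            for ri, yi in zip(r, y)) / n)
    return mu0, mu1, sigma


def lrt_pvalue(y, p):
    """Likelihood ratio test of mu1 == mu0 (chi-square, 1 degree of freedom)."""
    n = len(y)
    mean = sum(y) / n
    sd = math.sqrt(sum((yi - mean) ** 2 for yi in y) / n)
    ll_null = marginal_loglik(y, p, mean, mean, sd)
    mu0, mu1, sigma = fit_em(y, p)
    ll_alt = marginal_loglik(y, p, mu0, mu1, sigma)
    stat = max(0.0, 2.0 * (ll_alt - ll_null))
    # Survival function of chi-square with 1 df: erfc(sqrt(x / 2)).
    return stat, math.erfc(math.sqrt(stat / 2.0))


def simulate(n, effect, seed=0):
    """Toy data: calibrated mutation probabilities and a Gaussian outcome."""
    rng = random.Random(seed)
    y, p = [], []
    for _ in range(n):
        if rng.random() < 0.7:
            pi = rng.uniform(0.05, 0.30)
        else:
            pi = rng.uniform(0.70, 0.95)
        status = 1 if rng.random() < pi else 0  # true status, never observed
        y.append(effect * status + rng.gauss(0.0, 1.0))
        p.append(pi)
    return y, p


if __name__ == "__main__":
    y, p = simulate(300, effect=1.0, seed=1)
    stat, pval = lrt_pvalue(y, p)
    print("LRT statistic:", stat, "p-value:", pval)
```

A naive analysis would instead threshold the calls into 0/1 and run an ordinary regression, discarding the information about how uncertain each call is. The sketch keeps that uncertainty in the likelihood, which is the general idea the abstract describes.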



Correction: Association analysis using somatic mutations.

PLoS Genet. 2018 Dec 6;14(12):e1007848. doi: 10.1371/journal.pgen.1007848. eCollection 2018 Dec.

