Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, New York.
Cancer Res Commun. 2023 Mar 22;3(3):483-488. doi: 10.1158/2767-9764.CRC-22-0355. eCollection 2023 Mar.
UNLABELLED: Many studies have shown that the distributions of the genomic, nucleotide, and epigenetic contexts of somatic variants in tumors are informative of cancer etiology. Recently, a new direction of research has focused on extracting signals from the contexts of germline variants and evidence has emerged that patterns defined by these factors are associated with oncogenic pathways, histologic subtypes, and prognosis. It remains an open question whether aggregating germline variants using meta-features capturing their genomic, nucleotide, and epigenetic contexts can improve cancer risk prediction. This aggregation approach can potentially increase statistical power for detecting signals from rare variants, which have been hypothesized to be a major source of the missing heritability of cancer. Using germline whole-exome sequencing data from the UK Biobank, we developed risk models for 10 cancer types using known risk variants (cancer-associated SNPs and pathogenic variants in known cancer predisposition genes) as well as models that additionally include the meta-features. The meta-features did not improve the prediction accuracy of models based on known risk variants. It is possible that expanding the approach to whole-genome sequencing can lead to gains in prediction accuracy. SIGNIFICANCE: There is evidence that cancer is partly caused by rare genetic variants that have not yet been identified. We investigate this issue using novel statistical methods and data from the UK Biobank.
未加标签:许多研究表明,肿瘤中体细胞变异的基因组、核苷酸和表观遗传背景的分布为癌症病因学提供了信息。最近,研究的一个新方向集中于从种系变异的背景中提取信号,有证据表明,这些因素定义的模式与致癌途径、组织学亚型和预后相关。使用捕获其基因组、核苷酸和表观遗传背景的元特征来聚集种系变异是否可以提高癌症风险预测仍然是一个悬而未决的问题。这种聚合方法可以潜在地提高检测稀有变异信号的统计能力,这些变异被假设为癌症遗传缺失的主要来源。我们使用英国生物库的种系全外显子测序数据,开发了 10 种癌症类型的风险模型,使用已知的风险变异(与癌症相关的 SNP 和已知癌症易感性基因中的致病性变异)以及另外包括元特征的模型。元特征并没有提高基于已知风险变异的模型的预测准确性。可能将该方法扩展到全基因组测序可以提高预测准确性。
意义:有证据表明,癌症部分是由尚未确定的罕见遗传变异引起的。我们使用新的统计方法和英国生物库的数据来研究这个问题。
Clin Orthop Relat Res. 2020-11
Genes Chromosomes Cancer. 2022-2
Endocrinol Metab (Seoul). 2020-12
Genet Med. 2019-7-25
Int J Mol Sci. 2023-11-30
Eur J Hum Genet. 2022-3
Nat Commun. 2021-5-24