Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK.
Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Waltham, MA, USA.
Sci Adv. 2022 Aug 26;8(34):eabo6371. doi: 10.1126/sciadv.abo6371.
Large reference datasets of protein-coding variation in human populations have allowed us to determine which genes and genic subregions are intolerant to germline genetic variation. There is also a growing number of genes implicated in severe Mendelian diseases that overlap with genes implicated in cancer. We hypothesized that cancer-driving mutations might be enriched in genic subregions that are depleted of germline variation relative to somatic variation. We introduce a new metric, OncMTR (oncology missense tolerance ratio), which uses 125,748 exomes in the Genome Aggregation Database (gnomAD) to identify these genic subregions. We demonstrate that OncMTR can significantly predict driver mutations implicated in hematologic malignancies. Divergent OncMTR regions were enriched for cancer-relevant protein domains, and overlaying OncMTR scores on protein structures identified functionally important protein residues. Last, we performed a rare variant, gene-based collapsing analysis on an independent set of 394,694 exomes from the UK Biobank and find that OncMTR markedly improves genetic signals for hematologic malignancies.
大量的人类蛋白质编码变异参考数据集使我们能够确定哪些基因和基因亚区对种系遗传变异不具有耐受性。越来越多的与严重孟德尔疾病相关的基因与与癌症相关的基因重叠。我们假设,致癌突变可能在相对于体细胞变异而言种系变异减少的基因亚区中富集。我们引入了一个新的度量标准,OncMTR(肿瘤同义突变容忍比),它使用基因组聚合数据库(gnomAD)中的 125748 个外显子来识别这些基因亚区。我们证明 OncMTR 可以显著预测血液恶性肿瘤中涉及的驱动突变。不同的 OncMTR 区域富集了与癌症相关的蛋白质结构域,并且在蛋白质结构上叠加 OncMTR 评分可以确定功能重要的蛋白质残基。最后,我们在 UK Biobank 的 394694 个外显子的独立数据集上进行了罕见变异、基于基因的合并分析,发现 OncMTR 显著提高了血液恶性肿瘤的遗传信号。