Department of Biostatistics, Yale School of Public Health, New Haven, CT, United States of America.
Department of Biostatistics, Yale School of Public Health, New Haven, CT, United States of America; Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, United States of America; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, United States of America.
Math Biosci. 2023 Dec;366:109091. doi: 10.1016/j.mbs.2023.109091. Epub 2023 Nov 22.
Cancer occurs as a consequence of multiple somatic mutations that lead to uncontrolled cell growth. Mutual exclusivity and co-occurrence of mutations imply-but do not prove-that mutations exert synergistic or antagonistic epistatic effects on oncogenesis. Knowledge of these interactions, and the consequent trajectories of mutation and selection that lead to cancer has been a longstanding goal within the cancer research community. Recent research has revealed mutation rates and scaled selection coefficients for specific recurrent variants across many cancer types. However, there are no current methods to quantify the strength of selection incorporating pairwise and higher-order epistatic effects on selection within the trajectory of likely cancer genotoypes. Therefore, we have developed a continuous-time Markov chain model that enables the estimation of mutation origination and fixation (flux), dependent on somatic cancer genotype. Coupling this continuous-time Markov chain model with a deconvolution approach provides estimates of underlying mutation rates and selection across the trajectory of oncogenesis. We demonstrate computation of fluxes and selection coefficients in a somatic evolutionary model for the four most frequently variant driver genes (TP53, LRP1B, KRAS and STK11) from 565 cases of lung adenocarcinoma. Our analysis reveals multiple antagonistic epistatic effects that reduce the possible routes of oncogenesis, and inform cancer research regarding viable trajectories of somatic evolution whose progression could be forestalled by precision medicine. Synergistic epistatic effects are also identified, most notably in the somatic genotype TP53 LRP1B for mutations in the KRAS gene, and in somatic genotypes containing KRAS or TP53 mutations for mutations in the STK11 gene. Large positive fluxes of KRAS variants were driven by large selection coefficients, whereas the flux toward LRP1B mutations was substantially aided by a large mutation rate for this gene. The approach enables inference of the most likely routes of site-specific variant evolution and estimation of the strength of selection operating on each step along the route, a key component of what we need to know to develop and implement personalized cancer therapies.
癌症是由于多个体细胞突变导致细胞失控生长而产生的。突变的互斥性和共同发生意味着——但并不能证明——突变对致癌作用具有协同或拮抗的上位效应。了解这些相互作用以及导致癌症的突变和选择的后续轨迹一直是癌症研究界的一个长期目标。最近的研究揭示了许多癌症类型中特定反复出现的变体的突变率和规模选择系数。然而,目前没有方法可以量化在可能的癌症基因型轨迹中纳入对选择的成对和更高阶上位效应的选择强度。因此,我们开发了一种连续时间马尔可夫链模型,该模型能够根据体细胞癌症基因型估计突变起源和固定(通量)。将这种连续时间马尔可夫链模型与去卷积方法相结合,可以提供在致癌作用轨迹上的潜在突变率和选择的估计。我们在 565 例肺腺癌的四个最常变异的驱动基因(TP53、LRP1B、KRAS 和 STK11)的体细胞进化模型中计算了通量和选择系数。我们的分析揭示了多种拮抗的上位效应,这些效应减少了致癌作用的可能途径,并为癌症研究提供了有关可行的体细胞进化轨迹的信息,这些轨迹的进展可以通过精准医学来阻止。还确定了协同的上位效应,最显著的是在 KRAS 基因的体细胞基因型 TP53 LRP1B 中存在 KRAS 基因突变,以及在包含 KRAS 或 TP53 基因突变的体细胞基因型中存在 STK11 基因突变。KRAS 变体的大正通量是由大的选择系数驱动的,而 LRP1B 突变的通量则主要得益于该基因的高突变率。该方法能够推断出特定位点变体进化的最可能途径,并估计在途径上的每一步操作的选择强度,这是我们开发和实施个性化癌症治疗所需的关键部分。