Cao Kefan, Schwartz Russell
Computer Science Department, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, 15213, USA.
Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, 15213, USA.
bioRxiv. 2024 Dec 23:2024.12.23.629914. doi: 10.1101/2024.12.23.629914.
Understanding the evolution of cancer in its early stages is critical to identifying key drivers of cancer progression and developing better early diagnostics or prophylactic treatments. Early cancer is difficult to observe, though, since it is generally asymptomatic until extensive genetic damage has accumulated. In this study, we develop a computational approach to infer how once-healthy cells enter into and become committed to a pathway of aggressive cancer. We accomplish this through a strategy of using tumor phylogenetics to look backwards in time to earlier stages of tumor development combined with machine learning to infer how progression risk changes over those stages. We apply this paradigm to point mutation data from a set of cohorts from the Cancer Genome Atlas (TCGA) to formulate models of how progression risk evolves from the earliest stages of tumor growth, as well as how this evolution varies within and between cohorts. The results suggest general mechanisms by which risk develops as a cell population commits to aggressive cancer, but with significant variability between cohorts and individuals. These results imply limits to the potential for earlier diagnosis and intervention while also providing grounds for hope in extending these beyond current practice.
了解癌症早期阶段的演变对于确定癌症进展的关键驱动因素以及开发更好的早期诊断方法或预防性治疗至关重要。然而,早期癌症很难观察到,因为在积累大量基因损伤之前它通常没有症状。在本研究中,我们开发了一种计算方法来推断曾经健康的细胞如何进入并致力于侵袭性癌症的发展途径。我们通过一种策略来实现这一目标,即利用肿瘤系统发育学回顾肿瘤发展的早期阶段,并结合机器学习来推断进展风险在这些阶段如何变化。我们将这种范式应用于来自癌症基因组图谱(TCGA)的一组队列的点突变数据,以建立肿瘤生长最早阶段进展风险如何演变的模型,以及这种演变在队列内部和队列之间如何变化。结果表明了随着细胞群体发展为侵袭性癌症风险产生的一般机制,但队列和个体之间存在显著差异。这些结果意味着早期诊断和干预的潜力有限,同时也为将这些方法扩展到当前实践之外带来了希望。