Wang Menghan, Xie Yanqi, Liu Jinpeng, Li Austin, Chen Li, Stromberg Arnold, Arnold Susanne M, Liu Chunming, Wang Chi
Department of Statistics, University of Kentucky, Lexington, KY 40536, USA.
Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40508, USA.
Cancers (Basel). 2024 Jul 8;16(13):2488. doi: 10.3390/cancers16132488.
The development of cancer involves the accumulation of somatic mutations in several essential biological pathways. Delineating the temporal order of pathway mutations during tumorigenesis is crucial for comprehending the biological mechanisms underlying cancer development and identifying potential targets for therapeutic intervention. Several computational and statistical methods have been introduced for estimating the order of somatic mutations based on mutation profile data from a cohort of patients. However, one major issue of current methods is that they do not take into account intra-tumor heterogeneity (ITH), which limits their ability to accurately discern the order of pathway mutations. To address this problem, we propose PATOPAI, a probabilistic approach to estimate the temporal order of mutations at the pathway level by incorporating ITH information as well as pathway and functional annotation information of mutations. PATOPAI uses a maximum likelihood approach to estimate the probability of pathway mutational events occurring in a specific sequence, wherein it focuses on the orders that are consistent with the phylogenetic structure of the tumors. Applications to whole exome sequencing data from The Cancer Genome Atlas (TCGA) illustrate our method's ability to recover the temporal order of pathway mutations in several cancer types.
癌症的发展涉及多个关键生物学途径中体细胞突变的积累。描绘肿瘤发生过程中途径突变的时间顺序对于理解癌症发展的生物学机制以及确定治疗干预的潜在靶点至关重要。已经引入了几种计算和统计方法,用于根据一组患者的突变谱数据估计体细胞突变的顺序。然而,当前方法的一个主要问题是它们没有考虑肿瘤内异质性(ITH),这限制了它们准确辨别途径突变顺序的能力。为了解决这个问题,我们提出了PATOPAI,一种通过纳入ITH信息以及突变的途径和功能注释信息来估计途径水平突变时间顺序的概率方法。PATOPAI使用最大似然方法来估计特定序列中途径突变事件发生的概率,其中它关注与肿瘤系统发育结构一致的顺序。对来自癌症基因组图谱(TCGA)的全外显子测序数据的应用说明了我们的方法在几种癌症类型中恢复途径突变时间顺序的能力。