School of Mathematical Sciences, University of Science and Technology of China, Hefei, 230026, Anhui, China.
School of Mathematics and Statistics, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, China.
BMC Bioinformatics. 2024 Sep 13;25(1):300. doi: 10.1186/s12859-024-05897-1.
Overall Survival (OS) and Progression-Free Interval (PFI) as survival times have been collected in The Cancer Genome Atlas (TCGA). It is of biomedical interest to consider their dependence in pathway detection and survival prediction. We intend to develop novel methods for integrating PFI as condition based on parametric survival models for identifying pathways associated with OS and predicting OS.
Based on the framework of conditional probability, we developed a family of frailty-based parametric-models for this purpose, with exponential or Weibull distribution as baseline. We also considered two classes of existing methods with PFI as a covariate. We evaluated the performance of three approaches by analyzing RNA-seq expression data from TCGA for lung squamous cell carcinoma and lung adenocarcinoma (LUNG), brain lower grade glioma and glioblastoma multiforme (GBMLGG), as well as skin cutaneous melanoma (SKCM). Our focus was on fourteen general cancer-related pathways. The 10-fold cross-validation was employed for the evaluation of predictive accuracy. For LUNG, p53 signaling and cell cycle pathways were detected by all approaches. Furthermore, three approaches with the consideration of PFI demonstrated significantly better predictive performance compared to the approaches without the consideration of PFI. For GBMLGG, ten pathways (e.g., Wnt signaling, JAK-STAT signaling, ECM-receptor interaction, etc.) were detected by all approaches. Furthermore, three approaches with the consideration of PFI demonstrated better predictive performance compared to the approaches without the consideration of PFI. For SKCM, p53 signaling pathway was detected only by our Weibull-baseline-based model. And three approaches with the consideration of PFI demonstrated significantly better predictive performance compared to the approaches without the consideration of PFI.
Based on our study, it is necessary to incorporate PFI into the survival analysis of OS. Furthermore, PFI is a survival-type time, and improved results can be achieved by our conditional-probability-based approach.
总体生存时间(OS)和无进展间隔时间(PFI)已在癌症基因组图谱(TCGA)中收集。考虑它们在通路检测和生存预测中的依赖性是生物医学的研究热点。我们旨在开发新的方法,通过将 PFI 作为基于参数生存模型的条件来整合 PFI,以识别与 OS 相关并预测 OS 的通路。
基于条件概率的框架,我们为此目的开发了一系列基于脆弱性的参数模型,以指数或 Weibull 分布作为基线。我们还考虑了两种将 PFI 作为协变量的现有方法。我们通过分析来自 TCGA 的肺鳞癌和肺腺癌(LUNG)、脑低级别神经胶质瘤和胶质母细胞瘤多形性(GBMLGG)以及皮肤黑色素瘤(SKCM)的 RNA-seq 表达数据来评估三种方法的性能。我们的重点是十四种一般癌症相关通路。采用 10 折交叉验证评估预测准确性。对于 LUNG,所有方法都检测到 p53 信号通路和细胞周期通路。此外,与不考虑 PFI 的方法相比,考虑 PFI 的三种方法表现出显著更好的预测性能。对于 GBMLGG,所有方法都检测到十个通路(例如 Wnt 信号通路、JAK-STAT 信号通路、ECM-受体相互作用等)。此外,与不考虑 PFI 的方法相比,考虑 PFI 的三种方法表现出更好的预测性能。对于 SKCM,仅我们的 Weibull 基线模型检测到 p53 信号通路。与不考虑 PFI 的方法相比,考虑 PFI 的三种方法表现出显著更好的预测性能。
基于我们的研究,有必要将 PFI 纳入 OS 的生存分析中。此外,PFI 是一种生存类型的时间,我们基于条件概率的方法可以获得改进的结果。