Sun Gongping, Duan He, Xing Yuanhao, Zhang Dewei
Department of General Surgery, The Fourth Affiliated Hospital of the China Medical University, Shenyang, 110032, People's Republic of China.
China Medical University, Shenyang, 110000, People's Republic of China.
Cancer Manag Res. 2021 Jun 28;13:5113-5125. doi: 10.2147/CMAR.S312085. eCollection 2021.
We aimed to screen novel genetic biomarkers for use in a prognostic score (PS) model for the accurate prediction of survival outcomes for patients with colon adenocarcinoma (COAD).
Gene expression and methylation data were downloaded from The Cancer Genome Atlas database, and the samples were randomly divided into training and validation sets for the screening of differentially methylated genes (DMGs) and differentially expressed genes (DEGs). Co-methylated genes were screened using weighted gene co-expression network analysis. Functional enrichment analysis was performed using the Database for Annotation, Visualization, and Integrated Discovery. Univariate and multivariate Cox regression analyses were performed to identify prognosis-related genes and clinical factors. Receiver operating characteristic curve analysis was carried out to evaluate the predictive performance of the PS model.
In total, 1434 DEGs and 1038 DMGs were screened in the training set, among which 284 were found to be overlapping genes. For 127 of these overlapping genes, the methylation and expression levels were significantly negatively correlated. An optimal signature from 10 DMGs was identified to construct the PS model. Patients with a high PS seemed to have worse outcomes than those with a low PS. Moreover, cancer recurrence and the PS model status were independent prognostic factors.
This PS model based on an optimal 10-gene signature would help in the stratification of patients with COAD and improve the assessment of their clinical outcomes.
我们旨在筛选新型遗传生物标志物,用于构建预后评分(PS)模型,以准确预测结肠腺癌(COAD)患者的生存结局。
从癌症基因组图谱数据库下载基因表达和甲基化数据,将样本随机分为训练集和验证集,用于筛选差异甲基化基因(DMG)和差异表达基因(DEG)。使用加权基因共表达网络分析筛选共甲基化基因。使用注释、可视化和综合发现数据库进行功能富集分析。进行单变量和多变量Cox回归分析,以确定与预后相关的基因和临床因素。进行受试者工作特征曲线分析,以评估PS模型的预测性能。
在训练集中共筛选出1434个DEG和1038个DMG,其中284个为重叠基因。在这些重叠基因中,有127个基因的甲基化水平与表达水平呈显著负相关。从10个DMG中确定了一个最佳特征,用于构建PS模型。PS高的患者似乎比PS低的患者预后更差。此外,癌症复发和PS模型状态是独立的预后因素。
这种基于10个基因最佳特征的PS模型将有助于对COAD患者进行分层,并改善对其临床结局的评估。