School and Hospital of Stomatology, Shandong University & Shandong Key Laboratory of Oral Tissue Regeneration & Shandong Engineering Laboratory for Dental Materials and Oral Tissue Regeneration, Jinan, Shandong, China.
Sichuan University, West China Hospital of Stomatology, Department of Orthodontontics, State Key Laboratory of Oral Disease, National Clinical Research Centre of Oral Disease, Chengdu, China.
DNA Cell Biol. 2020 Apr;39(4):709-723. doi: 10.1089/dna.2019.5272. Epub 2020 Feb 11.
Head and neck squamous cell carcinoma (HNSCC) is the sixth malignancy, which is characterized by poor prognosis or high mortality because of the lack of predicting markers. Aberrant cancer pseudogenes have been found predictive for prognosis. We aim to identify a pseudogene-based prognosis signature for HNSCC by machine learning. RNA-seq data were downloaded from The Cancer Genome Atlas, and 700 differentially-expressed pseudogenes were identified. The survival-related pseudogenes were screened through COX-regression analysis, which includes univariate regression, least absolute shrinkage and selection operator regression, and multivariate regression, and a five-pseudogene signature was constructed. The value of prediction for the signature was validated in multiple subgroups in terms of survival. Gene set enrichment analysis (GSEA) and coexpression analysis were used to determine the underlying biological functions. Seven hundred dysregulated pseudogenes were identified, and the five-pseudogene signature can distinguish the low-risk and high-risk patients for both training and testing sets and predicted prognosis with high sensitivity and specificity. Furthermore, the signature was applicable to patients of different genders, ages, stages, and grades. Coexpression analysis revealed that the five-pseudogene is associated with immune system. GSEA showed cancer-related biological process and pathways the five-pseudogene involved in. The five-pseudogene signature is not only a novel marker for prognosis but also a promising signature for monitoring therapeutic schedule. Therefore, our findings may have potential clinical significance.
头颈部鳞状细胞癌 (HNSCC) 是第六大恶性肿瘤,由于缺乏预测标志物,预后较差或死亡率较高。异常的癌症假基因已被发现具有预测预后的能力。我们旨在通过机器学习确定基于假基因的 HNSCC 预后特征。从癌症基因组图谱下载了 RNA-seq 数据,并鉴定了 700 个差异表达的假基因。通过 COX 回归分析筛选与生存相关的假基因,包括单变量回归、最小绝对值收缩和选择算子回归以及多变量回归,并构建了一个五假基因特征。该特征的预测价值在多个生存亚组中得到了验证。基因集富集分析 (GSEA) 和共表达分析用于确定潜在的生物学功能。鉴定了 700 个失调的假基因,该五假基因特征可以区分训练集和测试集的低风险和高风险患者,并具有高灵敏度和特异性预测预后的能力。此外,该特征适用于不同性别、年龄、分期和分级的患者。共表达分析显示,这五个假基因与免疫系统有关。GSEA 显示了五个假基因涉及的癌症相关生物过程和途径。该五假基因特征不仅是一种新的预后标志物,也是监测治疗方案的有前途的标志物。因此,我们的研究结果可能具有潜在的临床意义。