Zhang Chengpeng, Huang Yong, Fang Chen, Liang Yingkuan, Jiang Dong, Li Jiaxi, Ma Haitao, Jiang Wei, Feng Yu
Department of Thoracic Surgery, Suzhou Ninth People's Hospital, Suzhou, Jiangsu, China.
Department of Thoracic Surgery, Haimen People's Hospital, Nantong, Jiangsu, China.
Cancer Biomark. 2023;36(4):313-326. doi: 10.3233/CBM-220227.
We performed a bioinformatics analysis to screen for cell cycle-related differentially expressed genes (DEGs) and constructed a model for the prognostic prediction of patients with early-stage lung squamous cell carcinoma (LSCC).
From a gene expression omnibus (GEO) database, the GSE157011 dataset was randomly divided into an internal training group and an internal testing group at a 1:1 ratio, and the GSE30219, GSE37745, GSE42127, and GSE73403 datasets were merged as the external validation group. We performed single-sample gene set enrichment analysis (ssGSEA), univariate Cox analysis, and difference analysis, and identified 372 cell cycle-related genes. Additionally, we combined LASSO/Cox regression analysis to construct a prognostic model. Then, patients were divided into high-risk and low-risk groups according to risk scores. The internal testing group, discovery set, and external verification set were used to assess model reliability. We used a nomogram to predict patient prognoses based on clinical features and risk values. Clinical relevance analysis and the Human Protein Atlas (HPA) database were used to verify signature gene expression.
Ten cell cycle-related DEGs (EIF2B1, FSD1L, FSTL3, ORC3, HMMR, SETD6, PRELP, PIGW, HSD17B6, and GNG7) were identified and a model based on the internal training group constructed. From this, patients in the low-risk group had a higher survival rate when compared with the high-risk group. Time-dependent receiver operating characteristic (tROC) and Cox regression analyses showed the model was efficient and accurate. Clinical relevance analysis and the HPA database showed that DEGs were significantly dysregulated in LSCC tissue.
Our model predicted the prognosis of early-stage LSCC patients and demonstrated potential applications for clinical decision-making and individualized therapy.
我们进行了一项生物信息学分析,以筛选细胞周期相关的差异表达基因(DEG),并构建了一个用于预测早期肺鳞状细胞癌(LSCC)患者预后的模型。
从基因表达综合数据库(GEO)中,将GSE157011数据集按1:1的比例随机分为内部训练组和内部测试组,并将GSE30219、GSE37745、GSE42127和GSE73403数据集合并作为外部验证组。我们进行了单样本基因集富集分析(ssGSEA)、单变量Cox分析和差异分析,并鉴定出372个细胞周期相关基因。此外,我们结合LASSO/Cox回归分析构建了一个预后模型。然后,根据风险评分将患者分为高风险组和低风险组。使用内部测试组、发现集和外部验证集来评估模型的可靠性。我们使用列线图根据临床特征和风险值预测患者的预后。使用临床相关性分析和人类蛋白质图谱(HPA)数据库来验证特征基因的表达。
鉴定出10个细胞周期相关的DEG(EIF2B1、FSD1L、FSTL3、ORC3、HMMR、SETD6、PRELP、PIGW、HSD17B6和GNG7),并基于内部训练组构建了一个模型。由此,与高风险组相比,低风险组患者的生存率更高。时间依赖性受试者工作特征(tROC)和Cox回归分析表明该模型有效且准确。临床相关性分析和HPA数据库表明,DEG在LSCC组织中显著失调。
我们的模型预测了早期LSCC患者的预后,并展示了在临床决策和个体化治疗中的潜在应用。