Jiang Aimin, Wang Jingjing, Liu Na, Zheng Xiaoqiang, Li Yimeng, Ma Yuyan, Zheng Haoran, Chen Xue, Fan Chaoxin, Zhang Rui, Fu Xiao, Yao Yu
Department of Medical Oncology, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.
Front Genet. 2022 Jan 27;13:833797. doi: 10.3389/fgene.2022.833797. eCollection 2022.
Lung adenocarcinoma (LUAD) remains a lethal disease worldwide, with numerous studies exploring its potential prognostic markers using traditional RNA sequencing (RNA-) data. However, it cannot detect the exact cellular and molecular changes in tumor cells. This study aimed to construct a prognostic model for LUAD using single-cell RNA- (scRNA-) and traditional RNA- data. Bulk RNA- data were downloaded from The Cancer Genome Atlas (TCGA) database. LUAD scRNA- data were acquired from Gene Expression Omnibus (GEO) database. The uniform manifold approximation and projection (UMAP) was used for dimensionality reduction and cluster identification. Weighted Gene Correlation Network Analysis (WGCNA) was utilized to identify key modules and differentially expressed genes (DEGs). The non-negative Matrix Factorization (NMF) algorithm was used to identify different subtypes based on DEGs. The Cox regression analysis was used to develop the prognostic model. The characteristics of mutation landscape, immune status, and immune checkpoint inhibitors (ICIs) related genes between different risk groups were also investigated. scRNA- data of four samples were integrated to identify 13 clusters and 9cell types. After applying differential analysis, NK cells, bladder epithelial cells, and bronchial epithelial cells were identified as significant cell types. Overall, 329 DEGs were selected for prognostic model construction through differential analysis and WGCNA. Besides, NMF identified two clusters based on DEGs in the TCGA cohort, with distinct prognosis and immune characteristics being observed. We developed a prognostic model based on the expression levels of six DEGs. A higher risk score was significantly correlated with poor survival outcomes but was associated with a more frequent mutation rate, higher tumor mutation burden (TMB), and up-regulation of . Two independent external validation cohorts were also adopted to verify our results, with consistent results observed in them. This study constructed and validated a prognostic model for LUAD by integrating 10× scRNA- and bulk RNA- data. Besides, we observed two distinct subtypes in this population, with different prognosis and immune characteristics.
肺腺癌(LUAD)在全球范围内仍然是一种致命疾病,众多研究利用传统RNA测序(RNA-)数据探索其潜在的预后标志物。然而,它无法检测肿瘤细胞中确切的细胞和分子变化。本研究旨在利用单细胞RNA-(scRNA-)和传统RNA-数据构建LUAD的预后模型。批量RNA-数据从癌症基因组图谱(TCGA)数据库下载。LUAD的scRNA-数据从基因表达综合数据库(GEO)获取。使用均匀流形近似和投影(UMAP)进行降维和聚类识别。利用加权基因共表达网络分析(WGCNA)识别关键模块和差异表达基因(DEG)。非负矩阵分解(NMF)算法用于基于DEG识别不同亚型。采用Cox回归分析建立预后模型。还研究了不同风险组之间的突变图谱特征、免疫状态和免疫检查点抑制剂(ICI)相关基因。整合四个样本的scRNA-数据以识别13个聚类和9种细胞类型。经过差异分析后,自然杀伤细胞、膀胱上皮细胞和支气管上皮细胞被确定为显著的细胞类型。总体而言,通过差异分析和WGCNA选择了329个DEG用于预后模型构建。此外,NMF在TCGA队列中基于DEG识别出两个聚类,观察到不同的预后和免疫特征。我们基于六个DEG的表达水平建立了一个预后模型。较高的风险评分与较差的生存结果显著相关,但与更高的突变率、更高的肿瘤突变负荷(TMB)以及[此处原文缺失部分内容]的上调相关。还采用了两个独立的外部验证队列来验证我们的结果,在这些队列中观察到了一致的结果。本研究通过整合10× scRNA-和批量RNA-数据构建并验证了LUAD的预后模型。此外,我们在该人群中观察到两种不同的亚型,具有不同的预后和免疫特征。