Feng Haiming, Zhao Ye, Yan Weijian, Wei Xiaoping, Lin Junping, Jiang Peng, Wang Cheng, Li Bin
Department of Thoracic Surgery, Second Clinical Medical College, Lanzhou University, Lanzhou, China.
First Clinical Medical College, Lanzhou University, Lanzhou, China.
Front Med (Lausanne). 2022 Feb 25;9:843749. doi: 10.3389/fmed.2022.843749. eCollection 2022.
The implication of the Estimation of Stromal and Immune cells in Malignant tumor tissues using expression data (ESTIMATE) method to determine the tumor microenvironment (TME) and tumor immune score including tumor purity represents an efficient method to identify and assess biomarkers for immunotherapy response in precision medicine. In this study we utilized a machine learning algorithm to analyze the Cancer Genome Atlas (TCGA) and Gene Expression Omnibus database (GEO) lung adenocarcinoma (LUAD) transcriptome data to evaluate the association between TME and tumor purity. Furthermore, we investigated whether fewer TME components or a few dominant genes can infer tumor purity. The results indicated that the 29 immune infiltrating components determined by the ssGSEA method could screen the 5 TME components [chemokine C-C-Motif receptor (CCR), T-helper-cells, Check-point, Treg, and tumor-infiltrating lymphocytes (TIL)] that significantly contributed the most to tumor purity prediction through regression tree and random forest regression methods. The findings revealed that higher activity of these five immune infiltrating components significantly lowered the tumor purity. Moreover, 5 TME components contributed significantly to the improvement of Mean Square Error (MES); therefore, we selected these five sets' genes and analyzed survival data to establish a prognostic model. We screened out 11 prognostic-related genes and constructed a risk model comprising 11 genes with good predictive value for patients' prognosis. Furthermore, we obtained four genes (GIMAP6, CD80, IL16, and CCR2) that had predictive advantages for tumor purity using random forest classification and random forest regression. The comprehensive score of genes for tumor purity prediction (CSGTPP) was obtained by least absolute shrinkage and selection operator (LASSO) regression indicated that four genes could be successfully used to classify high and low CSGTPP samples and that tumor purity was negatively correlated with CSGTPP. Survival analysis revealed that the higher the CSGTPP, the better the prognosis of patients. The association between a cluster of differentiation 274 (CD274) and CSGTPP revealed a higher expression of CD274 in the high CSGTPP group. Collectively, we speculated that CSGTPP could serve as a predictor of the response to immunotherapy and a promising indicator of immunotherapy effect.
利用表达数据估算恶性肿瘤组织中的基质细胞和免疫细胞(ESTIMATE)方法来确定肿瘤微环境(TME)以及包括肿瘤纯度在内的肿瘤免疫评分,是精准医学中识别和评估免疫治疗反应生物标志物的有效方法。在本研究中,我们利用机器学习算法分析癌症基因组图谱(TCGA)和基因表达综合数据库(GEO)中的肺腺癌(LUAD)转录组数据,以评估TME与肿瘤纯度之间的关联。此外,我们研究了较少的TME成分或少数主导基因是否能够推断肿瘤纯度。结果表明,通过单样本基因集富集分析(ssGSEA)方法确定的29种免疫浸润成分,能够筛选出5种对肿瘤纯度预测贡献最大的TME成分[趋化因子C-C基序受体(CCR)、辅助性T细胞、检查点、调节性T细胞和肿瘤浸润淋巴细胞(TIL)],通过回归树和随机森林回归方法实现。研究结果显示,这五种免疫浸润成分的较高活性显著降低了肿瘤纯度。此外,5种TME成分对均方误差(MES)的改善有显著贡献;因此,我们选择这五组基因并分析生存数据以建立一个预后模型。我们筛选出11个与预后相关的基因,并构建了一个包含11个基因的风险模型,该模型对患者预后具有良好的预测价值。此外,我们利用随机森林分类和随机森林回归获得了4个对肿瘤纯度具有预测优势的基因(GIMAP6、CD80、IL16和CCR2)。通过最小绝对收缩和选择算子(LASSO)回归获得的肿瘤纯度预测基因综合评分(CSGTPP)表明,这4个基因能够成功用于对高CSGTPP样本和低CSGTPP样本进行分类,并且肿瘤纯度与CSGTPP呈负相关。生存分析显示,CSGTPP越高,患者的预后越好。分化簇274(CD274)与CSGTPP之间关联显示,高CSGTPP组中CD274表达较高。总体而言,我们推测CSGTPP可作为免疫治疗反应的预测指标以及免疫治疗效果的一个有前景的指标。