Hu Huanhuan, Yuan Shenglong, Fu Yuqi, Li Huixin, Xiao Shuyue, Gong Zhen, Zhong Shanliang
Department of Gynecology, Women's Hospital of Nanjing Medical University & Nanjing Women and Children's Healthcare Hospital, Nanjing, China.
Department of Medical Oncology, The Affiliated Cancer Hospital of Nanjing Medical University & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, China.
Transl Cancer Res. 2024 Jul 31;13(7):3652-3667. doi: 10.21037/tcr-24-215. Epub 2024 Jul 5.
Changes in gene expression are associated with malignancy. Analysis of gene expression data could be used to reveal cancer subtypes, key molecular drivers, and prognostic characteristics and to predict cancer susceptibility, treatment response, and mortality. It has been reported that inflammation plays an important role in the occurrence and development of tumors. Our aim was to establish a risk signature model of breast cancer with inflammation-related genes (IRGs) to evaluate their survival prognosis.
We downloaded 200 IRGs from the Molecular Signatures Database (MSigDB). The data of breast cancer were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). Differential gene expression analysis, the least absolute shrinkage and selection operator (LASSO), Cox regression analysis, and overall survival (OS) analysis were used to construct a multiple-IRG risk signature. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were carried out to annotate functions of the differentially expressed IRGs (DEIRGs) The predictive accuracy of the prognostic model was evaluated by time-dependent receiver operating characteristic (ROC) curves. Subsequently, nomograms were constructed to guide clinical application according to the univariate and multivariate Cox proportional hazards regression analyses. Eventually, we applied gene set variation analysis (GSVA), mutation analysis, immune infiltration analysis, and drug response analysis to compare the differences between high- and low-risk patients.
Totally, 65 DEIRGs were obtained after comparing 1,092 breast cancer tissues with 113 paracancerous tissues in TCGA. Among them, 11 IRGs (, , , , , , , , , , ) were screened with nonzero coefficient by LASSO regression analysis to construct the prognostic model, which was validated in GSE96058.The 11-gene IRGs risk signature model stratified patients into high- or low-risk groups, with those in the low-risk group having longer survival time and less deaths. Multivariate Cox analysis manifested that risk score, age, and stage were the three independent prognostic factors for breast cancer patients. There were 12 pathways with higher activities and 24 pathways with lower activities in the high-risk group compared with the low-risk group, yet no difference of gene mutation load was observed between the two groups. In immune infiltration analysis, we noted that the proportion of T cells showed a decreased trend according to the increase of risk score and most of the immune cells were enriched in the low-risk group. Inversely, macrophages M2 were more highly distributed in the high-risk group. We identified 67 approved drugs that showed a different effect between the high- and low-risk patients and the top 2 gene-drug pairs were -sunitinib and -ruxolitinib.
The 11-IRG risk signature model is a promising tool to predict the survival of breast cancer patients and the expressions of and may serve as potential targets for therapy of breast cancer.
基因表达变化与恶性肿瘤相关。基因表达数据分析可用于揭示癌症亚型、关键分子驱动因素和预后特征,并预测癌症易感性、治疗反应和死亡率。据报道,炎症在肿瘤的发生和发展中起重要作用。我们的目的是建立一个包含炎症相关基因(IRGs)的乳腺癌风险特征模型,以评估其生存预后。
我们从分子特征数据库(MSigDB)下载了200个IRGs。乳腺癌数据来自癌症基因组图谱(TCGA)和基因表达综合数据库(GEO)。采用差异基因表达分析、最小绝对收缩和选择算子(LASSO)、Cox回归分析和总生存(OS)分析来构建多IRG风险特征。进行基因本体论(GO)和京都基因与基因组百科全书(KEGG)富集分析以注释差异表达IRGs(DEIRGs)的功能。通过时间依赖的受试者工作特征(ROC)曲线评估预后模型的预测准确性。随后,根据单变量和多变量Cox比例风险回归分析构建列线图以指导临床应用。最后,我们应用基因集变异分析(GSVA)、突变分析、免疫浸润分析和药物反应分析来比较高风险和低风险患者之间的差异。
在TCGA中,将1092个乳腺癌组织与113个癌旁组织进行比较后,共获得65个DEIRGs。其中,通过LASSO回归分析筛选出11个IRGs(,,,,,,,,,,)具有非零系数,用于构建预后模型,并在GSE96058中进行了验证。11基因IRGs风险特征模型将患者分为高风险或低风险组,低风险组患者的生存时间更长,死亡人数更少。多变量Cox分析表明,风险评分、年龄和分期是乳腺癌患者的三个独立预后因素。与低风险组相比,高风险组有12条通路活性较高,24条通路活性较低,但两组之间未观察到基因突变负荷的差异。在免疫浸润分析中,我们注意到T细胞比例随着风险评分的增加呈下降趋势,并且大多数免疫细胞在低风险组中富集。相反,M2巨噬细胞在高风险组中分布更广泛。我们鉴定出67种已批准的药物,在高风险和低风险患者之间显示出不同的效果,前2个基因 - 药物对是 - 舒尼替尼和 - 鲁索替尼。
11 - IRG风险特征模型是预测乳腺癌患者生存的有前景的工具,并且和的表达可能作为乳腺癌治疗的潜在靶点。