Suppr超能文献

基于非负矩阵分解的生物信息学分析揭示,TPX2 和 SELENBP1 是肺腺癌内部共识亚群的两个预测因子。

Nonnegative matrix factorization-based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub-consensuses of lung adenocarcinoma.

机构信息

Fujian Key Laboratory for Prenatal Diagnosis and Birth Defect, Fujian Maternity and Child Health Hospital, Affiliated Hospital of Fujian Medical University, Fuzhou, Fujian, China.

Key Laboratory of Technical Evaluation of Fertility Regulation for Non-human Primate, National Health and Family Planning Commission, Fuzhou, Fujian, China.

出版信息

Cancer Med. 2021 Dec;10(24):9058-9077. doi: 10.1002/cam4.4386. Epub 2021 Nov 3.

Abstract

BACKGROUND

Lung adenocarcinoma (LUAD) is a heterogeneous disease. However the inner sub-groups of LUAD have not been fully studied. Markers predicted the sub-groups and prognosis of LUAD are badly needed.

AIMS

To identify biomarkers associated with the sub-groups and prognosis of LUAD.

MATERIALS AND METHODS

Using nonnegative matrix factorization (NMF) clustering, LUAD patients from The Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO) datasets and LUAD cell lines from Genomics of Drug Sensitivity in Cancer (GDSC) dataset were divided into different sub-consensuses based on the gene expression profiling. The overall survival of LUAD patients in each sub-consensus was determined by Kaplan-Meier survival analysis. The common genes which were differentially expressed in each sub-consensus of LUAD patients and LUAD cell lines were identified using TBtools. The predictive accuracy of TPX2 and SELENBP1 for theinner sub-consensuses of LUAD was determined by Receiver operator characteristic (ROC) analysis. The Kaplan-Meier survival analysis was also used to test the prognostic significance of TPX2 and SELENBP1 in LUAD patients.

RESULTS

Using nonnegative matrix factorization clustering, LUAD patients in The Cancer Genome Atlas (TCGA), GSE30219, GSE42127, GSE50081, GSE68465, and GSE72094 datasets were divided into three sub-consensuses. Sub-consensus3 LUAD patients were with low overall survival and were with high TP53 mutations. Similarly, LUAD cell lines were also divided into three sub-consensuses by NMF method, and sub-consensus2 cell lines were resistant to EGFR inhibitors. Identification of the common genes which were differentially expressed in different sub-consensuses of LUAD patients and LUAD cell lines revealed that TPX2 was highly expressed in sub-consensus3 LUAD patients and sub-consensus2 LUAD cell lines. On the contrary, SELENBP1 was highly expressed in sub-consensus1 LUAD patients and sub-consensus1 LUAD cell lines. The expression levels of TPX2 and SELENBP1 could distinguish sub-consensus3 LUAD patients or sub-consensus2 LUAD cell lines from other sub-consensuses of LUAD patients or cell lines. Moreover, compared with normal lung tissues, TPX2 was highly expressed, while, SELENBP1 was lowly expressed in LUAD tissues. Furthermore, the higher expression levels of TPX2 were associated with the lower relapse-free survival and the lower overall survival of LUAD patients. While, the higher expression levels of SELENBP1 were associated with the higher relapse-free survival and higher overall survival. At last, we showed that TP53 mutant LUAD patients were with higher TPX2 and lower SELENBP1 expressions.

DISCUSSION

Both iCluster and NMF method are proved to be robust LUAD classification systems. However, the LUAD patients in different iclusters had no significant clinical overall survival, while, sub-consensus3 LUAD patients from NMF classification were with lower overall survival than other sub-consensuses.

CONCLUSIONS

By integrated analysis of 1765 LUAD patients and 64 LUAD cell lines, we showed that NMF was a robust inner sub-consensuses classification method of LUAD. TPX2 and SELENBP1 were differentially expressed in different LUAD sub- consensuses, and predicted the inner sub-consensuses of LUAD with high accuracy. TPX2 was an unfavorable prognostic biomarker of LUAD which was up-regulated in LUAD tissues and associated with the low overall survival of LUAD. SELENBP1 was a favorable prognostic biomarker of LUAD which was down-regulated in LUAD tissues and associated with the prolonged overall survival of LUAD.

摘要

背景

肺腺癌 (LUAD) 是一种异质性疾病。然而,LUAD 的内部亚群尚未得到充分研究。迫切需要预测 LUAD 亚群和预后的标志物。

目的

鉴定与 LUAD 的亚群和预后相关的生物标志物。

材料和方法

使用非负矩阵分解 (NMF) 聚类,根据基因表达谱将来自癌症基因组图谱 (TCGA)、基因表达综合数据库 (GEO) 数据集的 LUAD 患者和来自癌症药物敏感性基因组学 (GDSC) 数据集的 LUAD 细胞系分为不同的子共识。通过 Kaplan-Meier 生存分析确定每个子共识中 LUAD 患者的总体生存率。使用 TBtools 鉴定 LUAD 患者和 LUAD 细胞系的每个子共识中差异表达的共同基因。通过接收器工作特征 (ROC) 分析确定 TPX2 和 SELENBP1 对 LUAD 内部子共识的预测准确性。Kaplan-Meier 生存分析也用于测试 TPX2 和 SELENBP1 在 LUAD 患者中的预后意义。

结果

使用非负矩阵分解聚类,将 TCGA、GSE30219、GSE42127、GSE50081、GSE68465 和 GSE72094 数据集的 LUAD 患者分为三个子共识。子共识 3 LUAD 患者总体生存率较低,且 TP53 突变率较高。同样,NMF 方法也将 LUAD 细胞系分为三个子共识,子共识 2 细胞系对 EGFR 抑制剂有耐药性。鉴定 LUAD 患者和 LUAD 细胞系的不同子共识中差异表达的共同基因表明,TPX2 在子共识 3 LUAD 患者和子共识 2 LUAD 细胞系中高度表达。相反,SELENBP1 在子共识 1 LUAD 患者和子共识 1 LUAD 细胞系中高度表达。TPX2 和 SELENBP1 的表达水平可将子共识 3 LUAD 患者或子共识 2 LUAD 细胞系与其他 LUAD 患者或细胞系的子共识区分开来。此外,与正常肺组织相比,TPX2 在 LUAD 组织中高表达,而 SELENBP1 低表达。此外,TPX2 的高表达水平与 LUAD 患者无复发生存率和总体生存率降低有关。而 SELENBP1 的高表达水平与无复发生存率和总体生存率升高有关。最后,我们表明 TP53 突变型 LUAD 患者的 TPX2 表达较高,SELENBP1 表达较低。

讨论

iCluster 和 NMF 方法均被证明是稳健的 LUAD 分类系统。然而,不同 iclusters 中的 LUAD 患者没有显著的临床总体生存率,而 NMF 分类的子共识 3 LUAD 患者的总体生存率低于其他子共识。

结论

通过对 1765 名 LUAD 患者和 64 个 LUAD 细胞系的综合分析,我们表明 NMF 是 LUAD 的一种稳健内部子共识分类方法。TPX2 和 SELENBP1 在不同的 LUAD 亚群中差异表达,并具有较高的准确性预测 LUAD 的内部亚群。TPX2 是 LUAD 的不利预后生物标志物,在 LUAD 组织中上调,与 LUAD 的总体生存率降低相关。SELENBP1 是 LUAD 的有利预后生物标志物,在 LUAD 组织中下调,与 LUAD 的总体生存率延长相关。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9bb9/8683537/581ab3ee2914/CAM4-10-9058-g009.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验