Li Bing, Zhang Fengbin, Niu Qikai, Liu Jun, Yu Yanan, Wang Pengqian, Zhang Siqi, Zhang Huamin, Wang Zhong
Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China.
Department of Gastroenterology and Hepatology, The Fourth Hospital of Hebei Medical University, Shijiazhuang 050011, China.
Mol Ther Nucleic Acids. 2022 Dec 27;31:224-240. doi: 10.1016/j.omtn.2022.12.014. eCollection 2023 Mar 14.
Gastric cancer (GC) is a heterogeneous disease and a leading cause of cancer-related deaths. Discovering robust, clinically relevant molecular classifications is critical for guiding personalized therapies for GC. Here, we propose a refined molecular classification scheme for GC using integrated optimal algorithms and multi-omics data. Based on the important features of mRNA, microRNA, and DNA methylation data selected by the multivariate Cox regression model, three subtypes linked to distinct clinical outcomes were identified by combining similarity network fusion and consensus clustering methods. Three subtypes were validated by an extreme gradient boosting machine learning prediction model with 125 differentially expressed genes in multiple independent cohorts. The molecular characteristics of mutation signatures, characteristic gene sets, driver genes, and chemotherapy sensitivity for each subtype were also identified: subtype 1 was associated with favorable prognosis and characterized by high ARID1A and PIK3CA mutations, subtype 2 was associated with a poor prognosis and harbored high recurrent TP53 mutations, and subtype 3 was associated with high CHD1, APOA1 mutations, and a poor prognosis. The proposed three-subtype scheme achieved a better clinical prediction performance (area under the curve value = 0.71) than The Cancer Genome Atlas classification, which may provide a practical subtyping framework to improve the treatment of GC.
胃癌(GC)是一种异质性疾病,也是癌症相关死亡的主要原因。发现强大的、与临床相关的分子分类对于指导GC的个性化治疗至关重要。在此,我们使用综合优化算法和多组学数据提出了一种针对GC的精细分子分类方案。基于多变量Cox回归模型选择的mRNA、微小RNA和DNA甲基化数据的重要特征,通过结合相似性网络融合和一致性聚类方法,鉴定出与不同临床结果相关的三种亚型。通过具有125个在多个独立队列中差异表达基因的极端梯度提升机器学习预测模型对三种亚型进行了验证。还确定了每种亚型的突变特征、特征基因集、驱动基因和化疗敏感性的分子特征:亚型1与良好预后相关,其特征是高ARID1A和PIK3CA突变;亚型2与不良预后相关,具有高复发性TP53突变;亚型3与高CHD1、APOA1突变和不良预后相关。所提出的三亚型方案比癌症基因组图谱分类具有更好的临床预测性能(曲线下面积值 = 0.71),这可能为改善GC的治疗提供一个实用的亚型框架。