Zengin Talip, Önal-Süzek Tuğba
Department of Molecular Biology and Genetics, Muğla Sıtkı Koçman University, 48000 Muğla, Turkey.
Department of Bioinformatics, Muğla Sıtkı Koçman University, 48000 Muğla, Turkey.
J Pers Med. 2021 Feb 23;11(2):154. doi: 10.3390/jpm11020154.
Lung cancer is the second most frequently diagnosed cancer type and responsible for the highest number of cancer deaths worldwide. Lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) are subtypes of non-small-cell lung cancer which has the highest frequency of lung cancer cases. We aimed to analyze genomic and transcriptomic variations including simple nucleotide variations (SNVs), copy number variations (CNVs) and differential expressed genes (DEGs) in order to find key genes and pathways for diagnostic and prognostic prediction for lung adenocarcinoma and lung squamous cell carcinoma. We performed a univariate Cox model and then lasso-regularized Cox model with leave-one-out cross-validation using The Cancer Genome Atlas (TCGA) gene expression data in tumor samples. We generated 35- and 33-gene signatures for prognostic risk prediction based on the overall survival time of the patients with LUAD and LUSC, respectively. When we clustered patients into high- and low-risk groups, the survival analysis showed highly significant results with high prediction power for both training and test datasets. Then, we characterized the differences including significant SNVs, CNVs, DEGs, active subnetworks, and the pathways. We described the results for the risk groups and cancer subtypes separately to identify specific genomic alterations between both high-risk groups and cancer subtypes. Both LUAD and LUSC high-risk groups have more downregulated immune pathways and upregulated metabolic pathways. On the other hand, low-risk groups have both up- and downregulated genes on cancer-related pathways. Both LUAD and LUSC have important gene alterations such as CDKN2A and CDKN2B deletions with different frequencies. SOX2 amplification occurs in LUSC and PSMD4 amplification in LUAD. EGFR and KRAS mutations are mutually exclusive in LUAD samples. EGFR, MGA, SMARCA4, ATM, RBM10, and KDM5C genes are mutated only in LUAD but not in LUSC. CDKN2A, PTEN, and HRAS genes are mutated only in LUSC samples. The low-risk groups of both LUAD and LUSC tend to have a higher number of SNVs, CNVs, and DEGs. The signature genes and altered genes have the potential to be used as diagnostic and prognostic biomarkers for personalized oncology.
肺癌是全球第二常见的确诊癌症类型,也是癌症死亡人数最多的癌症。肺腺癌(LUAD)和肺鳞状细胞癌(LUSC)是非小细胞肺癌的亚型,非小细胞肺癌是肺癌病例中最常见的类型。我们旨在分析基因组和转录组变异,包括单核苷酸变异(SNV)、拷贝数变异(CNV)和差异表达基因(DEG),以寻找肺腺癌和肺鳞状细胞癌诊断和预后预测的关键基因和通路。我们使用癌症基因组图谱(TCGA)肿瘤样本中的基因表达数据,进行了单变量Cox模型,然后是带有留一法交叉验证的套索正则化Cox模型。我们分别基于LUAD和LUSC患者的总生存时间,生成了用于预后风险预测的35基因和33基因特征。当我们将患者分为高风险组和低风险组时,生存分析显示出高度显著的结果,对训练数据集和测试数据集均具有很高的预测能力。然后,我们对差异进行了表征,包括显著的SNV、CNV、DEG、活跃子网和通路。我们分别描述了风险组和癌症亚型的结果,以确定高风险组和癌症亚型之间的特定基因组改变。LUAD和LUSC高风险组的免疫通路下调更多,代谢通路上调。另一方面,低风险组在癌症相关通路上既有上调基因也有下调基因。LUAD和LUSC都有重要的基因改变,如不同频率的CDKN2A和CDKN2B缺失。SOX2扩增发生在LUSC中,PSMD4扩增发生在LUAD中。EGFR和KRAS突变在LUAD样本中相互排斥。EGFR、MGA、SMARCA4、ATM、RBM10和KDM5C基因仅在LUAD中发生突变,而不在LUSC中发生突变。CDKN2A、PTEN和HRAS基因仅在LUSC样本中发生突变。LUAD和LUSC的低风险组往往具有更多的SNV、CNV和DEG。特征基因和改变的基因有可能用作个性化肿瘤学的诊断和预后生物标志物。