Department of Bioengineering, School of Chemical and BioTechnology, SASTRA deemed University, Thanjavur, Tamil Nadu, 613401, India.
Department of Bioinformatics, School of Chemical and BioTechnology, SASTRA deemed University, Thanjavur, Tamil Nadu, 613401, India.
BMC Cancer. 2019 Jul 5;19(1):663. doi: 10.1186/s12885-019-5838-3.
Liver cancer is among top deadly cancers worldwide with a very poor prognosis, and the liver is a vulnerable site for metastases of other cancers. Early diagnosis is crucial for treatment of the predominant liver cancers, namely hepatocellular carcinoma (HCC). Here we developed a novel computational framework for the stage-specific analysis of HCC.
Using publicly available clinical and RNA-Seq data of cancer samples and controls and the AJCC staging system, we performed a linear modelling analysis of gene expression across all stages and found significant genome-wide changes in the log fold-change of gene expression in cancer samples relative to control. To identify genes that were stage-specific controlling for confounding differential expression in other stages, we developed a set of six pairwise contrasts between the stages and enforced a p-value threshold (< 0.05) for each such contrast. Genes were specific for a stage if they passed all the significance filters for that stage. The monotonicity of gene expression with cancer progression was analyzed with a linear model using the cancer stage as a numeric variable.
Our analysis yielded two stage-I specific genes (CA9, WNT7B), two stage-II specific genes (APOBEC3B, FAM186A), ten stage-III specific genes including DLG5, PARI, NCAPG2, GNMT and XRCC2, and 35 stage-IV specific genes including GABRD, PGAM2, PECAM1 and CXCR2P1. Overexpression of DLG5 was found to be tumor-promoting contrary to the cancer literature on this gene. Further, GABRD was found to be signifincantly monotonically upregulated across stages. Our work has revealed 1977 genes with significant monotonic patterns of expression across cancer stages. NDUFA4L2, CRHBP and PIGU were top genes with monotonic changes of expression across cancer stages that could represent promising targets for therapy. Comparison with gene signatures from the BCLC staging system identified two genes, HSP90AB1 and ARHGAP42. Gene set enrichment analysis indicated overrepresented pathways specific to each stage, notably viral infection pathways in HCC initiation.
Our study identified novel significant stage-specific differentially expressed genes which could enhance our understanding of the molecular determinants of hepatocellular carcinoma progression. Our findings could serve as biomarkers that potentially underpin diagnosis as well as pinpoint therapeutic targets.
肝癌是全球致死率最高的癌症之一,预后极差,且肝脏是其他癌症转移的脆弱部位。早期诊断对于治疗主要的肝癌,即肝细胞癌(HCC)至关重要。在这里,我们开发了一种新的计算框架,用于 HCC 的特定阶段分析。
使用公开的癌症样本和对照的临床和 RNA-Seq 数据以及 AJCC 分期系统,我们对所有阶段的基因表达进行了线性建模分析,发现癌症样本中基因表达的对数倍数变化在全基因组范围内相对于对照有显著变化。为了确定在控制其他阶段差异表达的情况下具有阶段特异性的基因,我们在各个阶段之间开发了六对对比,并为每个对比设置了一个 p 值阈值(<0.05)。如果一个基因通过了该阶段所有的显著性过滤,那么这个基因就是该阶段特异性的。使用癌症阶段作为数值变量的线性模型分析了基因表达与癌症进展的单调性。
我们的分析产生了两个 I 期特异性基因(CA9、WNT7B)、两个 II 期特异性基因(APOBEC3B、FAM186A)、十个 III 期特异性基因,包括 DLG5、PARI、NCAPG2、GNMT 和 XRCC2,以及 35 个 IV 期特异性基因,包括 GABRD、PGAM2、PECAM1 和 CXCR2P1。与该基因的癌症文献相反,DLG5 的过表达被发现是促进肿瘤的。此外,GABRD 在各阶段均呈显著单调上调。我们的工作揭示了 1977 个具有显著的癌症阶段表达单调模式的基因。NDUFA4L2、CRHBP 和 PIGU 是具有癌症阶段表达单调变化的最显著基因,它们可能代表有前途的治疗靶点。与 BCLC 分期系统的基因特征进行比较,确定了两个基因,HSP90AB1 和 ARHGAP42。基因集富集分析表明,每个阶段都有特定的通路被过度表达,特别是在 HCC 发生的病毒感染通路。
我们的研究确定了新的显著的阶段特异性差异表达基因,这可以增强我们对肝细胞癌进展的分子决定因素的理解。我们的发现可以作为潜在的诊断标志物,并确定治疗靶点。