School of Science, Jiangnan University, Wuxi, 214122, China.
Wuxi Engineering Research Center for Biocomputing, Wuxi, 214122, China.
BMC Bioinformatics. 2021 Oct 25;22(Suppl 3):521. doi: 10.1186/s12859-021-04411-1.
Liver cancer is a common malignant tumor in China, with high mortality. Its occurrence and development were thoroughly studied by high-throughput expression microarray, which produced abundant data on gene expression, mRNA quantification and the clinical data of liver cancer. However, the hub genes, which can be served as biomarkers for diagnosis and treatment of early liver cancer, are not well screened.
Here we present a new method for getting 6 key genes, aiming to diagnose and treat the early liver cancer. We firstly analyzed the different expression microarrays based on TCGA database, and a total of 1564 differentially expressed genes were obtained, of which 1400 were up-regulated and 164 were down-regulated. Furthermore, these differentially expressed genes were studied by using GO and KEGG enrichment analysis, a PPI network was constructed based on the STRING database, and 15 hub genes were obtained. Finally, 15 hub genes were verified by applying the survival analysis method on Oncomine database, and 6 key genes were ultimately identified, including PLK1, CDC20, CCNB2, BUB1, MAD2L1 and CCNA2. The robustness analysis of four independent data sets verifies the accuracy of the key gene's classification of the data set.
Although there are complicated differences between cancer and normal cells in gene functions, cancer cells could be differentiated in case that a group of special genes expresses abnormally. Here we presented a new method to identify the 6 key genes for diagnosis and treatment of early liver cancer, and these key genes can help us understand the pathogenesis of liver cancer more deeply.
肝癌是中国常见的恶性肿瘤,死亡率较高。高通量表达微阵列对其发生发展进行了深入研究,产生了大量基因表达、mRNA 定量和肝癌临床数据。然而,作为早期肝癌诊断和治疗的生物标志物的关键基因尚未得到很好的筛选。
我们提出了一种新的方法来获得 6 个关键基因,旨在诊断和治疗早期肝癌。我们首先基于 TCGA 数据库分析了不同的表达微阵列,共获得了 1564 个差异表达基因,其中 1400 个上调,164 个下调。此外,我们对这些差异表达基因进行了 GO 和 KEGG 富集分析,基于 STRING 数据库构建了一个 PPI 网络,并获得了 15 个枢纽基因。最后,我们应用 Oncomine 数据库的生存分析方法验证了 15 个枢纽基因,最终确定了 6 个关键基因,包括 PLK1、CDC20、CCNB2、BUB1、MAD2L1 和 CCNA2。四个独立数据集的稳健性分析验证了关键基因对数据集分类的准确性。
尽管癌细胞与正常细胞在基因功能上存在复杂差异,但如果一组特殊基因异常表达,仍可以区分癌细胞。我们提出了一种新的方法来识别早期肝癌诊断和治疗的 6 个关键基因,这些关键基因可以帮助我们更深入地了解肝癌的发病机制。