Gudivada Indu Priya, Amajala Krishna Chaitanya
Department of Biochemistry and Bioinformatics, GITAM School of Science, GITAM (Deemed to be University), Visakhapatnam, 530045, Andhra Pradesh, India.
Curr Genomics. 2025;26(1):48-80. doi: 10.2174/0113892029308243240709073945. Epub 2024 Jul 18.
The damage in the liver and hepatocytes is where the primary liver cancer begins, and this is referred to as Hepatocellular Carcinoma (HCC). One of the best methods for detecting changes in gene expression of hepatocellular carcinoma is through bioinformatics approaches.
This study aimed to identify potential drug target(s) hubs mediating HCC progression using computational approaches through gene expression and protein-protein interaction datasets.
Four datasets related to HCC were acquired from the GEO database, and Differentially Expressed Genes (DEGs) were identified. Using Evenn, the common genes were chosen. Using the Fun Rich tool, functional associations among the genes were identified. Further, protein-protein interaction networks were predicted using STRING, and hub genes were identified using Cytoscape. The selected hub genes were subjected to GEPIA and Shiny GO analysis for survival analysis and functional enrichment studies for the identified hub genes. The up-regulating genes were further studied for immunohistopathological studies using HPA to identify gene/protein expression in normal HCC conditions. Drug Bank and Drug Gene Interaction Database were employed to find the reported drug status and targets. Finally, STITCH was performed to identify the functional association between the drugs and the identified hub genes.
The GEO2R analysis for the considered datasets identified 735 upregulating and 284 downregulating DEGs. Functional gene associations were identified through the Fun Rich tool. Further, PPIN network analysis was performed using STRING. A comparative study was carried out between the experimental evidence and the other seven data evidence in STRING, revealing that most proteins in the network were involved in protein-protein interactions. Further, through Cytoscape plugins, the ranking of the genes was analyzed, and densely connected regions were identified, resulting in the selection of the top 20 hub genes involved in HCC pathogenesis. The identified hub genes were: KIF2C, CDK1, TPX2, CEP55, MELK, TTK, BUB1, NCAPG, ASPM, KIF11, CCNA2, HMMR, BUB1B, TOP2A, CENPF, KIF20A, NUSAP1, DLGAP5, PBK, and CCNB2. Further, GEPIA and Shiny GO analyses provided insights into survival ratios and functional enrichment studied for the hub genes. The HPA database studies further found that upregulating genes were involved in changes in protein expression in Normal HCC tissues. These findings indicated that hub genes were certainly involved in the progression of HCC. STITCH database studies uncovered that existing drug molecules, including sorafenib, regorafenib, cabozantinib, and lenvatinib, could be used as leads to identify novel drugs, and identified hub genes could also be considered as potential and promising drug targets as they are involved in the gene-chemical interaction networks.
The present study involved various integrated bioinformatics approaches, analyzing gene expression and protein-protein interaction datasets, resulting in the identification of 20 top-ranked hubs involved in the progression of HCC. They are KIF2C, CDK1, TPX2, CEP55, MELK, TTK, BUB1, NCAPG, ASPM, KIF11, CCNA2, HMMR, BUB1B, TOP2A, CENPF, KIF20A, NUSAP1, DLGAP5, PBK, and CCNB2. Gene-chemical interaction network studies uncovered that existing drug molecules, including sorafenib, regorafenib, cabozantinib, and lenvatinib, can be used as leads to identify novel drugs, and the identified hub genes can be promising drug targets. The current study underscores the significance of targeting these hub genes and utilizing existing molecules to generate new molecules to combat liver cancer effectively and can be further explored in terms of drug discovery research to develop treatments for HCC.
肝脏和肝细胞的损伤是原发性肝癌的起始部位,这被称为肝细胞癌(HCC)。检测肝细胞癌基因表达变化的最佳方法之一是通过生物信息学方法。
本研究旨在通过基因表达和蛋白质-蛋白质相互作用数据集,利用计算方法识别介导HCC进展的潜在药物靶点枢纽。
从GEO数据库获取四个与HCC相关的数据集,并识别差异表达基因(DEG)。使用Evenn选择共同基因。使用Fun Rich工具识别基因之间的功能关联。此外,使用STRING预测蛋白质-蛋白质相互作用网络,并使用Cytoscape识别枢纽基因。对选定的枢纽基因进行GEPIA和Shiny GO分析,以进行生存分析和对已识别枢纽基因的功能富集研究。使用人类蛋白质图谱(HPA)对上调基因进行进一步的免疫组织病理学研究,以识别正常和HCC条件下的基因/蛋白质表达。利用药物银行和药物-基因相互作用数据库查找已报道的药物状态和靶点。最后,进行STITCH分析以识别药物与已识别枢纽基因之间的功能关联。
对所考虑数据集的GEO2R分析识别出735个上调和284个下调的DEG。通过Fun Rich工具识别功能基因关联。此外,使用STRING进行蛋白质-蛋白质相互作用网络(PPIN)分析。在STRING中对实验证据与其他七个数据证据进行了比较研究,结果表明网络中的大多数蛋白质参与了蛋白质-蛋白质相互作用。此外,通过Cytoscape插件分析基因排名,并识别紧密连接区域,从而选择出参与HCC发病机制的前20个枢纽基因。所识别的枢纽基因包括:KIF2C、CDK1、TPX2、CEP55、MELK、TTK、BUB1、NCAPG、ASPM、KIF11、CCNA2、HMMR、BUB1B、TOP2A、CENPF、KIF20A、NUSAP1、DLGAP5、PBK和CCNB2。此外,GEPIA和Shiny GO分析为枢纽基因的生存率和功能富集研究提供了见解。HPA数据库研究进一步发现,上调基因参与了正常和HCC组织中蛋白质表达的变化。这些发现表明枢纽基因肯定参与了HCC的进展。STITCH数据库研究发现,包括索拉非尼、瑞戈非尼、卡博替尼和乐伐替尼在内的现有药物分子可作为识别新型药物的先导,并且所识别的枢纽基因也可被视为潜在且有前景的药物靶点,因为它们参与了基因-化学相互作用网络。
本研究涉及多种综合生物信息学方法,分析基因表达和蛋白质-蛋白质相互作用数据集,从而识别出参与HCC进展的20个排名靠前的枢纽基因。它们是KIF2C、CDK1、TPX2、CEP55、MELK、TTK、BUB1、NCAPG、ASPM、KIF11、CCNA2、HMMR、BUB1B、TOP2A、CENPF、KIF20A、NUSAP1、DLGAP5、PBK和CCNB2。基因-化学相互作用网络研究发现,包括索拉非尼、瑞戈非尼、卡博替尼和乐伐替尼在内的现有药物分子可作为识别新型药物的先导,并且所识别的枢纽基因可能是有前景的药物靶点。当前研究强调了靶向这些枢纽基因以及利用现有分子生成新分子以有效对抗肝癌的重要性,并且在药物发现研究方面可进一步探索以开发HCC的治疗方法。