Suppr超能文献

一种使用监督算法和临床模型为胰腺癌生成高精度特征的多阶段方法。

A multi-phase approach using supervised algorithms and clinical models to generate high-accuracy signatures for pancreatic cancer.

作者信息

Bararia Akash, Chakraborty Agniswar, Ghosh Gourav, Dastidar Debabrata Ghosh, Mukherjee Sumit, Sikdar Nilabja

机构信息

Human Genetics Unit, Indian Statistical Institute, Kolkata, India.

Department of Computer Science, Jadavpur University, Kolkata, India.

出版信息

Comput Biol Med. 2025 Aug;194:110559. doi: 10.1016/j.compbiomed.2025.110559. Epub 2025 Jun 14.

Abstract

BACKGROUND

The in silico analyses provide evidence supporting the potential of methylation-driven differentially expressed genes as therapeutic targets across cancer types. This leads us to identify novel targets and their associated drug compounds for further progress towards pancreatic cancer treatment.

OBJECTIVE

To identify targeted drugs based on methylation driven genes identified using bulk multi-omics data and single-cell level data to pinpoint important disease markers.

METHODS

The workflow involves screening using the TCGA and ICGC databases, followed by validation with GEO datasets. The study employs supervised learning algorithms like kNN and random forests, and constructs a prediction model using adaptive LASSO-Cox regression. The process also includes pathway analysis, evaluation of survival status, and immune profile deconvolution, as well as multistage evaluation of the methylation driven genes. We conducted drug targeting and molecular dynamic simulations, taking into account genes of interest.Lastly, molecular docking and dynamics simulations were used to find out if the key MEDEGs could be utilized as drug targets.

RESULTS

CD36, UGT1A1, TFF1, S100P, MUC13, CALHM3 and ANKRD44 were found to be top 7 methylation driven genes. The mutational profile was also documented along with pathway analysis, which showed concordance with our observation based on their significant enriched terms namely "Maintenance of Gastrointestinal Epithelium", and "Digestive System Homeostasis". CD36 had prognostic capabilities and was seen to significant in terms of survival and also showed significant immune dysregulation. Our novel findings suggest TFF1, S100P, and MUC13 were found to be associated with cell type specific expression as seen in single cell data and UGT1A1 was found to be suitable for probable drug targeting. CD36, UGT1A1, TFF1, S100P, and MUC13 showed concordance when observed at proteomics level and across other datasets. Apigenin-7-O-glucuronide emerged as the top binder for UDP-glucuronosyltransferase 1A1 (also known as UDP 1A1), forming stable complexes with favourable interactions. Catechin and epicatechin were identified as the best ligands for TFF1 and S100P, while rutin showed high-affinity binding to MUC13.

CONCLUSION

The study successfully identified and validated a panel of biomarkers specific to pancreatic cancer, with potential applications in early diagnosis and treatment. The findings highlight the importance of multi-omics data integration in cancer research and the potential of personalized medicine in improving patient outcomes. The in-silico drug targeting analysis provides a foundation for the development of novel drugs for PanCa treatment. Hence TFF1, S100P, MUC13, and UGT1A1 showcased themselves as most promising biomarkers and novel drug targets.

摘要

背景

计算机模拟分析提供了证据,支持甲基化驱动的差异表达基因作为跨癌症类型治疗靶点的潜力。这促使我们识别新的靶点及其相关药物化合物,以推动胰腺癌治疗取得进一步进展。

目的

基于使用批量多组学数据和单细胞水平数据识别出的甲基化驱动基因,识别靶向药物,以确定重要的疾病标志物。

方法

工作流程包括使用TCGA和ICGC数据库进行筛选,随后用GEO数据集进行验证。该研究采用kNN和随机森林等监督学习算法,并使用自适应LASSO-Cox回归构建预测模型。该过程还包括通路分析、生存状态评估、免疫图谱反卷积,以及对甲基化驱动基因的多阶段评估。我们进行了药物靶向和分子动力学模拟,同时考虑了感兴趣的基因。最后,使用分子对接和动力学模拟来确定关键的甲基化驱动的差异表达基因(MEDEGs)是否可以用作药物靶点。

结果

发现CD36、UGT1A1、TFF1、S100P、MUC13、CALHM3和ANKRD44是前7个甲基化驱动基因。还记录了突变谱并进行了通路分析,结果表明与我们基于其显著富集的术语“胃肠道上皮的维持”和“消化系统稳态”的观察结果一致。CD36具有预后能力,在生存方面具有显著性,并且还显示出显著的免疫失调。我们的新发现表明,如在单细胞数据中所见,TFF1、S100P和MUC13与细胞类型特异性表达相关,并且发现UGT1A1适合作为可能的药物靶点。当在蛋白质组学水平和其他数据集中观察时,CD36、UGT1A1、TFF1、S100P和MUC13表现出一致性。芹菜素-7-O-葡萄糖醛酸苷成为UDP-葡萄糖醛酸基转移酶1A1(也称为UDP 1A1)的最佳结合剂,形成具有良好相互作用的稳定复合物。儿茶素和表儿茶素被确定为TFF1和S100P的最佳配体,而芦丁对MUC13表现出高亲和力结合。

结论

该研究成功识别并验证了一组胰腺癌特异性生物标志物,并在早期诊断和治疗中具有潜在应用。研究结果突出了多组学数据整合在癌症研究中的重要性以及个性化医疗在改善患者预后方面的潜力。计算机模拟药物靶向分析为开发用于胰腺癌治疗的新型药物奠定了基础。因此,TFF1、S100P、MUC13和UGT1A1展现出自己是最有前景的生物标志物和新型药物靶点。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验