Suppr超能文献

基于知识图谱的代谢物-疾病关联识别

Identification of metabolite-disease associations based on knowledge graph.

作者信息

Xiao Fuheng, Huang Canling, Chen Ali, Xiao Wei, Li Zhanchao

机构信息

School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, P.R. China.

Center for Drug Research and Development, Guangdong Provincial Key Laboratory of Advanced Drug Delivery System, Guangdong Pharmaceutical University, Guangzhou, 510006, P.R. China.

出版信息

Metabolomics. 2025 Feb 22;21(2):32. doi: 10.1007/s11306-025-02227-1.

Abstract

BACKGROUND

Despite the insights that metabolite analysis can provide into the onset, development, and progression of diseases-thus offering new concepts and methodologies for prevention, diagnosis, and treatment-traditional wet lab experiments are often time-consuming and labor-intensive. Consequently, this study aimed to develop a machine learning model named COM-RAN, which is based on a knowledge graph and random forest algorithm, to identify potential associations between metabolites and diseases.

METHODS

Firstly, we integrated the known associations between diseases and metabolites. Secondly, we provided a synthesis of the extant data regarding diseases and metabolites, accompanied by supplementary information pertinent to these entities. Thirdly, knowledge graph-based embedded features were used to characterize disease-metabolite associations. Finally, a random forest algorithm was employed to construct a model for identifying potential disease-metabolite associations.

RESULTS

The experimental results demonstrated that the proposed model achieved an Area Under the Receiver Operating Characteristic Curve (AUC) of 0.968 in 5-fold cross-validations, while the Area Under the Precision-Recall Curve (AUPR) was 0.901, outperforming the vast majority of existing prediction methods. The case studies corroborated the majority of the novel associations identified by COM-RAN, thereby further demonstrating the reliability of the current method in predicting the potential relationship between metabolites and diseases.

CONCLUSION

The COM-RAN model demonstrated promise in predicting associations between diseases and metabolites, suggesting that integrating knowledge graphs with machine learning methodologies can significantly improve the accuracy and reliability of predictions related to disease-associated metabolites.

摘要

背景

尽管代谢物分析能够为疾病的发生、发展和进程提供深刻见解,从而为预防、诊断和治疗提供新的概念和方法,但传统的湿实验室实验往往既耗时又费力。因此,本研究旨在开发一种名为COM-RAN的机器学习模型,该模型基于知识图谱和随机森林算法,以识别代谢物与疾病之间的潜在关联。

方法

首先,我们整合了疾病与代谢物之间的已知关联。其次,我们对有关疾病和代谢物的现有数据进行了综合,并附带了与这些实体相关的补充信息。第三,基于知识图谱的嵌入特征被用于表征疾病-代谢物关联。最后,采用随机森林算法构建一个用于识别潜在疾病-代谢物关联的模型。

结果

实验结果表明,所提出的模型在五折交叉验证中实现了受试者工作特征曲线下面积(AUC)为0.968,而精确召回率曲线下面积(AUPR)为0.901,优于绝大多数现有的预测方法。案例研究证实了COM-RAN识别出的大多数新关联,从而进一步证明了当前方法在预测代谢物与疾病之间潜在关系方面的可靠性。

结论

COM-RAN模型在预测疾病与代谢物之间的关联方面显示出前景,这表明将知识图谱与机器学习方法相结合能够显著提高与疾病相关代谢物预测的准确性和可靠性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验