Suppr超能文献

DRML-Ensemble:基于多层集成特征构建的药物重定位方法。

DRML-Ensemble: drug repurposing method based on feature construction of multi-layer ensemble.

机构信息

School of Medicine, Shanghai University, Shanghai, 200444, China.

School of Computer Engineering and Science, Shanghai University, Shanghai, 200444, China.

出版信息

J Mol Model. 2024 Jul 31;30(8):296. doi: 10.1007/s00894-024-06087-9.

Abstract

CONTEXT

Computational drug repurposing methods have been continuously developed in recent years to alleviate the high costs associated with drug development. As drug targets or the products of disease-related genes, proteins play an important role in drug repurposing. Although the potential has been demonstrated, heterogeneous graphs with proteins as independent nodes have yet to be studied, where extracting high-quality protein features from heterogeneous graphs poses a significant challenge. A novel drug repurposing model based on the feature construction of multi-layer ensemble (DRML-Ensemble) is proposed in this study. The performance of DRML-Ensemble, as evaluated on publicly available datasets, achieves an AUPR value of 0.93 and an AUROC value of 0.92, surpassing those of existing state-of-the-art methods. Additionally, DRML-Ensemble demonstrates its notable ability for drug repurposing in Alzheimer's disease.

METHODS

DRML-Ensemble is primarily composed of multiple layers of heterogeneous graph feature construction (HGFC). Each HGFC can extract protein features by leveraging the relationships between drugs, diseases, and proteins. These protein features are then utilized in subsequent layers to build drug and disease features, facilitating drug repurposing. By stacking multiple layers, optimal protein features can be obtained from the heterogeneous graph, consequently improving the accuracy of drug repurposing. However, an excessive· stacking of layers usually affect the model's training process, for example, causing problems such as overfitting; a multi-layer ensemble prediction module is designed to further improve the model's performance.

摘要

背景

近年来,计算药物再利用方法不断发展,以减轻与药物开发相关的高昂成本。作为药物靶点或疾病相关基因的产物,蛋白质在药物再利用中起着重要作用。尽管已经证明了其潜力,但尚未研究具有蛋白质作为独立节点的异质图,从异质图中提取高质量的蛋白质特征是一个重大挑战。本研究提出了一种基于多层集成特征构建的新型药物再利用模型(DRML-Ensemble)。在公开数据集上进行评估时,DRML-Ensemble 的性能达到了 AUPR 值 0.93 和 AUROC 值 0.92,超过了现有最先进方法的性能。此外,DRML-Ensemble 在阿尔茨海默病的药物再利用中表现出了显著的能力。

方法

DRML-Ensemble 主要由多层异质图特征构建(HGFC)组成。每个 HGFC 都可以通过利用药物、疾病和蛋白质之间的关系来提取蛋白质特征。然后,这些蛋白质特征被用于后续层以构建药物和疾病特征,从而促进药物再利用。通过堆叠多个层,可以从异质图中获得最佳的蛋白质特征,从而提高药物再利用的准确性。然而,过多的堆叠层通常会影响模型的训练过程,例如导致过拟合等问题;因此,设计了一个多层集成预测模块来进一步提高模型的性能。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验