Suppr超能文献

基于药物遗传变异的英国生物银行中药物使用预测的图表示学习

Graph Representation Learning for the Prediction of Medication Usage in the UK Biobank Based on Pharmacogenetic Variants.

作者信息

Qi Bill, Trakadis Yannis J

机构信息

Department of Human Genetics, McGill University, Montreal, QC H3A 0C7, Canada.

Department of Medical Genetics, McGill University Health Center, Montreal, QC H4A 3J1, Canada.

出版信息

Bioengineering (Basel). 2025 May 31;12(6):595. doi: 10.3390/bioengineering12060595.

Abstract

Ineffective treatment and side effects are associated with high burdens for the patient and society. We investigated the application of graph representation learning (GRL) for predicting medication usage based on individual genetic data in the United Kingdom Biobank (UKBB). A graph convolutional network (GCN) was used to integrate interconnected biomedical entities in the form of a knowledge graph as part of a machine learning (ML) prediction model. Data from The Pharmacogenomics Knowledgebase (PharmGKB) was used to construct a biomedical knowledge graph. Individual genetic data ( = 485,754) from the UKBB was obtained and preprocessed to match with pharmacogenetic variants in the PharmGKB. Self-reported medication usage labels were obtained from UKBB data field 20003. We hypothesize that pharmacogenetic variants can predict the impact of medications on individuals. We assume that an individual using a medication on a regular basis experiences a net benefit (vs. side-effects) from the medication. ML models were trained to predict medication usage for 264 medications. The GCN model significantly outperformed both a baseline logistic regression model (-value: 1.53 × 10) and a deep neural network model (-value: 8.68 × 10). The GCN model also significantly outperformed a GCN model trained using a random graph (GCN-random) (-value: 5.44 × 10). A consistent trend of medications with higher sample sizes having better performance was observed, and for several medications, a high relative rank of the medication (among multiple medications) was associated with greater than 2-fold higher odds of usage of the medication. In conclusion, a graph-based ML approach could be useful in advancing precision medicine by prioritizing medications that a patient may need based on their genetic data. However, further research is needed to improve the quality and quantity of genetic data and to validate our approach using more reliable medication labels.

摘要

无效治疗和副作用给患者和社会带来了沉重负担。我们研究了图表示学习(GRL)在英国生物银行(UKBB)中基于个体遗传数据预测药物使用情况的应用。图卷积网络(GCN)被用于以知识图谱的形式整合相互关联的生物医学实体,作为机器学习(ML)预测模型的一部分。来自药物基因组学知识库(PharmGKB)的数据被用于构建生物医学知识图谱。获取了UKBB中的个体遗传数据(n = 485,754)并进行预处理,以与PharmGKB中的药物遗传变异相匹配。自我报告的药物使用标签来自UKBB数据字段20003。我们假设药物遗传变异可以预测药物对个体产生的影响。我们假定定期使用某种药物的个体从该药物中获得净益处(相对于副作用)。训练ML模型以预测264种药物的使用情况。GCN模型显著优于基线逻辑回归模型(p值:1.53×10)和深度神经网络模型(p值:8.68×10)。GCN模型也显著优于使用随机图训练的GCN模型(GCN - 随机)(p值:5.44×10)。观察到样本量较大的药物具有更好性能的一致趋势,并且对于几种药物,药物的高相对排名(在多种药物中)与该药物使用几率高出2倍以上相关。总之,基于图的ML方法可能有助于通过根据患者的遗传数据优先选择患者可能需要的药物来推进精准医学。然而,需要进一步研究以提高遗传数据的质量和数量,并使用更可靠的药物标签来验证我们的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea7d/12189576/f48b88bcde5d/bioengineering-12-00595-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验