• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

阿尔茨海默病知识图谱增强知识发现与疾病预测。

Alzheimer's Disease Knowledge Graph Enhances Knowledge Discovery and Disease Prediction.

作者信息

Yang Yue, Yu Kaixian, Gao Shan, Yu Sheng, Xiong Di, Qin Chuanyang, Chen Huiyuan, Tang Jiarui, Tang Niansheng, Zhu Hongtu

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill.

Independent Researcher, Shanghai, P.R. China.

出版信息

bioRxiv. 2024 Jul 5:2024.07.03.601339. doi: 10.1101/2024.07.03.601339.

DOI:10.1101/2024.07.03.601339
PMID:39005357
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11245034/
Abstract

BACKGROUND

Alzheimer's disease (AD), a progressive neurodegenerative disorder, continues to increase in prevalence without any effective treatments to date. In this context, knowledge graphs (KGs) have emerged as a pivotal tool in biomedical research, offering new perspectives on drug repurposing and biomarker discovery by analyzing intricate network structures. Our study seeks to build an AD-specific knowledge graph, highlighting interactions among AD, genes, variants, chemicals, drugs, and other diseases. The goal is to shed light on existing treatments, potential targets, and diagnostic methods for AD, thereby aiding in drug repurposing and the identification of biomarkers.

RESULTS

We annotated 800 PubMed abstracts and leveraged GPT-4 for text augmentation to enrich our training data for named entity recognition (NER) and relation classification. A comprehensive data mining model, integrating NER and relationship classification, was trained on the annotated corpus. This model was subsequently applied to extract relation triplets from unannotated abstracts. To enhance entity linking, we utilized a suite of reference biomedical databases and refine the linking accuracy through abbreviation resolution. As a result, we successfully identified 3,199,276 entity mentions and 633,733 triplets, elucidating connections between 5,000 unique entities. These connections were pivotal in constructing a comprehensive Alzheimer's Disease Knowledge Graph (ADKG). We also integrated the ADKG constructed after entity linking with other biomedical databases. The ADKG served as a training ground for Knowledge Graph Embedding models with the high-ranking predicted triplets supported by evidence, underscoring the utility of ADKG in generating testable scientific hypotheses. Further application of ADKG in predictive modeling using the UK Biobank data revealed models based on ADKG outperforming others, as evidenced by higher values in the areas under the receiver operating characteristic (ROC) curves.

CONCLUSION

The ADKG is a valuable resource for generating hypotheses and enhancing predictive models, highlighting its potential to advance AD's disease research and treatment strategies.

摘要

背景

阿尔茨海默病(AD)是一种进行性神经退行性疾病,其患病率持续上升,迄今为止尚无任何有效治疗方法。在此背景下,知识图谱(KGs)已成为生物医学研究中的关键工具,通过分析复杂的网络结构,为药物再利用和生物标志物发现提供了新视角。我们的研究旨在构建一个特定于AD的知识图谱,突出AD、基因、变体、化学物质、药物和其他疾病之间的相互作用。目标是阐明AD的现有治疗方法、潜在靶点和诊断方法,从而有助于药物再利用和生物标志物的识别。

结果

我们注释了800篇PubMed摘要,并利用GPT-4进行文本扩充,以丰富我们用于命名实体识别(NER)和关系分类的训练数据。在注释语料库上训练了一个综合数据挖掘模型,该模型整合了NER和关系分类。随后,该模型被应用于从未注释的摘要中提取关系三元组。为了增强实体链接,我们利用了一套参考生物医学数据库,并通过缩写解析提高链接准确性。结果,我们成功识别了3199276个实体提及和633733个三元组,阐明了5000个独特实体之间的联系。这些联系对于构建全面的阿尔茨海默病知识图谱(ADKG)至关重要。我们还将实体链接后构建的ADKG与其他生物医学数据库进行了整合。ADKG作为知识图谱嵌入模型的训练平台,其预测的三元组排名靠前且有证据支持,突出了ADKG在生成可测试科学假设方面的效用。ADKG在使用英国生物银行数据进行预测建模中的进一步应用表明,基于ADKG的模型优于其他模型,这在受试者操作特征(ROC)曲线下面积的值更高中得到了证明。

结论

ADKG是生成假设和增强预测模型的宝贵资源,突出了其推进AD疾病研究和治疗策略的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/c97f8d285a95/nihpp-2024.07.03.601339v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/08237b554a25/nihpp-2024.07.03.601339v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/8def668f7670/nihpp-2024.07.03.601339v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/e705ebdd502b/nihpp-2024.07.03.601339v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/c97f8d285a95/nihpp-2024.07.03.601339v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/08237b554a25/nihpp-2024.07.03.601339v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/8def668f7670/nihpp-2024.07.03.601339v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/e705ebdd502b/nihpp-2024.07.03.601339v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3792/11245034/c97f8d285a95/nihpp-2024.07.03.601339v1-f0004.jpg

相似文献

1
Alzheimer's Disease Knowledge Graph Enhances Knowledge Discovery and Disease Prediction.阿尔茨海默病知识图谱增强知识发现与疾病预测。
bioRxiv. 2024 Jul 5:2024.07.03.601339. doi: 10.1101/2024.07.03.601339.
2
Alzheimer's disease knowledge graph enhances knowledge discovery and disease prediction.阿尔茨海默病知识图谱增强了知识发现和疾病预测能力。
Comput Biol Med. 2025 Apr 29;192(Pt A):110285. doi: 10.1016/j.compbiomed.2025.110285.
3
Graph embedding-based link prediction for literature-based discovery in Alzheimer's Disease.基于图嵌入的阿尔茨海默病文献发现链路预测。
J Biomed Inform. 2023 Sep;145:104464. doi: 10.1016/j.jbi.2023.104464. Epub 2023 Aug 2.
4
Mining on Alzheimer's diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing.挖掘阿尔茨海默病相关知识图谱以识别潜在的 AD 相关语义三元组,用于药物再利用。
BMC Bioinformatics. 2022 Sep 30;23(Suppl 6):407. doi: 10.1186/s12859-022-04934-1.
5
Deciphering the role of lipid metabolism-related genes in Alzheimer's disease: a machine learning approach integrating Traditional Chinese Medicine.解析脂质代谢相关基因在阿尔茨海默病中的作用:一种整合中医的机器学习方法。
Front Endocrinol (Lausanne). 2024 Oct 23;15:1448119. doi: 10.3389/fendo.2024.1448119. eCollection 2024.
6
KG-Predict: A knowledge graph computational framework for drug repurposing.KG-Predict:一种用于药物重定位的知识图谱计算框架。
J Biomed Inform. 2022 Aug;132:104133. doi: 10.1016/j.jbi.2022.104133. Epub 2022 Jul 12.
7
Repurposing Non-pharmacological Interventions for Alzheimer's Diseases through Link Prediction on Biomedical Literature.通过生物医学文献中的链接预测将非药物干预措施用于阿尔茨海默病的新用途。
medRxiv. 2023 May 21:2023.05.15.23290002. doi: 10.1101/2023.05.15.23290002.
8
Precision Drug Repurposing (PDR): Patient-level modeling and prediction combining foundational knowledge graph with biobank data.精准药物再利用(PDR):结合基础知识图谱与生物样本库数据的患者层面建模与预测
J Biomed Inform. 2025 Mar;163:104786. doi: 10.1016/j.jbi.2025.104786. Epub 2025 Feb 12.
9
TarKG: a comprehensive biomedical knowledge graph for target discovery.TarKG:一个全面的生物医学知识图谱,用于目标发现。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae598.
10
FuseLinker: Leveraging LLM's pre-trained text embeddings and domain knowledge to enhance GNN-based link prediction on biomedical knowledge graphs.FuseLinker:利用大语言模型的预训练文本嵌入和领域知识增强基于图神经网络的生物医学知识图谱的链接预测。
J Biomed Inform. 2024 Oct;158:104730. doi: 10.1016/j.jbi.2024.104730. Epub 2024 Sep 24.

本文引用的文献

1
Plasma proteomic associations with genetics and health in the UK Biobank.英国生物库中血浆蛋白质组与遗传学和健康的关联。
Nature. 2023 Oct;622(7982):329-338. doi: 10.1038/s41586-023-06592-6. Epub 2023 Oct 4.
2
Biomedical knowledge graph learning for drug repurposing by extending guilt-by-association to multiple layers.通过将关联推断扩展到多个层次来进行药物再利用的生物医学知识图学习。
Nat Commun. 2023 Jun 15;14(1):3570. doi: 10.1038/s41467-023-39301-y.
3
Alzheimer's disease drug development pipeline: 2023.2023年阿尔茨海默病药物研发进展
Alzheimers Dement (N Y). 2023 May 25;9(2):e12385. doi: 10.1002/trc2.12385. eCollection 2023 Apr-Jun.
4
2023 Alzheimer's disease facts and figures.2023 年阿尔茨海默病事实和数据。
Alzheimers Dement. 2023 Apr;19(4):1598-1695. doi: 10.1002/alz.13016. Epub 2023 Mar 14.
5
Modeling the enigma of complex disease etiology.模拟复杂疾病病因的谜团。
J Transl Med. 2023 Feb 25;21(1):148. doi: 10.1186/s12967-023-03987-x.
6
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
7
The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest.2023 年的 STRING 数据库:针对任何感兴趣的测序基因组的蛋白质-蛋白质关联网络和功能富集分析。
Nucleic Acids Res. 2023 Jan 6;51(D1):D638-D646. doi: 10.1093/nar/gkac1000.
8
The Alzheimer's Cell Atlas (TACA): A single-cell molecular map for translational therapeutics accelerator in Alzheimer's disease.阿尔茨海默病细胞图谱(TACA):阿尔茨海默病转化治疗加速器的单细胞分子图谱。
Alzheimers Dement (N Y). 2022 Oct 13;8(1):e12350. doi: 10.1002/trc2.12350. eCollection 2022.
9
Mining on Alzheimer's diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing.挖掘阿尔茨海默病相关知识图谱以识别潜在的 AD 相关语义三元组,用于药物再利用。
BMC Bioinformatics. 2022 Sep 30;23(Suppl 6):407. doi: 10.1186/s12859-022-04934-1.
10
Multimodal reasoning based on knowledge graph embedding for specific diseases.基于知识图嵌入的特定疾病的多模态推理。
Bioinformatics. 2022 Apr 12;38(8):2235-2245. doi: 10.1093/bioinformatics/btac085.