• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用生物医学知识图谱中的语义模式预测治疗和因果关系。

Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations.

机构信息

Department of Computer Science, University of Kentucky, United States.

Division of Hospital Medicine, Department of Internal Medicine, University of Kentucky, United States.

出版信息

J Biomed Inform. 2018 Jun;82:189-199. doi: 10.1016/j.jbi.2018.05.003. Epub 2018 May 12.

DOI:10.1016/j.jbi.2018.05.003
PMID:29763706
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6070294/
Abstract

BACKGROUND

Identifying new potential treatment options for medical conditions that cause human disease burden is a central task of biomedical research. Since all candidate drugs cannot be tested with animal and clinical trials, in vitro approaches are first attempted to identify promising candidates. Likewise, identifying different causal relations between biomedical entities is also critical to understand biomedical processes. Generally, natural language processing (NLP) and machine learning are used to predict specific relations between any given pair of entities using the distant supervision approach.

OBJECTIVE

To build high accuracy supervised predictive models to predict previously unknown treatment and causative relations between biomedical entities based only on semantic graph pattern features extracted from biomedical knowledge graphs.

METHODS

We used 7000 treats and 2918 causes hand-curated relations from the UMLS Metathesaurus to train and test our models. Our graph pattern features are extracted from simple paths connecting biomedical entities in the SemMedDB graph (based on the well-known SemMedDB database made available by the U.S. National Library of Medicine). Using these graph patterns connecting biomedical entities as features of logistic regression and decision tree models, we computed mean performance measures (precision, recall, F-score) over 100 distinct 80-20% train-test splits of the datasets. For all experiments, we used a positive:negative class imbalance of 1:10 in the test set to model relatively more realistic scenarios.

RESULTS

Our models predict treats and causes relations with high F-scores of 99% and 90% respectively. Logistic regression model coefficients also help us identify highly discriminative patterns that have an intuitive interpretation. We are also able to predict some new plausible relations based on false positives that our models scored highly based on our collaborations with two physician co-authors. Finally, our decision tree models are able to retrieve over 50% of treatment relations from a recently created external dataset.

CONCLUSIONS

We employed semantic graph patterns connecting pairs of candidate biomedical entities in a knowledge graph as features to predict treatment/causative relations between them. We provide what we believe is the first evidence in direct prediction of biomedical relations based on graph features. Our work complements lexical pattern based approaches in that the graph patterns can be used as additional features for weakly supervised relation prediction.

摘要

背景

为了减轻人类疾病负担,寻找新的潜在治疗方法是生物医学研究的核心任务。由于无法对所有候选药物进行动物和临床试验,因此首先尝试采用体外方法来确定有希望的候选药物。同样,确定生物医学实体之间的不同因果关系对于理解生物医学过程也至关重要。通常,使用自然语言处理 (NLP) 和机器学习通过远距离监督方法来预测给定的任意两个实体之间的特定关系。

目的

仅基于从生物医学知识图中提取的语义图模式特征,构建高精度监督预测模型,以预测生物医学实体之间以前未知的治疗和因果关系。

方法

我们使用 UMLS Metathesaurus 中的 7000 种治疗方法和 2918 种因果关系来训练和测试我们的模型。我们的图模式特征是从 SemMedDB 图中连接生物医学实体的简单路径中提取的(基于美国国立医学图书馆提供的著名 SemMedDB 数据库)。使用这些连接生物医学实体的图模式作为逻辑回归和决策树模型的特征,我们在数据集的 100 个不同的 80-20%训练-测试分割上计算了平均性能指标(精度、召回率、F 分数)。对于所有实验,我们在测试集中使用阳性:阴性类不平衡率为 1:10,以模拟更现实的情况。

结果

我们的模型分别以 99%和 90%的高 F 分数预测治疗和因果关系。逻辑回归模型系数还帮助我们识别了具有直观解释的高度区分模式。根据我们与两位医师合著者的合作,我们还能够根据模型的高分预测一些新的合理关系。最后,我们的决策树模型能够从最近创建的外部数据集中检索超过 50%的治疗关系。

结论

我们在知识图中使用连接候选生物医学实体对的语义图模式作为特征来预测它们之间的治疗/因果关系。我们提供了基于图特征直接预测生物医学关系的第一个证据。我们的工作补充了基于词汇模式的方法,因为图模式可以用作弱监督关系预测的附加特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/780962052755/nihms982571f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/7de55b358497/nihms982571f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/afb54afc7818/nihms982571f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/2cc680babf9e/nihms982571f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/780962052755/nihms982571f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/7de55b358497/nihms982571f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/afb54afc7818/nihms982571f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/2cc680babf9e/nihms982571f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1378/6070294/780962052755/nihms982571f4.jpg

相似文献

1
Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations.利用生物医学知识图谱中的语义模式预测治疗和因果关系。
J Biomed Inform. 2018 Jun;82:189-199. doi: 10.1016/j.jbi.2018.05.003. Epub 2018 May 12.
2
Predicting Treatment Relations with Semantic Patterns over Biomedical Knowledge Graphs.基于生物医学知识图谱的语义模式预测治疗关系
Min Intell Knowl Explor (2015). 2015 Dec;9468:586-596. doi: 10.1007/978-3-319-26832-3_55. Epub 2016 Jan 3.
3
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
4
A novel framework for biomedical entity sense induction.一种用于生物医学实体感知归纳的新框架。
J Biomed Inform. 2018 Aug;84:31-41. doi: 10.1016/j.jbi.2018.06.007. Epub 2018 Jun 20.
5
Broad-coverage biomedical relation extraction with SemRep.基于 SemRep 的广谱生物医学关系抽取。
BMC Bioinformatics. 2020 May 14;21(1):188. doi: 10.1186/s12859-020-3517-7.
6
Explaining protein-protein interactions with knowledge graph-based semantic similarity.用基于知识图的语义相似度解释蛋白质-蛋白质相互作用。
Comput Biol Med. 2024 Mar;170:108076. doi: 10.1016/j.compbiomed.2024.108076. Epub 2024 Feb 1.
7
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion.基于路径的知识推理与文本语义信息融合的医疗知识图谱补全方法
BMC Med Inform Decis Mak. 2021 Nov 29;21(Suppl 9):335. doi: 10.1186/s12911-021-01622-7.
8
Constructing a Graph Database for Semantic Literature-Based Discovery.构建用于基于语义文献发现的图形数据库。
Stud Health Technol Inform. 2015;216:1094.
9
Extraction of semantic biomedical relations from text using conditional random fields.使用条件随机场从文本中提取语义生物医学关系。
BMC Bioinformatics. 2008 Apr 23;9:207. doi: 10.1186/1471-2105-9-207.
10
Discovering biomedical semantic relations in PubMed queries for information retrieval and database curation.在PubMed查询中发现生物医学语义关系以进行信息检索和数据库管理。
Database (Oxford). 2016 Mar 25;2016. doi: 10.1093/database/baw025. Print 2016.

引用本文的文献

1
How important is domain-specific language model pretraining and instruction finetuning for biomedical relation extraction?特定领域语言模型预训练和指令微调对生物医学关系抽取有多重要?
Nat Lang Process Inf Syst. 2026;15836:80-94. doi: 10.1007/978-3-031-97141-9_6. Epub 2025 Jul 1.
2
Deep learning approach for automated hMPV classification.用于hMPV自动分类的深度学习方法。
Sci Rep. 2025 Aug 8;15(1):29068. doi: 10.1038/s41598-025-14467-1.
3
Validation approaches for computational drug repurposing: a review.计算药物重新利用的验证方法:综述

本文引用的文献

1
Predicting Treatment Relations with Semantic Patterns over Biomedical Knowledge Graphs.基于生物医学知识图谱的语义模式预测治疗关系
Min Intell Knowl Explor (2015). 2015 Dec;9468:586-596. doi: 10.1007/978-3-319-26832-3_55. Epub 2016 Jan 3.
2
Predicting mental conditions based on "history of present illness" in psychiatric notes with deep neural networks.基于精神科病历中的“现病史”用深度神经网络预测精神状况。
J Biomed Inform. 2017 Nov;75S:S138-S148. doi: 10.1016/j.jbi.2017.06.010. Epub 2017 Jun 10.
3
A standard database for drug repositioning.
AMIA Annu Symp Proc. 2024 Jan 11;2023:559-568. eCollection 2023.
4
From language models to large-scale food and biomedical knowledge graphs.从语言模型到大尺度的食品和生物医学知识图谱。
Sci Rep. 2023 May 15;13(1):7815. doi: 10.1038/s41598-023-34981-4.
5
Exploring relationship between emotion and probiotics with knowledge graphs.利用知识图谱探索情绪与益生菌之间的关系。
Health Inf Sci Syst. 2022 Sep 10;10(1):27. doi: 10.1007/s13755-022-00179-7. eCollection 2022 Dec.
6
AnthraxKP: a knowledge graph-based, Anthrax Knowledge Portal mined from biomedical literature.炭疽病知识图谱:基于知识图谱的炭疽病知识库,从生物医学文献中挖掘而来。
Database (Oxford). 2022 Jun 2;2022. doi: 10.1093/database/baac037.
7
Identifying Cases of Shoulder Injury Related to Vaccine Administration (SIRVA) in the United States: Development and Validation of a Natural Language Processing Method.美国疫苗接种相关肩部损伤(SIRVA)病例的识别:自然语言处理方法的开发和验证。
JMIR Public Health Surveill. 2022 May 24;8(5):e30426. doi: 10.2196/30426.
8
Adverse Drug Event Prediction Using Noisy Literature-Derived Knowledge Graphs: Algorithm Development and Validation.使用有噪声的文献衍生知识图谱进行药物不良事件预测:算法开发与验证
JMIR Med Inform. 2021 Oct 25;9(10):e32730. doi: 10.2196/32730.
9
Catalyzing Knowledge-Driven Discovery in Environmental Health Sciences through a Community-Driven Harmonized Language.通过社区驱动的协调语言推动环境健康科学的知识驱动发现。
Int J Environ Res Public Health. 2021 Aug 26;18(17):8985. doi: 10.3390/ijerph18178985.
10
Knowledge-Based Biomedical Data Science.基于知识的生物医学数据科学
Annu Rev Biomed Data Sci. 2020 Jul;3:23-41. doi: 10.1146/annurev-biodatasci-010820-091627. Epub 2020 Apr 7.
一个药物重定位的标准数据库。
Sci Data. 2017 Mar 14;4:170029. doi: 10.1038/sdata.2017.29.
4
DrugCentral: online drug compendium.药物中枢:在线药物汇编。
Nucleic Acids Res. 2017 Jan 4;45(D1):D932-D939. doi: 10.1093/nar/gkw993. Epub 2016 Oct 26.
5
Bridging semantics and syntax with graph algorithms-state-of-the-art of extracting biomedical relations.用图算法弥合语义与句法——提取生物医学关系的研究现状
Brief Bioinform. 2017 Jan;18(1):160-178. doi: 10.1093/bib/bbw001. Epub 2016 Feb 5.
6
A survey of current trends in computational drug repositioning.计算药物重新定位的当前趋势调查。
Brief Bioinform. 2016 Jan;17(1):2-12. doi: 10.1093/bib/bbv020. Epub 2015 Mar 31.
7
Extracting drug-drug interactions from literature using a rich feature-based linear kernel approach.使用基于丰富特征的线性核方法从文献中提取药物相互作用。
J Biomed Inform. 2015 Jun;55:23-30. doi: 10.1016/j.jbi.2015.03.002. Epub 2015 Mar 19.
8
Context-driven automatic subgraph creation for literature-based discovery.用于基于文献的发现的上下文驱动自动子图创建
J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.
9
Exploiting Literature-derived Knowledge and Semantics to Identify Potential Prostate Cancer Drugs.利用文献衍生知识和语义学来识别潜在的前列腺癌药物。
Cancer Inform. 2014 Oct 14;13(Suppl 1):103-11. doi: 10.4137/CIN.S13889. eCollection 2014.
10
Predicting high-throughput screening results with scalable literature-based discovery methods.用可扩展的基于文献的发现方法预测高通量筛选结果。
CPT Pharmacometrics Syst Pharmacol. 2014 Oct 8;3(10):e140. doi: 10.1038/psp.2014.37.