从生物医学文本中推断药物-蛋白质-副作用关系。

Inferring Drug-Protein⁻Side Effect Relationships from Biomedical Text.

机构信息

Department of Library and Information Science, Yonsei University, Seoul 03722, Korea.

Institute of Convergence, Yonsei University, Seoul 03722, Korea.

出版信息

Genes (Basel). 2019 Feb 19;10(2):159. doi: 10.3390/genes10020159.

DOI:10.3390/genes10020159

PMID:30791472

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6409686/

Abstract

BACKGROUND

Although there are many studies of drugs and their side effects, the underlying mechanisms of these side effects are not well understood. It is also difficult to understand the specific pathways between drugs and side effects.

OBJECTIVE

The present study seeks to construct putative paths between drugs and their side effects by applying text-mining techniques to free text of biomedical studies, and to develop ranking metrics that could identify the most-likely paths.

MATERIALS AND METHODS

We extracted three types of relationships-drug-protein, protein-protein, and protein⁻side effect-from biomedical texts by using text mining and predefined relation-extraction rules. Based on the extracted relationships, we constructed whole drug-protein⁻side effect paths. For each path, we calculated its ranking score by a new ranking function that combines corpus- and ontology-based semantic similarity as well as co-occurrence frequency.

RESULTS

We extracted 13 plausible biomedical paths connecting drugs and their side effects from cancer-related abstracts in the PubMed database. The top 20 paths were examined, and the proposed ranking function outperformed the other methods tested, including co-occurrence, COALS, and UMLS by P@5-P@20. In addition, we confirmed that the paths are novel hypotheses that are worth investigating further.

DISCUSSION

The risk of side effects has been an important issue for the US Food and Drug Administration (FDA). However, the causes and mechanisms of such side effects have not been fully elucidated. This study extends previous research on understanding drug side effects by using various techniques such as Named Entity Recognition (NER), Relation Extraction (RE), and semantic similarity.

CONCLUSION

It is not easy to reveal the biomedical mechanisms of side effects due to a huge number of possible paths. However, we automatically generated predictable paths using the proposed approach, which could provide meaningful information to biomedical researchers to generate plausible hypotheses for the understanding of such mechanisms.

摘要

背景

尽管有许多关于药物及其副作用的研究，但这些副作用的潜在机制仍未得到很好的理解。也很难理解药物和副作用之间的具体途径。

目的

本研究通过应用文本挖掘技术从生物医学研究的自由文本中提取药物与其副作用之间的可能路径，并开发排名指标来识别最有可能的路径。

材料和方法

我们通过文本挖掘和预定义的关系提取规则从生物医学文本中提取了药物-蛋白、蛋白-蛋白和蛋白-副作用三种关系。基于提取的关系，我们构建了完整的药物-蛋白-副作用路径。对于每条路径，我们通过一种新的排名函数计算其排名得分，该函数结合了语料库和本体论的语义相似性以及共现频率。

结果

我们从 PubMed 数据库中癌症相关摘要中提取了 13 条连接药物及其副作用的合理生物医学路径。对前 20 条路径进行了检查，提出的排名函数优于其他测试方法，包括共现、COALS 和 UMLS 在 P@5-P@20 中的表现。此外，我们还证实这些路径是值得进一步研究的新假设。

讨论

副作用的风险一直是美国食品和药物管理局（FDA）的一个重要问题。然而，这些副作用的原因和机制尚未完全阐明。本研究通过使用命名实体识别（NER）、关系提取（RE）和语义相似性等各种技术，扩展了以前关于理解药物副作用的研究。

结论

由于可能的路径数量众多，揭示副作用的生物医学机制并不容易。然而，我们使用提出的方法自动生成可预测的路径，这可以为生物医学研究人员提供有意义的信息，以生成对这些机制的理解的合理假设。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce5f/6409686/3e2d9fad6c32/genes-10-00159-g001.jpg

相似文献

Inferring Drug-Protein⁻Side Effect Relationships from Biomedical Text.

Genes (Basel). 2019 Feb 19;10(2):159. doi: 10.3390/genes10020159.

BertSRC: transformer-based semantic relation classification.

BMC Med Inform Decis Mak. 2022 Sep 6;22(1):234. doi: 10.1186/s12911-022-01977-5.

Linking entities through an ontology using word embeddings and syntactic re-ranking.

BMC Bioinformatics. 2019 Mar 27;20(1):156. doi: 10.1186/s12859-019-2678-8.

Large-Scale Biomedical Relation Extraction Across Diverse Relation Types: Model Development and Usability Study on COVID-19.

J Med Internet Res. 2023 Sep 20;25:e48115. doi: 10.2196/48115.

An adverse drug effect mentions extraction method based on weighted online recurrent extreme learning machine.

Comput Methods Programs Biomed. 2019 Jul;176:33-41. doi: 10.1016/j.cmpb.2019.04.029. Epub 2019 Apr 30.

Leveraging graph topology and semantic context for pharmacovigilance through twitter-streams.

BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):335. doi: 10.1186/s12859-016-1220-5.

Large-scale automatic extraction of side effects associated with targeted anticancer drugs from full-text oncological articles.

J Biomed Inform. 2015 Jun;55:64-72. doi: 10.1016/j.jbi.2015.03.009. Epub 2015 Mar 27.

Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer's disease.

BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):240. doi: 10.1186/s12911-019-0934-5.

miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases.

J Biomed Semantics. 2016 Apr 29;7(1):9. doi: 10.1186/s13326-015-0044-y.

PISTON: Predicting drug indications and side effects using topic modeling and natural language processing.

J Biomed Inform. 2018 Nov;87:96-107. doi: 10.1016/j.jbi.2018.09.015. Epub 2018 Sep 27.

引用本文的文献

Better understanding the phenotypic effects of drugs through shared targets in genetic disease networks.

Front Pharmacol. 2025 Jan 22;15:1470931. doi: 10.3389/fphar.2024.1470931. eCollection 2024.

本文引用的文献

PKDE4J: Entity and relation extraction for public knowledge discovery.

J Biomed Inform. 2015 Oct;57:320-32. doi: 10.1016/j.jbi.2015.08.008. Epub 2015 Aug 12.

Automatic construction of a large-scale and accurate drug-side-effect association knowledge base from biomedical literature.

J Biomed Inform. 2014 Oct;51:191-9. doi: 10.1016/j.jbi.2014.05.013. Epub 2014 Jun 10.

NF-kappaB mediated transcriptional repression of acid modifying hormone gastrin.

PLoS One. 2013 Aug 23;8(8):e73409. doi: 10.1371/journal.pone.0073409. eCollection 2013.

Text-mining solutions for biomedical research: enabling integrative biology.

Nat Rev Genet. 2012 Dec;13(12):829-39. doi: 10.1038/nrg3337. Epub 2012 Nov 14.

p38 MAPK in myeloma cells regulates osteoclast and osteoblast activity and induces bone destruction.

Cancer Res. 2012 Dec 15;72(24):6393-402. doi: 10.1158/0008-5472.CAN-12-2664. Epub 2012 Oct 11.

BMC Bioinformatics. 2012 Oct 10;13:261. doi: 10.1186/1471-2105-13-261.

Assessing drug target association using semantic linked data.

PLoS Comput Biol. 2012;8(7):e1002574. doi: 10.1371/journal.pcbi.1002574. Epub 2012 Jul 5.

Large-scale prediction and testing of drug activity on side-effect targets.

Nature. 2012 Jun 10;486(7403):361-7. doi: 10.1038/nature11159.

Identification of colorectal cancer related genes with mRMR and shortest path in protein-protein interaction network.

PLoS One. 2012;7(4):e33393. doi: 10.1371/journal.pone.0033393. Epub 2012 Apr 4.

Pathway analysis of genomic data: concepts, methods, and prospects for future development.

Trends Genet. 2012 Jul;28(7):323-32. doi: 10.1016/j.tig.2012.03.004. Epub 2012 Apr 3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从生物医学文本中推断药物-蛋白质-副作用关系。

Inferring Drug-Protein⁻Side Effect Relationships from Biomedical Text.

机构信息

出版信息

BACKGROUND

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

背景

目的

材料和方法

结果

讨论

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献