解决药理学文献中药物-药物相互作用提取的回指问题。

Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents.

机构信息

Computer Science Department, University Carlos III of Madrid, Leganés, 28921, Spain.

出版信息

BMC Bioinformatics. 2010 Apr 16;11 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-11-S2-S1.

DOI:10.1186/1471-2105-11-S2-S1

PMID:20406499

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3288782/

Abstract

BACKGROUND

Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent references to previous entities in the discourse, a phenomenon known as 'anaphora'. DrugNerAR, a drug anaphora resolution system is presented to address the problem of co-referring expressions in pharmacological literature. This development is part of a larger and innovative study about automatic drug-drug interaction extraction.

METHODS

The system uses a set of linguistic rules drawn by Centering Theory over the analysis provided by a biomedical syntactic parser. Semantic information provided by the Unified Medical Language System (UMLS) is also integrated in order to improve the recognition and the resolution of nominal drug anaphors. Besides, a corpus has been developed in order to analyze the phenomena and evaluate the current approach. Each possible case of anaphoric expression was looked into to determine the most effective way of resolution.

RESULTS

An F-score of 0.76 in anaphora resolution was achieved, outperforming significantly the baseline by almost 73%. This ad-hoc reference line was developed to check the results as there is no previous work on anaphora resolution in pharmacological documents. The obtained results resemble those found in related-semantic domains.

CONCLUSIONS

The present approach shows very promising results in the challenge of accounting for anaphoric expressions in pharmacological texts. DrugNerAr obtains similar results to other approaches dealing with anaphora resolution in the biomedical domain, but, unlike these approaches, it focuses on documents reflecting drug interactions. The Centering Theory has proved being effective at the selection of antecedents in anaphora resolution. A key component in the success of this framework is the analysis provided by the MMTx program and the DrugNer system that allows to deal with the complexity of the pharmacological language. It is expected that the positive results of the resolver increases performance of our future drug-drug interaction extraction system.

摘要

背景

药物-药物相互作用在日益增多的生物医学文献中经常被报道。信息提取 (IE) 技术已被设计为管理这种知识的有用工具。然而，由于话语中经常引用先前的实体，句子级别的 IE 效果有限，这种现象称为“回指”。为了解决药理学文献中共同引用表达式的问题，提出了一种药物回指解析系统 DrugNerAR。这项开发是关于自动药物-药物相互作用提取的更大创新研究的一部分。

方法

该系统使用一组基于中心理论的语言规则，对生物医学句法分析器提供的分析进行处理。还集成了统一医学语言系统 (UMLS) 的语义信息，以提高对名词药物回指的识别和解析。此外，还开发了一个语料库来分析现象并评估当前方法。对于每个可能的回指表达案例，都进行了研究，以确定最有效的解析方法。

结果

回指解析的 F 分数达到 0.76，比基线高出近 73%。由于没有以前在药理学文献中解决回指问题的工作，因此开发了这个特定的参考线来检查结果。获得的结果与在相关语义领域中找到的结果相似。

结论

在解决药理学文本中回指表达的问题时，当前方法显示出非常有前途的结果。DrugNerAr 在处理生物医学领域中的回指解析方面取得了与其他方法相似的结果，但与这些方法不同的是，它专注于反映药物相互作用的文档。中心理论已被证明在回指解析中选择先行词方面是有效的。该框架成功的关键组成部分是 MMTx 程序和 DrugNer 系统提供的分析，这使得我们能够处理药理学语言的复杂性。预计解析器的积极结果将提高我们未来药物-药物相互作用提取系统的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c72/3288782/81e2384fd4c2/1471-2105-11-S2-S1-1.jpg

相似文献

Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents.解决药理学文献中药物-药物相互作用提取的回指问题。

BMC Bioinformatics. 2010 Apr 16;11 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-11-S2-S1.

Sortal anaphora resolution to enhance relation extraction from biomedical literature.用于增强从生物医学文献中提取关系的类别指代消解。

BMC Bioinformatics. 2016 Apr 14;17:163. doi: 10.1186/s12859-016-1009-6.

Anaphoric reference in clinical reports: characteristics of an annotated corpus.临床报告中的照应关系：标注语料库的特点。

J Biomed Inform. 2012 Jun;45(3):507-21. doi: 10.1016/j.jbi.2012.01.010. Epub 2012 Feb 9.

Automatic identification and classification of noun argument structures in biomedical literature.生物医学文献中名词论元结构的自动识别与分类。

IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1639-48. doi: 10.1109/TCBB.2012.111.

A categorical analysis of coreference resolution errors in biomedical texts.生物医学文本中指代消解错误的分类分析。

J Biomed Inform. 2016 Apr;60:309-18. doi: 10.1016/j.jbi.2016.02.015. Epub 2016 Feb 27.

Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.科罗拉多生物医学期刊文章丰富注释全文（CRAFT）语料库中的共指标注与消解

BMC Bioinformatics. 2017 Aug 17;18(1):372. doi: 10.1186/s12859-017-1775-9.

BelSmile: a biomedical semantic role labeling approach for extracting biological expression language from text.BelSmile：一种用于从文本中提取生物表达语言的生物医学语义角色标注方法。

Database (Oxford). 2016 May 12;2016. doi: 10.1093/database/baw064. Print 2016.

The contribution of co-reference resolution to supervised relation detection between bacteria and biotopes entities.共指消解对细菌与生物栖息地实体之间监督关系检测的贡献。

BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S6. doi: 10.1186/1471-2105-16-S10-S6. Epub 2015 Jul 13.

A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set.一种使用混合特征集的生物医学文本关系提取框架。

Comput Math Methods Med. 2015;2015:910423. doi: 10.1155/2015/910423. Epub 2015 Aug 10.

The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.自然语言处理中领域知识与语言结构的相互作用：解读生物医学文本中的上位命题

J Biomed Inform. 2003 Dec;36(6):462-77. doi: 10.1016/j.jbi.2003.11.003.

引用本文的文献

Opportunities and challenges for ChatGPT and large language models in biomedicine and health.ChatGPT 和大型语言模型在生物医学和健康领域的机遇与挑战。

Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad493.

Relation Extraction in Biomedical Texts Based on Multi-Head Attention Model With Syntactic Dependency Feature: Modeling Study.基于具有句法依存特征的多头注意力模型的生物医学文本关系抽取：建模研究

JMIR Med Inform. 2022 Oct 20;10(10):e41136. doi: 10.2196/41136.

Computational Advances in Drug Safety: Systematic and Mapping Review of Knowledge Engineering Based Approaches.药物安全性的计算进展：基于知识工程方法的系统综述与图谱综述

Front Pharmacol. 2019 May 17;10:415. doi: 10.3389/fphar.2019.00415. eCollection 2019.

Detection of drug-drug interactions through data mining studies using clinical sources, scientific literature and social media.通过临床来源、科学文献和社交媒体的数据挖掘研究来检测药物-药物相互作用。

Brief Bioinform. 2018 Sep 28;19(5):863-877. doi: 10.1093/bib/bbx010.

Sortal anaphora resolution to enhance relation extraction from biomedical literature.用于增强从生物医学文献中提取关系的类别指代消解。

BMC Bioinformatics. 2016 Apr 14;17:163. doi: 10.1186/s12859-016-1009-6.

Bio-SCoRes: A Smorgasbord Architecture for Coreference Resolution in Biomedical Text.生物共指消解评分系统（Bio-SCoRes）：一种用于生物医学文本共指消解的混合架构

PLoS One. 2016 Mar 2;11(3):e0148538. doi: 10.1371/journal.pone.0148538. eCollection 2016.

Extraction of pharmacokinetic evidence of drug-drug interactions from the literature.从文献中提取药物相互作用的药代动力学证据。

PLoS One. 2015 May 11;10(5):e0122199. doi: 10.1371/journal.pone.0122199. eCollection 2015.

CheNER: a tool for the identification of chemical entities and their classes in biomedical literature.CheNER：一个用于在生物医学文献中识别化学实体及其类别的工具。

J Cheminform. 2015 Jan 19;7(Suppl 1 Text mining for chemistry and the CHEMDNER track):S15. doi: 10.1186/1758-2946-7-S1-S15. eCollection 2015.

Open issues in intelligent personal health record--an updated status report for 2012.智能个人健康记录中的未决问题——2012年最新状况报告

J Med Syst. 2013 Jun;37(3):9943. doi: 10.1007/s10916-013-9943-6. Epub 2013 Apr 13.

Dynamic enhancement of drug product labels to support drug safety, efficacy, and effectiveness.动态强化药品标签以支持药品安全性、有效性及实际效果。

J Biomed Semantics. 2013 Jan 26;4(1):5. doi: 10.1186/2041-1480-4-5.

本文引用的文献

Drug name recognition and classification in biomedical texts. A case study outlining approaches underpinning automated systems.生物医学文本中的药物名称识别与分类。一项概述自动化系统基础方法的案例研究。

Drug Discov Today. 2008 Sep;13(17-18):816-23. doi: 10.1016/j.drudis.2008.06.001. Epub 2008 Jul 17.

DrugBank: a knowledgebase for drugs, drug actions and drug targets.药物银行：一个关于药物、药物作用和药物靶点的知识库。

Nucleic Acids Res. 2008 Jan;36(Database issue):D901-6. doi: 10.1093/nar/gkm958. Epub 2007 Nov 29.

Comparative assessment of four drug interaction compendia.四种药物相互作用汇编的比较评估

Br J Clin Pharmacol. 2007 Jun;63(6):709-14. doi: 10.1111/j.1365-2125.2006.02809.x. Epub 2006 Dec 7.

DrugBank: a comprehensive resource for in silico drug discovery and exploration.药物银行：用于计算机辅助药物发现与探索的综合资源。

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D668-72. doi: 10.1093/nar/gkj067.

Bioie: retargetable information extraction and ontological annotation of biological interactions from the literature.Bioie：从文献中提取可重新定位的生物相互作用信息并进行本体注释。

J Bioinform Comput Biol. 2004 Sep;2(3):551-68. doi: 10.1142/s0219720004000739.

Detection, verification, and quantification of adverse drug reactions.药物不良反应的检测、验证与定量分析。

BMJ. 2004 Jul 3;329(7456):44-7. doi: 10.1136/bmj.329.7456.44.

Adverse drug reactions as cause of admission to hospital: prospective analysis of 18 820 patients.药物不良反应作为入院原因：对18820例患者的前瞻性分析。

BMJ. 2004 Jul 3;329(7456):15-9. doi: 10.1136/bmj.329.7456.15.

GENIA corpus--semantically annotated corpus for bio-textmining.GENIA语料库——用于生物文本挖掘的语义标注语料库。

Bioinformatics. 2003;19 Suppl 1:i180-2. doi: 10.1093/bioinformatics/btg1023.

International Nonproprietary Names (INN) for pharmaceutical substances.药品国际非专利名称。

Bull World Health Organ. 1995;73(3):275-9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

解决药理学文献中药物-药物相互作用提取的回指问题。

Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献