• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从生物医学文本中提取空间信息的结构化学习:细菌生物栖息地

Structured learning for spatial information extraction from biomedical text: bacteria biotopes.

作者信息

Kordjamshidi Parisa, Roth Dan, Moens Marie-Francine

机构信息

KULeuven, Leuven, Belgium.

University of Illinois at Urbana-Champaign, Urbana-Champaign, USA.

出版信息

BMC Bioinformatics. 2015 Apr 25;16:129. doi: 10.1186/s12859-015-0542-z.

DOI:10.1186/s12859-015-0542-z
PMID:25909637
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4426185/
Abstract

BACKGROUND

We aim to automatically extract species names of bacteria and their locations from webpages. This task is important for exploiting the vast amount of biological knowledge which is expressed in diverse natural language texts and putting this knowledge in databases for easy access by biologists. The task is challenging and the previous results are far below an acceptable level of performance, particularly for extraction of localization relationships. Therefore, we aim to design a new system for such extractions, using the framework of structured machine learning techniques.

RESULTS

We design a new model for joint extraction of biomedical entities and the localization relationship. Our model is based on a spatial role labeling (SpRL) model designed for spatial understanding of unrestricted text. We extend SpRL to extract discourse level spatial relations in the biomedical domain and apply it on the BioNLP-ST 2013, BB-shared task. We highlight the main differences between general spatial language understanding and spatial information extraction from the scientific text which is the focus of this work. We exploit the text's structure and discourse level global features. Our model and the designed features substantially improve on the previous systems, achieving an absolute improvement of approximately 57 percent over F1 measure of the best previous system for this task.

CONCLUSIONS

Our experimental results indicate that a joint learning model over all entities and relationships in a document outperforms a model which extracts entities and relationships independently. Our global learning model significantly improves the state-of-the-art results on this task and has a high potential to be adopted in other natural language processing (NLP) tasks in the biomedical domain.

摘要

背景

我们旨在从网页中自动提取细菌的物种名称及其位置。这项任务对于利用大量以各种自然语言文本表达的生物学知识,并将这些知识存入数据库以便生物学家轻松访问而言非常重要。该任务具有挑战性,且先前的结果远低于可接受的性能水平,尤其是在提取定位关系方面。因此,我们旨在使用结构化机器学习技术框架设计一个用于此类提取的新系统。

结果

我们设计了一种用于联合提取生物医学实体及其定位关系的新模型。我们的模型基于为理解无限制文本的空间信息而设计的空间角色标注(SpRL)模型。我们扩展了SpRL以提取生物医学领域中的篇章级空间关系,并将其应用于BioNLP - ST 2013的BB共享任务。我们强调了一般空间语言理解与从科学文本中提取空间信息之间的主要差异,而科学文本正是本工作的重点。我们利用了文本的结构和篇章级全局特征。我们的模型和所设计的特征在先前系统的基础上有了显著改进,相对于此任务中先前最佳系统的F1值,绝对提升了约57%。

结论

我们的实验结果表明,对文档中所有实体和关系进行联合学习的模型优于独立提取实体和关系的模型。我们的全局学习模型显著改进了该任务的当前最优结果,并且在生物医学领域的其他自然语言处理(NLP)任务中具有很高的应用潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ede2/4426185/777416b5eb0f/12859_2015_542_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ede2/4426185/ce3744835968/12859_2015_542_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ede2/4426185/1df6b98ef6c9/12859_2015_542_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ede2/4426185/777416b5eb0f/12859_2015_542_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ede2/4426185/ce3744835968/12859_2015_542_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ede2/4426185/1df6b98ef6c9/12859_2015_542_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ede2/4426185/777416b5eb0f/12859_2015_542_Fig3_HTML.jpg

相似文献

1
Structured learning for spatial information extraction from biomedical text: bacteria biotopes.从生物医学文本中提取空间信息的结构化学习:细菌生物栖息地
BMC Bioinformatics. 2015 Apr 25;16:129. doi: 10.1186/s12859-015-0542-z.
2
Biomedical event extraction based on GRU integrating attention mechanism.基于 GRU 集成注意力机制的生物医学事件抽取。
BMC Bioinformatics. 2018 Aug 13;19(Suppl 9):285. doi: 10.1186/s12859-018-2275-2.
3
Recognizing names in biomedical texts: a machine learning approach.识别生物医学文本中的名称:一种机器学习方法。
Bioinformatics. 2004 May 1;20(7):1178-90. doi: 10.1093/bioinformatics/bth060. Epub 2004 Feb 10.
4
The contribution of co-reference resolution to supervised relation detection between bacteria and biotopes entities.共指消解对细菌与生物栖息地实体之间监督关系检测的贡献。
BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S6. doi: 10.1186/1471-2105-16-S10-S6. Epub 2015 Jul 13.
5
Detection and categorization of bacteria habitats using shallow linguistic analysis.利用浅层语言分析检测和分类细菌栖息地
BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S5. doi: 10.1186/1471-2105-16-S10-S5. Epub 2015 Jul 13.
6
Extracting biomedical events from pairs of text entities.从文本实体对中提取生物医学事件。
BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-16-S10-S8. Epub 2015 Jul 13.
7
Self-training in significance space of support vectors for imbalanced biomedical event data.针对不平衡生物医学事件数据在支持向量的显著性空间中进行自训练。
BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-16-S7-S6. Epub 2015 Apr 23.
8
Semi-supervised learning of causal relations in biomedical scientific discourse.生物医学科学话语中因果关系的半监督学习
Biomed Eng Online. 2014;13 Suppl 2(Suppl 2):S1. doi: 10.1186/1475-925X-13-S2-S1. Epub 2014 Dec 11.
9
Active learning for ontological event extraction incorporating named entity recognition and unknown word handling.结合命名实体识别和未知词处理的本体事件抽取的主动学习
J Biomed Semantics. 2016 Apr 27;7:22. doi: 10.1186/s13326-016-0059-z. eCollection 2016.
10
A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set.一种使用混合特征集的生物医学文本关系提取框架。
Comput Math Methods Med. 2015;2015:910423. doi: 10.1155/2015/910423. Epub 2015 Aug 10.

引用本文的文献

1
Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology Reports.Rad-SpatialNet:用于放射学报告中细粒度空间关系的基于框架的资源。
LREC Int Conf Lang Resour Eval. 2020 May;2020:2251-2260.
2
Understanding spatial language in radiology: Representation framework, annotation, and spatial relation extraction from chest X-ray reports using deep learning.理解放射学中的空间语言:使用深度学习从胸部X光报告中进行表示框架、标注和空间关系提取。
J Biomed Inform. 2020 Aug;108:103473. doi: 10.1016/j.jbi.2020.103473. Epub 2020 Jun 18.
3
Spatial Relation Extraction from Radiology Reports using Syntax-Aware Word Representations.

本文引用的文献

1
Biomedical event extraction from abstracts and full papers using search-based structured prediction.基于搜索的结构化预测在摘要和全文中进行生物医学事件抽取。
BMC Bioinformatics. 2012 Jun 26;13 Suppl 11(Suppl 11):S5. doi: 10.1186/1471-2105-13-S11-S5.
2
BioLemmatizer: a lemmatization tool for morphological processing of biomedical text.生物词元化器:一种用于生物医学文本形态处理的词元化工具。
J Biomed Semantics. 2012 Apr 1;3:3. doi: 10.1186/2041-1480-3-3.
使用句法感知词表示法从放射学报告中提取空间关系
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:116-125. eCollection 2020.
4
Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning.利用结合知识库和深度学习的自然语言处理系统提取药物和相关药物不良事件。
J Am Med Inform Assoc. 2020 Jan 1;27(1):56-64. doi: 10.1093/jamia/ocz141.
5
Extraction of Information Related to Adverse Drug Events from Electronic Health Record Notes: Design of an End-to-End Model Based on Deep Learning.从电子健康记录笔记中提取与药物不良事件相关的信息:基于深度学习的端到端模型设计
JMIR Med Inform. 2018 Nov 26;6(4):e12159. doi: 10.2196/12159.
6
ProtozoaDB 2.0: A Trypanosoma Brucei Case Study.原生动物数据库2.0:布氏锥虫案例研究。
Pathogens. 2017 Jul 20;6(3):32. doi: 10.3390/pathogens6030032.
7
A neural joint model for entity and relation extraction from biomedical text.一种用于从生物医学文本中提取实体和关系的神经联合模型。
BMC Bioinformatics. 2017 Mar 31;18(1):198. doi: 10.1186/s12859-017-1609-9.
8
Machine Reading for Extraction of Bacteria and Habitat Taxonomies.用于提取细菌和栖息地分类法的机器阅读
Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap. 2015 Jan;574:239-255. doi: 10.1007/978-3-319-27707-3_15. Epub 2016 Jan 5.
9
An Overview of Biomolecular Event Extraction from Scientific Documents.科学文献中生物分子事件提取概述
Comput Math Methods Med. 2015;2015:571381. doi: 10.1155/2015/571381. Epub 2015 Oct 26.