• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

否定的生物事件:分析与识别。

Negated bio-events: analysis and identification.

机构信息

National Centre for Text Mining, Manchester Interdisciplinary Biocentre, University of Manchester, 131 Princess Street, Manchester M1 7DN, UK.

出版信息

BMC Bioinformatics. 2013 Jan 16;14:14. doi: 10.1186/1471-2105-14-14.

DOI:10.1186/1471-2105-14-14
PMID:23323936
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3561152/
Abstract

BACKGROUND

Negation occurs frequently in scientific literature, especially in biomedical literature. It has previously been reported that around 13% of sentences found in biomedical research articles contain negation. Historically, the main motivation for identifying negated events has been to ensure their exclusion from lists of extracted interactions. However, recently, there has been a growing interest in negative results, which has resulted in negation detection being identified as a key challenge in biomedical relation extraction. In this article, we focus on the problem of identifying negated bio-events, given gold standard event annotations.

RESULTS

We have conducted a detailed analysis of three open access bio-event corpora containing negation information (i.e., GENIA Event, BioInfer and BioNLP'09 ST), and have identified the main types of negated bio-events. We have analysed the key aspects of a machine learning solution to the problem of detecting negated events, including selection of negation cues, feature engineering and the choice of learning algorithm. Combining the best solutions for each aspect of the problem, we propose a novel framework for the identification of negated bio-events. We have evaluated our system on each of the three open access corpora mentioned above. The performance of the system significantly surpasses the best results previously reported on the BioNLP'09 ST corpus, and achieves even better results on the GENIA Event and BioInfer corpora, both of which contain more varied and complex events.

CONCLUSIONS

Recently, in the field of biomedical text mining, the development and enhancement of event-based systems has received significant interest. The ability to identify negated events is a key performance element for these systems. We have conducted the first detailed study on the analysis and identification of negated bio-events. Our proposed framework can be integrated with state-of-the-art event extraction systems. The resulting systems will be able to extract bio-events with attached polarities from textual documents, which can serve as the foundation for more elaborate systems that are able to detect mutually contradicting bio-events.

摘要

背景

否定在科学文献中经常出现,尤其是在生物医学文献中。据报道,生物医学研究文章中约有 13%的句子包含否定。从历史上看,识别否定事件的主要动机是确保它们不会被提取的交互列表所包含。然而,最近,人们对负面结果的兴趣越来越大,这导致否定检测被确定为生物医学关系提取中的一个关键挑战。在本文中,我们专注于给定金标准事件标注识别否定生物事件的问题。

结果

我们对包含否定信息的三个开放获取生物事件语料库(即 GENIA Event、BioInfer 和 BioNLP'09 ST)进行了详细分析,并确定了否定生物事件的主要类型。我们分析了机器学习解决方案检测否定事件的关键方面,包括否定线索的选择、特征工程和学习算法的选择。我们结合了该问题每个方面的最佳解决方案,提出了一种用于识别否定生物事件的新框架。我们在上述三个开放获取语料库中的每一个上都评估了我们的系统。该系统的性能明显超过了之前在 BioNLP'09 ST 语料库上报告的最佳结果,并且在 GENIA Event 和 BioInfer 语料库上的性能甚至更好,这两个语料库都包含了更多样化和复杂的事件。

结论

最近,在生物医学文本挖掘领域,基于事件的系统的开发和增强受到了极大的关注。识别否定事件的能力是这些系统的关键性能要素。我们对否定生物事件的分析和识别进行了首次详细研究。我们提出的框架可以与最先进的事件提取系统集成。由此产生的系统将能够从文本文档中提取带有极性的生物事件,这可以作为能够检测相互矛盾的生物事件的更精细系统的基础。

相似文献

1
Negated bio-events: analysis and identification.否定的生物事件:分析与识别。
BMC Bioinformatics. 2013 Jan 16;14:14. doi: 10.1186/1471-2105-14-14.
2
Extracting semantically enriched events from biomedical literature.从生物医学文献中提取语义丰富的事件。
BMC Bioinformatics. 2012 May 23;13:108. doi: 10.1186/1471-2105-13-108.
3
Biomedical events extraction using the hidden vector state model.基于隐向量状态模型的生物医学事件抽取。
Artif Intell Med. 2011 Nov;53(3):205-13. doi: 10.1016/j.artmed.2011.08.002. Epub 2011 Sep 25.
4
Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora.生物语义范围语料库和基因事件语料库中基于语言范围和基于生物事件的推测与否定标注。
J Biomed Semantics. 2011 Oct 6;2 Suppl 5(Suppl 5):S8. doi: 10.1186/2041-1480-2-S5-S8.
5
Biomedical negation scope detection with conditional random fields.基于条件随机场的生物医学否定范围检测。
J Am Med Inform Assoc. 2010 Nov-Dec;17(6):696-701. doi: 10.1136/jamia.2010.003228.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Event extraction with complex event classification using rich features.利用丰富特征进行复杂事件分类的事件抽取。
J Bioinform Comput Biol. 2010 Feb;8(1):131-46. doi: 10.1142/s0219720010004586.
8
Biological event composition.生物事件组成。
BMC Bioinformatics. 2012 Jun 26;13 Suppl 11(Suppl 11):S7. doi: 10.1186/1471-2105-13-S11-S7.
9
Identification of research hypotheses and new knowledge from scientific literature.从科学文献中识别研究假设和新知识。
BMC Med Inform Decis Mak. 2018 Jun 25;18(1):46. doi: 10.1186/s12911-018-0639-1.
10
A semi-supervised learning framework for biomedical event extraction based on hidden topics.基于隐主题的生物医学事件抽取的半监督学习框架。
Artif Intell Med. 2015 May;64(1):51-8. doi: 10.1016/j.artmed.2015.03.004. Epub 2015 Apr 1.

引用本文的文献

1
A novel corpus of molecular to higher-order events that facilitates the understanding of the pathogenic mechanisms of idiopathic pulmonary fibrosis.一种新的分子到更高阶事件的语料库,有助于理解特发性肺纤维化的发病机制。
Sci Rep. 2023 Apr 12;13(1):5986. doi: 10.1038/s41598-023-32915-8.
2
A survey on clinical natural language processing in the United Kingdom from 2007 to 2022.2007年至2022年英国临床自然语言处理调查。
NPJ Digit Med. 2022 Dec 21;5(1):186. doi: 10.1038/s41746-022-00730-6.
3
An automated approach to identify scientific publications reporting pharmacokinetic parameters.

本文引用的文献

1
Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora.生物语义范围语料库和基因事件语料库中基于语言范围和基于生物事件的推测与否定标注。
J Biomed Semantics. 2011 Oct 6;2 Suppl 5(Suppl 5):S8. doi: 10.1186/2041-1480-2-S5-S8.
2
Recent progress in automatically extracting information from the pharmacogenomic literature.从药物基因组学文献中自动提取信息的最新进展。
Pharmacogenomics. 2010 Oct;11(10):1467-89. doi: 10.2217/pgs.10.136.
3
Biomedical negation scope detection with conditional random fields.
一种识别报告药代动力学参数的科学出版物的自动化方法。
Wellcome Open Res. 2021 Apr 21;6:88. doi: 10.12688/wellcomeopenres.16718.1. eCollection 2021.
4
Antibody Exchange: Information extraction of biological antibody donation and a web-portal to find donors and seekers.抗体交换:生物抗体捐赠的信息提取及一个用于寻找捐赠者和需求者的网络平台。
Data (Basel). 2017 Dec;2(4). doi: 10.3390/data2040038. Epub 2017 Nov 21.
5
Annotation and detection of drug effects in text for pharmacovigilance.用于药物警戒的文本中药物效应的标注与检测。
J Cheminform. 2018 Aug 13;10(1):37. doi: 10.1186/s13321-018-0290-y.
6
Identification of research hypotheses and new knowledge from scientific literature.从科学文献中识别研究假设和新知识。
BMC Med Inform Decis Mak. 2018 Jun 25;18(1):46. doi: 10.1186/s12911-018-0639-1.
7
Using uncertainty to link and rank evidence from biomedical literature for model curation.利用不确定性将生物医学文献中的证据进行链接和排序,以用于模型编纂。
Bioinformatics. 2017 Dec 1;33(23):3784-3792. doi: 10.1093/bioinformatics/btx466.
8
Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research.从文本和大规模数据分析中提取基因与疾病之间的关系:对转化研究的启示。
BMC Bioinformatics. 2015 Feb 21;16:55. doi: 10.1186/s12859-015-0472-9.
基于条件随机场的生物医学否定范围检测。
J Am Med Inform Assoc. 2010 Nov-Dec;17(6):696-701. doi: 10.1136/jamia.2010.003228.
4
Event extraction with complex event classification using rich features.利用丰富特征进行复杂事件分类的事件抽取。
J Bioinform Comput Biol. 2010 Feb;8(1):131-46. doi: 10.1142/s0219720010004586.
5
The Negatome database: a reference set of non-interacting protein pairs.Negatome 数据库:一组非相互作用的蛋白质对参考集。
Nucleic Acids Res. 2010 Jan;38(Database issue):D540-4. doi: 10.1093/nar/gkp1026. Epub 2009 Nov 17.
6
Construction of an annotated corpus to support biomedical information extraction.构建带注释语料库以支持生物医学信息抽取。
BMC Bioinformatics. 2009 Oct 23;10:349. doi: 10.1186/1471-2105-10-349.
7
Evaluating contributions of natural language parsers to protein-protein interaction extraction.评估自然语言解析器对蛋白质-蛋白质相互作用提取的贡献。
Bioinformatics. 2009 Feb 1;25(3):394-400. doi: 10.1093/bioinformatics/btn631. Epub 2008 Dec 9.
8
The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes.生物显微镜语料库:标注了不确定性、否定及其范围的生物医学文本。
BMC Bioinformatics. 2008 Nov 19;9 Suppl 11(Suppl 11):S9. doi: 10.1186/1471-2105-9-S11-S9.
9
Defrosting the digital library: bibliographic tools for the next generation web.解冻数字图书馆:面向下一代网络的书目工具
PLoS Comput Biol. 2008 Oct;4(10):e1000204. doi: 10.1371/journal.pcbi.1000204. Epub 2008 Oct 31.
10
What are decision trees?决策树是什么?
Nat Biotechnol. 2008 Sep;26(9):1011-3. doi: 10.1038/nbt0908-1011.