一种用于临床文本的快速、准确且可推广的基于启发式的否定检测算法。

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text.

作者信息

Slater Karin, Bradlow William, Motti Dino Fa, Hoehndorf Robert, Ball Simon, Gkoutos Georgios V

机构信息

College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, University of Birmingham, UK; Institute of Translational Medicine, University Hospitals Birmingham, NHS Foundation Trust, UK; University Hospitals Birmingham NHS Foundation Trust, Edgbaston, Birmingham, UK.

Institute of Translational Medicine, University Hospitals Birmingham, NHS Foundation Trust, UK; University Hospitals Birmingham NHS Foundation Trust, Edgbaston, Birmingham, UK.

出版信息

Comput Biol Med. 2021 Mar;130:104216. doi: 10.1016/j.compbiomed.2021.104216. Epub 2021 Jan 16.

DOI:10.1016/j.compbiomed.2021.104216

PMID:33484944

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7910278/

Abstract

Negation detection is an important task in biomedical text mining. Particularly in clinical settings, it is of critical importance to determine whether findings mentioned in text are present or absent. Rule-based negation detection algorithms are a common approach to the task, and more recent investigations have resulted in the development of rule-based systems utilising the rich grammatical information afforded by typed dependency graphs. However, interacting with these complex representations inevitably necessitates complex rules, which are time-consuming to develop and do not generalise well. We hypothesise that a heuristic approach to determining negation via dependency graphs could offer a powerful alternative. We describe and implement an algorithm for negation detection based on grammatical distance from a negatory construct in a typed dependency graph. To evaluate the algorithm, we develop two testing corpora comprised of sentences of clinical text extracted from the MIMIC-III database and documents related to hypertrophic cardiomyopathy patients routinely collected at University Hospitals Birmingham NHS trust. Gold-standard validation datasets were built by a combination of human annotation and examination of algorithm error. Finally, we compare the performance of our approach with four other rule-based algorithms on both gold-standard corpora. The presented algorithm exhibits the best performance by f-measure over the MIMIC-III dataset, and a similar performance to the syntactic negation detection systems over the HCM dataset. It is also the fastest of the dependency-based negation systems explored in this study. Our results show that while a single heuristic approach to dependency-based negation detection is ignorant to certain advanced cases, it nevertheless forms a powerful and stable method, requiring minimal training and adaptation between datasets. As such, it could present a drop-in replacement or augmentation for many-rule negation approaches in clinical text-mining pipelines, particularly for cases where adaptation and rule development is not required or possible.

摘要

否定检测是生物医学文本挖掘中的一项重要任务。特别是在临床环境中，确定文本中提到的发现是否存在至关重要。基于规则的否定检测算法是完成这项任务的常用方法，最近的研究导致了利用类型依存关系图提供的丰富语法信息开发基于规则的系统。然而，与这些复杂的表示进行交互不可避免地需要复杂的规则，这些规则开发耗时且泛化性不佳。我们假设通过依存关系图确定否定的启发式方法可能提供一种强大的替代方案。我们描述并实现了一种基于类型依存关系图中与否定结构的语法距离进行否定检测的算法。为了评估该算法，我们开发了两个测试语料库，它们由从MIMIC-III数据库中提取的临床文本句子以及伯明翰大学医院国民保健服务信托基金常规收集的肥厚型心肌病患者相关文档组成。通过人工标注和算法错误检查相结合的方式构建了金标准验证数据集。最后，我们在两个金标准语料库上比较了我们的方法与其他四种基于规则的算法的性能。所提出的算法在MIMIC-III数据集上通过F值表现出最佳性能，在肥厚型心肌病数据集上与句法否定检测系统表现相似。它也是本研究中探索的基于依存关系的否定系统中最快的。我们的结果表明，虽然基于依存关系的否定检测的单一启发式方法对某些高级情况不敏感，但它仍然形成了一种强大且稳定的方法，在不同数据集之间所需的训练和调整最少。因此，它可以作为临床文本挖掘管道中多规则否定方法的直接替代或补充，特别是在不需要或不可能进行调整和规则开发的情况下。

相似文献

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text.

Comput Biol Med. 2021 Mar;130:104216. doi: 10.1016/j.compbiomed.2021.104216. Epub 2021 Jan 16.

DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx.

J Biomed Inform. 2015 Apr;54:213-9. doi: 10.1016/j.jbi.2015.02.010. Epub 2015 Mar 16.

Negation recognition in clinical natural language processing using a combination of the NegEx algorithm and a convolutional neural network.

BMC Med Inform Decis Mak. 2023 Oct 13;23(1):216. doi: 10.1186/s12911-023-02301-5.

Does BERT need domain adaptation for clinical negation detection?

J Am Med Inform Assoc. 2020 Apr 1;27(4):584-591. doi: 10.1093/jamia/ocaa001.

Automatic negation detection in narrative pathology reports.

Artif Intell Med. 2015 May;64(1):41-50. doi: 10.1016/j.artmed.2015.03.001. Epub 2015 Mar 24.

The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study.

JMIR Med Inform. 2020 Dec 3;8(12):e18953. doi: 10.2196/18953.

Portable automatic text classification for adverse drug reaction detection via multi-corpus training.

J Biomed Inform. 2015 Feb;53:196-207. doi: 10.1016/j.jbi.2014.11.002. Epub 2014 Nov 8.

Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach.

PeerJ Comput Sci. 2022 Mar 7;8:e913. doi: 10.7717/peerj-cs.913. eCollection 2022.

Exploiting graph kernels for high performance biomedical relation extraction.

J Biomed Semantics. 2018 Jan 30;9(1):7. doi: 10.1186/s13326-017-0168-3.

Biomedical negation scope detection with conditional random fields.

J Am Med Inform Assoc. 2010 Nov-Dec;17(6):696-701. doi: 10.1136/jamia.2010.003228.

引用本文的文献

Artificial intelligence to enhance clinical value across the spectrum of cardiovascular healthcare.

Eur Heart J. 2023 Mar 1;44(9):713-725. doi: 10.1093/eurheartj/ehac758.

Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods.

BMC Bioinformatics. 2023 Jan 9;24(1):10. doi: 10.1186/s12859-022-05130-x.

Transforming epilepsy research: A systematic review on natural language processing applications.

Epilepsia. 2023 Feb;64(2):292-305. doi: 10.1111/epi.17474. Epub 2022 Dec 19.

Exploring Descriptions of Movement Through Geovisual Analytics.

KN J Cartogr Geogr Inf. 2022;72(1):5-27. doi: 10.1007/s42489-022-00098-3. Epub 2022 Feb 24.

Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity.

Front Digit Health. 2021 Dec 6;3:781227. doi: 10.3389/fdgth.2021.781227. eCollection 2021.

Multi-faceted semantic clustering with text-derived phenotypes.

Comput Biol Med. 2021 Nov;138:104904. doi: 10.1016/j.compbiomed.2021.104904. Epub 2021 Sep 27.

Improved characterisation of clinical text through ontology-based vocabulary expansion.

J Biomed Semantics. 2021 Apr 12;12(1):7. doi: 10.1186/s13326-021-00241-5.

本文引用的文献

The Role of a Deep-Learning Method for Negation Detection in Patient Cohort Identification from Electroencephalography Reports.

AMIA Annu Symp Proc. 2018 Dec 5;2018:1018-1027. eCollection 2018.

Comparison of 2 Natural Language Processing Methods for Identification of Bleeding Among Critically Ill Patients.

JAMA Netw Open. 2018 Oct 5;1(6):e183451. doi: 10.1001/jamanetworkopen.2018.3451.

CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital.

BMC Med Inform Decis Mak. 2018 Jun 25;18(1):47. doi: 10.1186/s12911-018-0623-9.

NegBio: a high-performance tool for negation and uncertainty detection in radiology reports.

AMIA Jt Summits Transl Sci Proc. 2018 May 18;2017:188-196. eCollection 2018.

MIMIC-III, a freely accessible critical care database.

Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.

DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx.

J Biomed Inform. 2015 Apr;54:213-9. doi: 10.1016/j.jbi.2015.02.010. Epub 2015 Mar 16.

The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data.

Nucleic Acids Res. 2014 Jan;42(Database issue):D966-74. doi: 10.1093/nar/gkt1026. Epub 2013 Nov 11.

Dependency Parser-based Negation Detection in Clinical Narratives.

AMIA Jt Summits Transl Sci Proc. 2012;2012:1-8. Epub 2012 Mar 19.

Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.

ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

J Biomed Inform. 2009 Oct;42(5):839-51. doi: 10.1016/j.jbi.2009.05.002. Epub 2009 May 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于临床文本的快速、准确且可推广的基于启发式的否定检测算法。

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text.

作者信息

Slater Karin, Bradlow William, Motti Dino Fa, Hoehndorf Robert, Ball Simon, Gkoutos Georgios V

机构信息

Institute of Translational Medicine, University Hospitals Birmingham, NHS Foundation Trust, UK; University Hospitals Birmingham NHS Foundation Trust, Edgbaston, Birmingham, UK.

出版信息

Comput Biol Med. 2021 Mar;130:104216. doi: 10.1016/j.compbiomed.2021.104216. Epub 2021 Jan 16.

DOI:10.1016/j.compbiomed.2021.104216

PMID:33484944

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7910278/

Abstract

摘要

一种用于临床文本的快速、准确且可推广的基于启发式的否定检测算法。

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

一种用于临床文本的快速、准确且可推广的基于启发式的否定检测算法。

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献