临床领域中首字母缩略词和缩写词词义消歧的挑战与实用方法。

Challenges and practical approaches with word sense disambiguation of acronyms and abbreviations in the clinical domain.

作者信息

Moon Sungrim, McInnes Bridget, Melton Genevieve B

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.

Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.

出版信息

Healthc Inform Res. 2015 Jan;21(1):35-42. doi: 10.4258/hir.2015.21.1.35. Epub 2015 Jan 31.

DOI:10.4258/hir.2015.21.1.35

PMID:25705556

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4330198/

Abstract

OBJECTIVES

Although acronyms and abbreviations in clinical text are used widely on a daily basis, relatively little research has focused upon word sense disambiguation (WSD) of acronyms and abbreviations in the healthcare domain. Since clinical notes have distinctive characteristics, it is unclear whether techniques effective for acronym and abbreviation WSD from biomedical literature are sufficient.

METHODS

The authors discuss feature selection for automated techniques and challenges with WSD of acronyms and abbreviations in the clinical domain.

RESULTS

There are significant challenges associated with the informal nature of clinical text, such as typographical errors and incomplete sentences; difficulty with insufficient clinical resources, such as clinical sense inventories; and obstacles with privacy and security for conducting research with clinical text. Although we anticipated that using sophisticated techniques, such as biomedical terminologies, semantic types, part-of-speech, and language modeling, would be needed for feature selection with automated machine learning approaches, we found instead that simple techniques, such as bag-of-words, were quite effective in many cases. Factors, such as majority sense prevalence and the degree of separateness between sense meanings, were also important considerations.

CONCLUSIONS

The first lesson is that a comprehensive understanding of the unique characteristics of clinical text is important for automatic acronym and abbreviation WSD. The second lesson learned is that investigators may find that using simple approaches is an effective starting point for these tasks. Finally, similar to other WSD tasks, an understanding of baseline majority sense rates and separateness between senses is important. Further studies and practical solutions are needed to better address these issues.

摘要

目的

尽管临床文本中的首字母缩略词和缩写在日常中广泛使用，但相对较少的研究聚焦于医疗领域中首字母缩略词和缩写的词义消歧（WSD）。由于临床记录具有独特的特征，尚不清楚来自生物医学文献的对首字母缩略词和缩写进行词义消歧的有效技术是否足够。

方法

作者讨论了自动化技术的特征选择以及临床领域中首字母缩略词和缩写的词义消歧所面临的挑战。

结果

临床文本的非正式性质带来了重大挑战，如排版错误和句子不完整；临床资源不足带来困难，如临床意义清单；以及使用临床文本进行研究时的隐私和安全障碍。尽管我们预计使用复杂技术，如生物医学术语、语义类型、词性和语言建模，对于自动化机器学习方法的特征选择是必要的，但我们反而发现简单技术，如词袋模型，在许多情况下相当有效。诸如多数意义流行率和意义之间的分离程度等因素也是重要的考虑因素。

结论

第一个教训是，全面理解临床文本的独特特征对于自动进行首字母缩略词和缩写的词义消歧很重要。第二个教训是，研究人员可能会发现使用简单方法是这些任务的有效起点。最后，与其他词义消歧任务类似，了解基线多数意义率和意义之间的分离程度很重要。需要进一步的研究和实际解决方案来更好地解决这些问题。

相似文献

Challenges and practical approaches with word sense disambiguation of acronyms and abbreviations in the clinical domain.

Healthc Inform Res. 2015 Jan;21(1):35-42. doi: 10.4258/hir.2015.21.1.35. Epub 2015 Jan 31.

Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.

BMC Bioinformatics. 2006 Jul 5;7:334. doi: 10.1186/1471-2105-7-334.

A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources.

J Am Med Inform Assoc. 2014 Mar-Apr;21(2):299-307. doi: 10.1136/amiajnl-2012-001506. Epub 2013 Jun 27.

Automated disambiguation of acronyms and abbreviations in clinical texts: window and training size considerations.

AMIA Annu Symp Proc. 2012;2012:1310-9. Epub 2012 Nov 3.

A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.

Appl Clin Inform. 2015 Jun 3;6(2):364-74. doi: 10.4338/ACI-2014-10-RA-0088. eCollection 2015.

A deep database of medical abbreviations and acronyms for natural language processing.

Sci Data. 2021 Jun 2;8(1):149. doi: 10.1038/s41597-021-00929-4.

A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD).

J Am Med Inform Assoc. 2017 Apr 1;24(e1):e79-e86. doi: 10.1093/jamia/ocw109.

Abbreviation and acronym disambiguation in clinical discourse.

AMIA Annu Symp Proc. 2005;2005:589-93.

The CLASSE GATOR (CLinical Acronym SenSE disambiGuATOR): A Method for predicting acronym sense from neonatal clinical notes.

Int J Med Inform. 2020 May;137:104101. doi: 10.1016/j.ijmedinf.2020.104101. Epub 2020 Feb 14.

A multi-aspect comparison study of supervised word sense disambiguation.

J Am Med Inform Assoc. 2004 Jul-Aug;11(4):320-31. doi: 10.1197/jamia.M1533. Epub 2004 Apr 2.

引用本文的文献

Analyzing the Creation and Use of Abbreviations in Cardiology and Cardiac Imaging Society Guidelines.

JACC Adv. 2025 Jan 13;4(2):101561. doi: 10.1016/j.jacadv.2024.101561. eCollection 2025 Feb.

Challenges of clinical accompaniment amongst undergraduate nursing students: University of KwaZulu-Natal.

Health SA. 2024 Jul 5;29:2535. doi: 10.4102/hsag.v29i0.2535. eCollection 2024.

Clinical Note Structural Knowledge Improves Word Sense Disambiguation.

AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:515-524. eCollection 2024.

Large-scale identification of undiagnosed hepatic steatosis using natural language processing.

EClinicalMedicine. 2023 Aug 9;62:102149. doi: 10.1016/j.eclinm.2023.102149. eCollection 2023 Aug.

Predicting future falls in older people using natural language processing of general practitioners' clinical notes.

Age Ageing. 2023 Apr 1;52(4). doi: 10.1093/ageing/afad046.

Deciphering clinical abbreviations with a privacy protecting machine learning system.

Nat Commun. 2022 Dec 2;13(1):7456. doi: 10.1038/s41467-022-35007-9.

Automated Mapping of Real-world Oncology Laboratory Data to LOINC.

AMIA Annu Symp Proc. 2022 Feb 21;2021:611-620. eCollection 2021.

A deep database of medical abbreviations and acronyms for natural language processing.

Sci Data. 2021 Jun 2;8(1):149. doi: 10.1038/s41597-021-00929-4.

Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression.

Int J Med Inform. 2019 Sep;129:334-341. doi: 10.1016/j.ijmedinf.2019.06.028. Epub 2019 Jul 6.

A method for harmonization of clinical abbreviation and acronym sense inventories.

J Biomed Inform. 2018 Dec;88:62-69. doi: 10.1016/j.jbi.2018.11.004. Epub 2018 Nov 7.

本文引用的文献

Word sense disambiguation via semantic type classification.

AMIA Annu Symp Proc. 2008 Nov 6;2008:177-81.

Methods for building sense inventories of abbreviations in clinical notes.

J Am Med Inform Assoc. 2009 Jan-Feb;16(1):103-8. doi: 10.1197/jamia.M2927. Epub 2008 Oct 24.

Medical abbreviations: writing little and communicating less.

Arch Dis Child. 2008 Oct;93(10):816-7. doi: 10.1136/adc.2008.141473.

A study of abbreviations in clinical notes.

AMIA Annu Symp Proc. 2007 Oct 11;2007:821-5.

Extracting information from textual documents in the electronic health record: a review of recent research.

Yearb Med Inform. 2008:128-44.

Word sense disambiguation across two domains: biomedical literature and clinical notes.

J Biomed Inform. 2008 Dec;41(6):1088-100. doi: 10.1016/j.jbi.2008.02.003. Epub 2008 Mar 4.

Abbreviations and acronyms in healthcare: when shorter isn't sweeter.

Pediatr Nurs. 2007 Sep-Oct;33(5):392-8.

A comparative study of supervised learning as applied to acronym expansion in clinical reports.

AMIA Annu Symp Proc. 2006;2006:399-403.

ADAM: another database of abbreviations in MEDLINE.

Bioinformatics. 2006 Nov 15;22(22):2813-8. doi: 10.1093/bioinformatics/btl480. Epub 2006 Sep 18.

Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.

BMC Bioinformatics. 2006 Jul 5;7:334. doi: 10.1186/1471-2105-7-334.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

临床领域中首字母缩略词和缩写词词义消歧的挑战与实用方法。

Challenges and practical approaches with word sense disambiguation of acronyms and abbreviations in the clinical domain.

作者信息

机构信息

出版信息

OBJECTIVES

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献