Suppr超能文献

临床领域中首字母缩略词和缩写词词义消歧的挑战与实用方法。

Challenges and practical approaches with word sense disambiguation of acronyms and abbreviations in the clinical domain.

作者信息

Moon Sungrim, McInnes Bridget, Melton Genevieve B

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.

Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.

出版信息

Healthc Inform Res. 2015 Jan;21(1):35-42. doi: 10.4258/hir.2015.21.1.35. Epub 2015 Jan 31.

Abstract

OBJECTIVES

Although acronyms and abbreviations in clinical text are used widely on a daily basis, relatively little research has focused upon word sense disambiguation (WSD) of acronyms and abbreviations in the healthcare domain. Since clinical notes have distinctive characteristics, it is unclear whether techniques effective for acronym and abbreviation WSD from biomedical literature are sufficient.

METHODS

The authors discuss feature selection for automated techniques and challenges with WSD of acronyms and abbreviations in the clinical domain.

RESULTS

There are significant challenges associated with the informal nature of clinical text, such as typographical errors and incomplete sentences; difficulty with insufficient clinical resources, such as clinical sense inventories; and obstacles with privacy and security for conducting research with clinical text. Although we anticipated that using sophisticated techniques, such as biomedical terminologies, semantic types, part-of-speech, and language modeling, would be needed for feature selection with automated machine learning approaches, we found instead that simple techniques, such as bag-of-words, were quite effective in many cases. Factors, such as majority sense prevalence and the degree of separateness between sense meanings, were also important considerations.

CONCLUSIONS

The first lesson is that a comprehensive understanding of the unique characteristics of clinical text is important for automatic acronym and abbreviation WSD. The second lesson learned is that investigators may find that using simple approaches is an effective starting point for these tasks. Finally, similar to other WSD tasks, an understanding of baseline majority sense rates and separateness between senses is important. Further studies and practical solutions are needed to better address these issues.

摘要

目的

尽管临床文本中的首字母缩略词和缩写在日常中广泛使用,但相对较少的研究聚焦于医疗领域中首字母缩略词和缩写的词义消歧(WSD)。由于临床记录具有独特的特征,尚不清楚来自生物医学文献的对首字母缩略词和缩写进行词义消歧的有效技术是否足够。

方法

作者讨论了自动化技术的特征选择以及临床领域中首字母缩略词和缩写的词义消歧所面临的挑战。

结果

临床文本的非正式性质带来了重大挑战,如排版错误和句子不完整;临床资源不足带来困难,如临床意义清单;以及使用临床文本进行研究时的隐私和安全障碍。尽管我们预计使用复杂技术,如生物医学术语、语义类型、词性和语言建模,对于自动化机器学习方法的特征选择是必要的,但我们反而发现简单技术,如词袋模型,在许多情况下相当有效。诸如多数意义流行率和意义之间的分离程度等因素也是重要的考虑因素。

结论

第一个教训是,全面理解临床文本的独特特征对于自动进行首字母缩略词和缩写的词义消歧很重要。第二个教训是,研究人员可能会发现使用简单方法是这些任务的有效起点。最后,与其他词义消歧任务类似,了解基线多数意义率和意义之间的分离程度很重要。需要进一步的研究和实际解决方案来更好地解决这些问题。

相似文献

1
Challenges and practical approaches with word sense disambiguation of acronyms and abbreviations in the clinical domain.
Healthc Inform Res. 2015 Jan;21(1):35-42. doi: 10.4258/hir.2015.21.1.35. Epub 2015 Jan 31.
2
3
A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources.
J Am Med Inform Assoc. 2014 Mar-Apr;21(2):299-307. doi: 10.1136/amiajnl-2012-001506. Epub 2013 Jun 27.
5
A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.
Appl Clin Inform. 2015 Jun 3;6(2):364-74. doi: 10.4338/ACI-2014-10-RA-0088. eCollection 2015.
6
A deep database of medical abbreviations and acronyms for natural language processing.
Sci Data. 2021 Jun 2;8(1):149. doi: 10.1038/s41597-021-00929-4.
9
The CLASSE GATOR (CLinical Acronym SenSE disambiGuATOR): A Method for predicting acronym sense from neonatal clinical notes.
Int J Med Inform. 2020 May;137:104101. doi: 10.1016/j.ijmedinf.2020.104101. Epub 2020 Feb 14.
10
A multi-aspect comparison study of supervised word sense disambiguation.
J Am Med Inform Assoc. 2004 Jul-Aug;11(4):320-31. doi: 10.1197/jamia.M1533. Epub 2004 Apr 2.

引用本文的文献

1
Analyzing the Creation and Use of Abbreviations in Cardiology and Cardiac Imaging Society Guidelines.
JACC Adv. 2025 Jan 13;4(2):101561. doi: 10.1016/j.jacadv.2024.101561. eCollection 2025 Feb.
2
Challenges of clinical accompaniment amongst undergraduate nursing students: University of KwaZulu-Natal.
Health SA. 2024 Jul 5;29:2535. doi: 10.4102/hsag.v29i0.2535. eCollection 2024.
3
Clinical Note Structural Knowledge Improves Word Sense Disambiguation.
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:515-524. eCollection 2024.
4
Large-scale identification of undiagnosed hepatic steatosis using natural language processing.
EClinicalMedicine. 2023 Aug 9;62:102149. doi: 10.1016/j.eclinm.2023.102149. eCollection 2023 Aug.
6
Deciphering clinical abbreviations with a privacy protecting machine learning system.
Nat Commun. 2022 Dec 2;13(1):7456. doi: 10.1038/s41467-022-35007-9.
7
Automated Mapping of Real-world Oncology Laboratory Data to LOINC.
AMIA Annu Symp Proc. 2022 Feb 21;2021:611-620. eCollection 2021.
8
A deep database of medical abbreviations and acronyms for natural language processing.
Sci Data. 2021 Jun 2;8(1):149. doi: 10.1038/s41597-021-00929-4.
10
A method for harmonization of clinical abbreviation and acronym sense inventories.
J Biomed Inform. 2018 Dec;88:62-69. doi: 10.1016/j.jbi.2018.11.004. Epub 2018 Nov 7.

本文引用的文献

1
Word sense disambiguation via semantic type classification.
AMIA Annu Symp Proc. 2008 Nov 6;2008:177-81.
2
Methods for building sense inventories of abbreviations in clinical notes.
J Am Med Inform Assoc. 2009 Jan-Feb;16(1):103-8. doi: 10.1197/jamia.M2927. Epub 2008 Oct 24.
3
Medical abbreviations: writing little and communicating less.
Arch Dis Child. 2008 Oct;93(10):816-7. doi: 10.1136/adc.2008.141473.
4
A study of abbreviations in clinical notes.
AMIA Annu Symp Proc. 2007 Oct 11;2007:821-5.
6
Word sense disambiguation across two domains: biomedical literature and clinical notes.
J Biomed Inform. 2008 Dec;41(6):1088-100. doi: 10.1016/j.jbi.2008.02.003. Epub 2008 Mar 4.
9
ADAM: another database of abbreviations in MEDLINE.
Bioinformatics. 2006 Nov 15;22(22):2813-8. doi: 10.1093/bioinformatics/btl480. Epub 2006 Sep 18.
10

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验