实时临床缩写词消歧的初步研究

A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.

作者信息

Wu Y, Denny J C, Rosenbloom S T, Miller R A, Giuse D A, Song M, Xu H

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston , Houston, Texas, USA.

Department of Biomedical Informatics Camridge, Vanderbilt University , Nashville, Tennessee, USA.

出版信息

Appl Clin Inform. 2015 Jun 3;6(2):364-74. doi: 10.4338/ACI-2014-10-RA-0088. eCollection 2015.

DOI:10.4338/ACI-2014-10-RA-0088

PMID:26171081

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4493336/

Abstract

OBJECTIVE

To save time, healthcare providers frequently use abbreviations while authoring clinical documents. Nevertheless, abbreviations that authors deem unambiguous often confuse other readers, including clinicians, patients, and natural language processing (NLP) systems. Most current clinical NLP systems "post-process" notes long after clinicians enter them into electronic health record systems (EHRs). Such post-processing cannot guarantee 100% accuracy in abbreviation identification and disambiguation, since multiple alternative interpretations exist.

METHODS

Authors describe a prototype system for real-time Clinical Abbreviation Recognition and Disambiguation (rCARD) - i.e., a system that interacts with authors during note generation to verify correct abbreviation senses. The rCARD system design anticipates future integration with web-based clinical documentation systems to improve quality of healthcare records. When clinicians enter documents, rCARD will automatically recognize each abbreviation. For abbreviations with multiple possible senses, rCARD will show a ranked list of possible meanings with the best predicted sense at the top. The prototype application embodies three word sense disambiguation (WSD) methods to predict the correct senses of abbreviations. We then conducted three experments to evaluate rCARD, including 1) a performance evaluation of different WSD methods; 2) a time evaluation of real-time WSD methods; and 3) a user study of typing clinical sentences with abbreviations using rCARD.

RESULTS

Using 4,721 sentences containing 25 commonly observed, highly ambiguous clinical abbreviations, our evaluation showed that the best profile-based method implemented in rCARD achieved a reasonable WSD accuracy of 88.8% (comparable to SVM - 89.5%) and the cost of time for the different WSD methods are also acceptable (ranging from 0.630 to 1.649 milliseconds within the same network). The preliminary user study also showed that the extra time costs by rCARD were about 5% of total document entry time and users did not feel a significant delay when using rCARD for clinical document entry.

CONCLUSION

The study indicates that it is feasible to integrate a real-time, NLP-enabled abbreviation recognition and disambiguation module with clinical documentation systems.

摘要

目的

为节省时间，医疗保健提供者在撰写临床文档时经常使用缩写。然而，作者认为明确无误的缩写常常会使其他读者感到困惑，包括临床医生、患者和自然语言处理（NLP）系统。目前大多数临床NLP系统在临床医生将记录录入电子健康记录系统（EHR）很久之后才进行“后处理”。由于存在多种不同的解释，这种后处理无法保证缩写识别和消除歧义的准确率达到100%。

方法

作者描述了一种用于实时临床缩写识别与消除歧义（rCARD）的原型系统，即一种在生成记录时与作者交互以验证正确缩写含义的系统。rCARD系统设计预期未来将与基于网络的临床文档系统集成，以提高医疗记录的质量。当临床医生录入文档时，rCARD将自动识别每个缩写。对于有多种可能含义的缩写，rCARD将显示一个可能含义的排序列表，最佳预测含义排在首位。该原型应用体现了三种词义消歧（WSD）方法来预测缩写的正确含义。然后我们进行了三项实验来评估rCARD，包括：1）不同WSD方法的性能评估；2）实时WSD方法的时间评估；3）使用rCARD输入含缩写临床句子的用户研究。

结果

使用包含25个常见、高度模糊临床缩写的4721个句子，我们的评估表明，rCARD中实现的基于最佳配置文件的方法实现了合理的WSD准确率，为88.8%（与支持向量机（SVM）的89.5%相当），并且不同WSD方法的时间成本也是可以接受的（在同一网络内从0.630毫秒到1.649毫秒不等）。初步用户研究还表明，rCARD带来的额外时间成本约为文档录入总时间的5%，并且用户在使用rCARD进行临床文档录入时并未感到明显延迟。

结论

该研究表明，将实时、启用NLP的缩写识别和消除歧义模块与临床文档系统集成是可行的。

相似文献

A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.

Appl Clin Inform. 2015 Jun 3;6(2):364-74. doi: 10.4338/ACI-2014-10-RA-0088. eCollection 2015.

A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD).

J Am Med Inform Assoc. 2017 Apr 1;24(e1):e79-e86. doi: 10.1093/jamia/ocw109.

Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations.

AMIA Annu Symp Proc. 2012;2012:1004-13. Epub 2012 Nov 3.

Towards Comprehensive Clinical Abbreviation Disambiguation Using Machine-Labeled Training Data.

AMIA Annu Symp Proc. 2017 Feb 10;2016:560-569. eCollection 2016.

Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.

BMC Bioinformatics. 2006 Jul 5;7:334. doi: 10.1186/1471-2105-7-334.

Automated disambiguation of acronyms and abbreviations in clinical texts: window and training size considerations.

AMIA Annu Symp Proc. 2012;2012:1310-9. Epub 2012 Nov 3.

Word Sense Disambiguation of clinical abbreviations with hyperdimensional computing.

AMIA Annu Symp Proc. 2013 Nov 16;2013:1007-16. eCollection 2013.

Disambiguating Clinical Abbreviations by One-to-All Classification: Algorithm Development and Validation Study.

JMIR Med Inform. 2024 Oct 1;12:e56955. doi: 10.2196/56955.

Determining the difficulty of Word Sense Disambiguation.

J Biomed Inform. 2014 Feb;47:83-90. doi: 10.1016/j.jbi.2013.09.009. Epub 2013 Sep 26.

A multi-aspect comparison study of supervised word sense disambiguation.

J Am Med Inform Assoc. 2004 Jul-Aug;11(4):320-31. doi: 10.1197/jamia.M1533. Epub 2004 Apr 2.

引用本文的文献

Disambiguating Clinical Abbreviations by One-to-All Classification: Algorithm Development and Validation Study.

JMIR Med Inform. 2024 Oct 1;12:e56955. doi: 10.2196/56955.

A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry.

Graefes Arch Clin Exp Ophthalmol. 2023 Nov;261(11):3335-3344. doi: 10.1007/s00417-023-06190-2. Epub 2023 Aug 3.

Leveraging Active Learning for Failure Mode Acquisition.

Sensors (Basel). 2023 Mar 4;23(5):2818. doi: 10.3390/s23052818.

Deciphering clinical abbreviations with a privacy protecting machine learning system.

Nat Commun. 2022 Dec 2;13(1):7456. doi: 10.1038/s41467-022-35007-9.

Defining Phenotypes from Clinical Data to Drive Genomic Research.

Annu Rev Biomed Data Sci. 2018 Jul;1:69-92. doi: 10.1146/annurev-biodatasci-080917-013335. Epub 2018 Apr 25.

A deep database of medical abbreviations and acronyms for natural language processing.

Sci Data. 2021 Jun 2;8(1):149. doi: 10.1038/s41597-021-00929-4.

Complexities, variations, and errors of numbering within clinical notes: the potential impact on information extraction and cohort-identification.

BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):75. doi: 10.1186/s12911-019-0784-1.

A method for harmonization of clinical abbreviation and acronym sense inventories.

J Biomed Inform. 2018 Dec;88:62-69. doi: 10.1016/j.jbi.2018.11.004. Epub 2018 Nov 7.

Clinical Natural Language Processing in 2015: Leveraging the Variety of Texts of Clinical Interest.

Yearb Med Inform. 2016 Nov 10(1):234-239. doi: 10.15265/IY-2016-049.

A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD).

J Am Med Inform Assoc. 2017 Apr 1;24(e1):e79-e86. doi: 10.1093/jamia/ocw109.

本文引用的文献

Automated disambiguation of acronyms and abbreviations in clinical texts: window and training size considerations.

AMIA Annu Symp Proc. 2012;2012:1310-9. Epub 2012 Nov 3.

Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations.

AMIA Annu Symp Proc. 2012;2012:1004-13. Epub 2012 Nov 3.

A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries.

AMIA Annu Symp Proc. 2012;2012:997-1003. Epub 2012 Nov 3.

Colometer: a real-time quality feedback system for screening colonoscopy.

World J Gastroenterol. 2012 Aug 28;18(32):4270-7. doi: 10.3748/wjg.v18.i32.4270.

Real-time clinical decision support system with data stream mining.

J Biomed Biotechnol. 2012;2012:580186. doi: 10.1155/2012/580186. Epub 2012 Jul 18.

A new clustering method for detecting rare senses of abbreviations in clinical notes.

J Biomed Inform. 2012 Dec;45(6):1075-83. doi: 10.1016/j.jbi.2012.06.003. Epub 2012 Jun 25.

A real-time screening alert improves patient recruitment efficiency.

AMIA Annu Symp Proc. 2011;2011:1489-98. Epub 2011 Oct 22.

Knowing we practise good medicine: implementing the electronic medical record in family practice.

Can Fam Physician. 2010 Jan;56(1):15-6, e1-3.

Medical abbreviations: writing little and communicating less.

Arch Dis Child. 2008 Oct;93(10):816-7. doi: 10.1136/adc.2008.141473.

A study of abbreviations in clinical notes.

AMIA Annu Symp Proc. 2007 Oct 11;2007:821-5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

实时临床缩写词消歧的初步研究

A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.

作者信息

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献