规范首字母缩略词以帮助患者理解临床文本：2013年共享/交叉语言评估论坛电子健康挑战赛，任务2

Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2.

作者信息

Mowery Danielle L, South Brett R, Christensen Lee, Leng Jianwei, Peltonen Laura-Maria, Salanterä Sanna, Suominen Hanna, Martinez David, Velupillai Sumithra, Elhadad Noémie, Savova Guergana, Pradhan Sameer, Chapman Wendy W

机构信息

Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, USA.

Nursing Science, University of Turku, and Turku University Hospital, Turku, Finland.

出版信息

J Biomed Semantics. 2016 Jul 1;7:43. doi: 10.1186/s13326-016-0084-y.

DOI:10.1186/s13326-016-0084-y

PMID:27370271

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4930590/

Abstract

BACKGROUND

The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the Unified Medical Language System. This reference standard can be used to improve patient understanding by linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more simplified, lay term.

METHODS

In this study, we evaluate 1) accuracy of participating systems' normalizing short forms compared to a majority sense baseline approach, 2) performance of participants' systems for short forms with variable majority sense distributions, and 3) report the accuracy of participating systems' normalizing shared normalized concepts between the test set and the Consumer Health Vocabulary, a vocabulary of lay medical terms.

RESULTS

The best systems submitted by the five participating teams performed with accuracies ranging from 43 to 72 %. A majority sense baseline approach achieved the second best performance. The performance of participating systems for normalizing short forms with two or more senses with low ambiguity (majority sense greater than 80 %) ranged from 52 to 78 % accuracy, with two or more senses with moderate ambiguity (majority sense between 50 and 80 %) ranged from 23 to 57 % accuracy, and with two or more senses with high ambiguity (majority sense less than 50 %) ranged from 2 to 45 % accuracy. With respect to the ShARe test set, 69 % of short form annotations contained common concept unique identifiers with the Consumer Health Vocabulary. For these 2594 possible annotations, the performance of participating systems ranged from 50 to 75 % accuracy.

CONCLUSION

Short form normalization continues to be a challenging problem. Short form normalization systems perform with moderate to reasonable accuracies. The Consumer Health Vocabulary could enrich its knowledge base with missed concept unique identifiers from the ShARe test set to further support patient understanding of unfamiliar medical terms.

摘要

背景

ShARe/CLEF电子健康挑战实验室旨在推动自然语言处理和信息检索技术的发展，以帮助患者理解其临床报告。在临床文本中，首字母缩略词和缩写词（也称为简称）可能让患者难以理解。对于2013年三项共享任务之一（任务2），我们生成了一个标准化为统一医学语言系统的临床简称参考标准。该参考标准可通过链接到带有注释简称的通俗易懂描述的网络资源，或用更简化的通俗术语替换简称，来提高患者的理解能力。

方法

在本研究中，我们评估了：1）与多数语义基线方法相比，参与系统对简称进行标准化的准确性；2）参与系统对具有可变多数语义分布的简称的性能；3）报告参与系统在测试集和消费者健康词汇表（一个通俗医学术语词汇表）之间对共享标准化概念进行标准化的准确性。

结果

五个参与团队提交的最佳系统的准确率在43%至72%之间。多数语义基线方法的性能排名第二。参与系统对具有两种或更多低歧义语义（多数语义大于80%）的简称进行标准化的准确率在52%至78%之间，对具有两种或更多中等歧义语义（多数语义在50%至80%之间）的简称进行标准化的准确率在23%至57%之间，对具有两种或更多高歧义语义（多数语义小于50%）的简称进行标准化的准确率在2%至45%之间。关于ShARe测试集，69%的简称注释包含与消费者健康词汇表的通用概念唯一标识符。对于这2594个可能的注释，参与系统的准确率在50%至75%之间。

结论

简称标准化仍然是一个具有挑战性的问题。简称标准化系统的准确率从中等到合理。消费者健康词汇表可以通过从ShARe测试集中遗漏的概念唯一标识符来丰富其知识库，以进一步支持患者对不熟悉医学术语的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c808/4930590/74a83cfadc18/13326_2016_84_Fig1_HTML.jpg

相似文献

Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2.规范首字母缩略词以帮助患者理解临床文本：2013年共享/交叉语言评估论坛电子健康挑战赛，任务2

J Biomed Semantics. 2016 Jul 1;7:43. doi: 10.1186/s13326-016-0084-y.

The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.2019 年全国自然语言处理（NLP）临床挑战（n2c2）/开放健康自然语言处理（OHNLP）临床记录临床概念规范化共享任务。

J Am Med Inform Assoc. 2020 Oct 1;27(10):1529-1537. doi: 10.1093/jamia/ocaa106.

Evaluating the state of the art in disorder recognition and normalization of the clinical narrative.评估临床病历中疾病识别和规范化的当前技术水平。

J Am Med Inform Assoc. 2015 Jan;22(1):143-54. doi: 10.1136/amiajnl-2013-002544. Epub 2014 Aug 21.

A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources.使用临床笔记和医学词典资源创建的临床缩写和首字母缩略词感知清单。

J Am Med Inform Assoc. 2014 Mar-Apr;21(2):299-307. doi: 10.1136/amiajnl-2012-001506. Epub 2013 Jun 27.

Challenges in clinical natural language processing for automated disorder normalization.临床自然语言处理中自动疾病标准化的挑战。

J Biomed Inform. 2015 Oct;57:28-37. doi: 10.1016/j.jbi.2015.07.010. Epub 2015 Jul 14.

Clinical Information Extraction at the CLEF eHealth Evaluation lab 2016.2016年CLEF电子健康评估实验室的临床信息提取

CEUR Workshop Proc. 2016 Sep;1609:28-42.

Distinction between medical and non-medical usages of short forms in clinical narratives.临床记录中缩写词医学用法与非医学用法的区分。

AMIA Annu Symp Proc. 2018 Apr 16;2017:1302-1311. eCollection 2017.

Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.生物医学领域中的机器学习与词义消歧：设计与评估问题

BMC Bioinformatics. 2006 Jul 5;7:334. doi: 10.1186/1471-2105-7-334.

Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.从与表型特别相关的生物医学文本中生成银标准概念注释。

PLoS One. 2015 Jan 21;10(1):e0116040. doi: 10.1371/journal.pone.0116040. eCollection 2015.

Normalizing clinical terms using learned edit distance patterns.使用学习到的编辑距离模式对临床术语进行规范化。

J Am Med Inform Assoc. 2016 Mar;23(2):380-6. doi: 10.1093/jamia/ocv108. Epub 2015 Jul 31.

引用本文的文献

Word sense disambiguation of acronyms in clinical narratives.临床叙述中首字母缩略词的词义消歧

Front Digit Health. 2024 Feb 28;6:1282043. doi: 10.3389/fdgth.2024.1282043. eCollection 2024.

A scoping review of publicly available language tasks in clinical natural language processing.临床自然语言处理中公开可用语言任务的范围综述

J Am Med Inform Assoc. 2022 Sep 12;29(10):1797-1806. doi: 10.1093/jamia/ocac127.

Clinical Term Normalization Using Learned Edit Patterns and Subconcept Matching: System Development and Evaluation.使用学习到的编辑模式和子概念匹配进行临床术语标准化：系统开发与评估

JMIR Med Inform. 2021 Jan 14;9(1):e23104. doi: 10.2196/23104.

Annotation and extraction of age and temporally-related events from clinical histories.从临床病历中注释和提取年龄及时间相关事件。

BMC Med Inform Decis Mak. 2020 Dec 30;20(Suppl 11):338. doi: 10.1186/s12911-020-01333-5.

Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.医学概念规范化中的歧义：电子健康记录数据集的类型和覆盖范围分析。

J Am Med Inform Assoc. 2021 Mar 1;28(3):516-532. doi: 10.1093/jamia/ocaa269.

本文引用的文献

J Am Med Inform Assoc. 2014 Mar-Apr;21(2):299-307. doi: 10.1136/amiajnl-2012-001506. Epub 2013 Jun 27.

Patient access to electronic health record: a comparative study on laws, policies and procedures in selected countries.患者获取电子健康记录：对部分国家法律、政策和程序的比较研究。

Med Arch. 2013;67(1):63-7. doi: 10.5455/medarh.2013.67.63-67.

Automated disambiguation of acronyms and abbreviations in clinical texts: window and training size considerations.临床文本中首字母缩略词和缩写词的自动消歧：窗口与训练规模考量

AMIA Annu Symp Proc. 2012;2012:1310-9. Epub 2012 Nov 3.

A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries.当前临床自然语言处理系统在处理出院小结中缩写词方面的比较研究。

AMIA Annu Symp Proc. 2012;2012:997-1003. Epub 2012 Nov 3.

Towards an international electronic repository and virtual laboratory of open data and open-source software for telehealth research: comparison of international, Australian and Finnish privacy policies.迈向远程医疗研究开放数据与开源软件的国际电子知识库及虚拟实验室：国际、澳大利亚和芬兰隐私政策比较

Stud Health Technol Inform. 2012;182:153-60.

Inviting patients to read their doctors' notes: a quasi-experimental study and a look ahead.邀请患者阅读医生的记录：一项准实验研究及前瞻性观察。

Ann Intern Med. 2012 Oct 2;157(7):461-70. doi: 10.7326/0003-4819-157-7-201210020-00002.

Patient understanding of emergency department discharge instructions: where are knowledge deficits greatest?患者对急诊科出院医嘱的理解：知识缺陷最大的地方在哪里？

Acad Emerg Med. 2012 Sep;19(9):E1035-44. doi: 10.1111/j.1553-2712.2012.01425.x.

Using UMLS lexical resources to disambiguate abbreviations in clinical text.利用统一医学语言系统（UMLS）词汇资源消除临床文本中的缩写歧义。

AMIA Annu Symp Proc. 2011;2011:715-22. Epub 2011 Oct 22.

Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions.克服临床文本自然语言处理的障碍：共享任务的作用及对其他创造性解决方案的需求。

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):540-3. doi: 10.1136/amiajnl-2011-000465.

2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛：临床文本中的概念、断言和关系

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

规范首字母缩略词以帮助患者理解临床文本：2013年共享/交叉语言评估论坛电子健康挑战赛，任务2

Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献