• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

临床叙述中的无监督缩写扩展

Unsupervised Abbreviation Expansion in Clinical Narratives.

作者信息

Oleynik Michel, Kreuzthaler Markus, Schulz Stefan

机构信息

Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.

出版信息

Stud Health Technol Inform. 2017;245:539-543.

PMID:29295153
Abstract

Clinical narratives are typically produced under time pressure, which incites the use of abbreviations and acronyms. To expand such short forms in a correct way eases text comprehension and further semantic processing. We propose a completely unsupervised and data-driven algorithm for the resolution of non-lexicalised and potentially ambiguous abbreviations. Based on the lookup of word bigrams and unigrams extracted from a corpus of 30,000 pseudonymised cardiology reports in German, our method achieved an F1 score of 0.91, evaluated with a test set of 200 text excerpts. The results are statistically significantly better (p < 0.001) than a baseline approach and show that a simple and domain-independent strategy may be enough to resolve abbreviations when a large corpus of similar texts is available. Further work is needed to combine this strategy with sentence and abbreviation detection modules, to adapt it to acronym resolution and to evaluate it with different datasets.

摘要

临床叙述通常是在时间压力下生成的,这促使人们使用缩写和首字母缩略词。以正确的方式展开这些简短形式有助于文本理解和进一步的语义处理。我们提出了一种完全无监督且数据驱动的算法,用于解决非词汇化且可能有歧义的缩写。基于从30000份德语假名化心脏病学报告语料库中提取的单词二元组和一元组的查找,我们的方法在200个文本摘录的测试集上评估,F1分数达到了0.91。结果在统计学上显著优于基线方法(p < 0.001),表明当有大量相似文本语料库时,一种简单且与领域无关的策略可能足以解决缩写问题。需要进一步开展工作,将该策略与句子和缩写检测模块相结合,使其适用于首字母缩略词解析,并使用不同的数据集进行评估。

相似文献

1
Unsupervised Abbreviation Expansion in Clinical Narratives.临床叙述中的无监督缩写扩展
Stud Health Technol Inform. 2017;245:539-543.
2
Detection of sentence boundaries and abbreviations in clinical narratives.临床叙述中句子边界和缩写的检测。
BMC Med Inform Decis Mak. 2015;15 Suppl 2(Suppl 2):S4. doi: 10.1186/1472-6947-15-S2-S4. Epub 2015 Jun 15.
3
Improving Layman Readability of Clinical Narratives with Unsupervised Synonym Replacement.通过无监督同义词替换提高临床叙述对非专业人士的可读性。
Stud Health Technol Inform. 2018;247:725-729.
4
Disambiguation of acronyms in clinical narratives with large language models.利用大型语言模型对临床叙述中的缩略语进行消歧。
J Am Med Inform Assoc. 2024 Sep 1;31(9):2040-2046. doi: 10.1093/jamia/ocae157.
5
An easily implemented method for abbreviation expansion for the medical domain in Japanese text. A preliminary study.一种用于日语医学文本领域缩写扩展的易于实现的方法。一项初步研究。
Methods Inf Med. 2013;52(1):51-61. doi: 10.3414/ME12-01-0040. Epub 2012 Dec 7.
6
Abbreviation and acronym disambiguation in clinical discourse.临床语篇中的缩写词和首字母缩略词消歧
AMIA Annu Symp Proc. 2005;2005:589-93.
7
Link-topic model for biomedical abbreviation disambiguation.用于生物医学缩写词消歧的链接主题模型
J Biomed Inform. 2015 Feb;53:367-80. doi: 10.1016/j.jbi.2014.12.013. Epub 2014 Dec 30.
8
Disambiguating Clinical Abbreviations by One-to-All Classification: Algorithm Development and Validation Study.通过一对一分类法对临床缩写进行消歧:算法开发和验证研究。
JMIR Med Inform. 2024 Oct 1;12:e56955. doi: 10.2196/56955.
9
Processing of Short-Form Content in Clinical Narratives: Systematic Scoping Review.临床叙事中短格式内容的处理:系统范围综述。
J Med Internet Res. 2024 Sep 26;26:e57852. doi: 10.2196/57852.
10
Disambiguation of Medical Abbreviations in French with Supervised Methods.使用监督方法消除法语医学缩写的歧义
Stud Health Technol Inform. 2021 May 27;281:313-317. doi: 10.3233/SHTI210171.

引用本文的文献

1
Clinical document corpora-real ones, translated and synthetic substitutes, and assorted domain proxies: a survey of diversity in corpus design, with focus on German text data.临床文档语料库——真实语料库、翻译语料库和合成替代语料库,以及各类领域替代语料库:语料库设计多样性调查,重点关注德语文本数据
JAMIA Open. 2025 May 14;8(3):ooaf024. doi: 10.1093/jamiaopen/ooaf024. eCollection 2025 Jun.
2
Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.医学概念规范化中的歧义:电子健康记录数据集的类型和覆盖范围分析。
J Am Med Inform Assoc. 2021 Mar 1;28(3):516-532. doi: 10.1093/jamia/ocaa269.
3
Language model-based automatic prefix abbreviation expansion method for biomedical big data analysis.
用于生物医学大数据分析的基于语言模型的自动前缀缩写扩展方法
Future Gener Comput Syst. 2019 Sep;98:238-251. doi: 10.1016/j.future.2019.01.016. Epub 2019 Mar 28.
4
Improving the Path from Diagnoses to Documentation: A Cognitive Review Tool for Clinical Notes and Administrative Records.改善从诊断到记录的流程:临床笔记和行政记录的认知审查工具
AMIA Annu Symp Proc. 2018 Dec 5;2018:518-526. eCollection 2018.