• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于日语医学文本领域缩写扩展的易于实现的方法。一项初步研究。

An easily implemented method for abbreviation expansion for the medical domain in Japanese text. A preliminary study.

作者信息

Shinohara E Y, Aramaki E, Imai T, Miura Y, Tonoike M, Ohkuma T, Masuichi H, Ohe K

机构信息

Department of Planning, Information and Management, The University of Tokyo Hospital, Tokyo, Japan.

出版信息

Methods Inf Med. 2013;52(1):51-61. doi: 10.3414/ME12-01-0040. Epub 2012 Dec 7.

DOI:10.3414/ME12-01-0040
PMID:23223786
Abstract

BACKGROUND

One of the barriers for the effective use of computerized health-care related text is the ambiguity of abbreviations. To date, the task of disambiguating abbreviations has been treated as a classification task based on surrounding words. Application of this framework for languages that have no word boundaries requires pre-processing to segment a sentence into separate word sequences. While the segmentation processing is often a source of problem, it is unknown whether word information is really requisite for abbreviation expansion.

OBJECTIVES

The present study examined and compared abbreviation expansion methods with and without the incorporation of word information as a preliminary study.

METHODS

We implemented two abbreviation expansion methods: 1) a morpheme-based method that relied on word information and therefore required pre-processing, and 2) a character-based method that relied on simple character information. We compared the expansion accuracies for these two methods using eight medical abbreviations. Experimental data were automatically built as a pseudo-annotated corpus using the Internet.

RESULTS

As a result of the experiment, accuracies for the character-based method were from 0.890 to 0.942 while accuracies for the morpheme-based method were from 0.796 to 0.932. The character-based method significantly outperformed the morpheme-based method for three of the eight abbreviations (p < 0.05). For the remaining five abbreviations, no significant differences were found between the two methods.

CONCLUSIONS

Character information may be a good alternative in terms of simplicity to morphological information for abbreviation expansion in English medical abbreviations appeared in Japanese texts on the Internet.

摘要

背景

有效利用计算机化的医疗相关文本的障碍之一是缩写的歧义性。迄今为止,消除缩写歧义的任务一直被视为基于周围单词的分类任务。对于没有单词边界的语言,应用此框架需要进行预处理,将句子分割成单独的单词序列。虽然分割处理往往是问题的一个来源,但缩写扩展是否真的需要单词信息尚不清楚。

目的

本研究作为一项初步研究,检验并比较了纳入和未纳入单词信息的缩写扩展方法。

方法

我们实施了两种缩写扩展方法:1)基于词素的方法,该方法依赖单词信息,因此需要预处理;2)基于字符的方法,该方法依赖简单的字符信息。我们使用八个医学缩写比较了这两种方法的扩展准确率。实验数据通过互联网自动构建为一个伪注释语料库。

结果

实验结果显示,基于字符的方法的准确率在0.890至0.942之间,而基于词素的方法的准确率在0.796至0.932之间。在八个缩写中的三个上,基于字符的方法显著优于基于词素的方法(p < 0.05)。对于其余五个缩写,两种方法之间未发现显著差异。

结论

对于互联网上日语文本中出现的英语医学缩写的缩写扩展,就简单性而言,字符信息可能是形态信息的一个良好替代。

相似文献

1
An easily implemented method for abbreviation expansion for the medical domain in Japanese text. A preliminary study.一种用于日语医学文本领域缩写扩展的易于实现的方法。一项初步研究。
Methods Inf Med. 2013;52(1):51-61. doi: 10.3414/ME12-01-0040. Epub 2012 Dec 7.
2
Detection of sentence boundaries and abbreviations in clinical narratives.临床叙述中句子边界和缩写的检测。
BMC Med Inform Decis Mak. 2015;15 Suppl 2(Suppl 2):S4. doi: 10.1186/1472-6947-15-S2-S4. Epub 2015 Jun 15.
3
Disambiguating Clinical Abbreviations by One-to-All Classification: Algorithm Development and Validation Study.通过一对一分类法对临床缩写进行消歧:算法开发和验证研究。
JMIR Med Inform. 2024 Oct 1;12:e56955. doi: 10.2196/56955.
4
Link-topic model for biomedical abbreviation disambiguation.用于生物医学缩写词消歧的链接主题模型
J Biomed Inform. 2015 Feb;53:367-80. doi: 10.1016/j.jbi.2014.12.013. Epub 2014 Dec 30.
5
Towards Comprehensive Clinical Abbreviation Disambiguation Using Machine-Labeled Training Data.利用机器标注训练数据实现临床缩写词的全面消歧
AMIA Annu Symp Proc. 2017 Feb 10;2016:560-569. eCollection 2016.
6
Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles.使用MEDLINE作为知识来源来消除全文生物医学期刊文章中缩写词和首字母缩略词的歧义。
J Biomed Inform. 2007 Apr;40(2):150-9. doi: 10.1016/j.jbi.2006.06.001. Epub 2006 Jun 7.
7
A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD).从冗长表述到简短缩写的漫长历程:开发一个用于临床缩写识别与消歧的开源框架(CARD)
J Am Med Inform Assoc. 2017 Apr 1;24(e1):e79-e86. doi: 10.1093/jamia/ocw109.
8
A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.实时临床缩写词消歧的初步研究
Appl Clin Inform. 2015 Jun 3;6(2):364-74. doi: 10.4338/ACI-2014-10-RA-0088. eCollection 2015.
9
Abbreviation and acronym disambiguation in clinical discourse.临床语篇中的缩写词和首字母缩略词消歧
AMIA Annu Symp Proc. 2005;2005:589-93.
10
Unsupervised Abbreviation Expansion in Clinical Narratives.临床叙述中的无监督缩写扩展
Stud Health Technol Inform. 2017;245:539-543.

引用本文的文献

1
Clinical Natural Language Processing in languages other than English: opportunities and challenges.非英语语言的临床自然语言处理:机遇与挑战。
J Biomed Semantics. 2018 Mar 30;9(1):12. doi: 10.1186/s13326-018-0179-8.