• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用交互式词对齐创建医学英语-瑞典语词典。

Creating a medical English-Swedish dictionary using interactive word alignment.

作者信息

Nyström Mikael, Merkel Magnus, Ahrenberg Lars, Zweigenbaum Pierre, Petersson Håkan, Ahlfeldt Hans

机构信息

Department of Biomedical Engineering, Linköpings universitet, SE-58185 Linköping, Sweden.

出版信息

BMC Med Inform Decis Mak. 2006 Oct 12;6:35. doi: 10.1186/1472-6947-6-35.

DOI:10.1186/1472-6947-6-35
PMID:17034649
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1624822/
Abstract

BACKGROUND

This paper reports on a parallel collection of rubrics from the medical terminology systems ICD-10, ICF, MeSH, NCSP and KSH97-P and its use for semi-automatic creation of an English-Swedish dictionary of medical terminology. The methods presented are relevant for many other West European language pairs than English-Swedish.

METHODS

The medical terminology systems were collected in electronic format in both English and Swedish and the rubrics were extracted in parallel language pairs. Initially, interactive word alignment was used to create training data from a sample. Then the training data were utilised in automatic word alignment in order to generate candidate term pairs. The last step was manual verification of the term pair candidates.

RESULTS

A dictionary of 31,000 verified entries has been created in less than three man weeks, thus with considerably less time and effort needed compared to a manual approach, and without compromising quality. As a side effect of our work we found 40 different translation problems in the terminology systems and these results indicate the power of the method for finding inconsistencies in terminology translations. We also report on some factors that may contribute to making the process of dictionary creation with similar tools even more expedient. Finally, the contribution is discussed in relation to other ongoing efforts in constructing medical lexicons for non-English languages.

CONCLUSION

In three man weeks we were able to produce a medical English-Swedish dictionary consisting of 31,000 entries and also found hidden translation errors in the utilized medical terminology systems.

摘要

背景

本文报告了从医学术语系统ICD - 10、ICF、MeSH、NCSP和KSH97 - P中并行收集的类目及其在半自动创建医学术语英 - 瑞典语词典中的应用。所介绍的方法适用于许多其他西欧语言对,而非仅限于英 - 瑞典语。

方法

以电子格式收集英语和瑞典语的医学术语系统,并提取平行语言对中的类目。最初,使用交互式单词对齐从样本中创建训练数据。然后将训练数据用于自动单词对齐,以生成候选术语对。最后一步是对候选术语对进行人工验证。

结果

在不到三人周的时间内创建了一个包含31,000个经过验证条目的词典,因此与手动方法相比所需的时间和精力大大减少,且不影响质量。作为我们工作的一个附带结果,我们在术语系统中发现了40个不同的翻译问题,这些结果表明该方法在发现术语翻译不一致方面的强大功能。我们还报告了一些可能有助于使用类似工具更便捷地创建词典过程中的因素。最后,讨论了与其他正在进行的为非英语语言构建医学词典的努力相关的贡献。

结论

在三人周内,我们能够制作一个包含31,000个条目的医学英 - 瑞典语词典,并在所用的医学术语系统中发现了隐藏的翻译错误。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb9/1624822/6c5e6857663d/1472-6947-6-35-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb9/1624822/e4b233ca5806/1472-6947-6-35-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb9/1624822/07c713bc2772/1472-6947-6-35-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb9/1624822/6c5e6857663d/1472-6947-6-35-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb9/1624822/e4b233ca5806/1472-6947-6-35-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb9/1624822/07c713bc2772/1472-6947-6-35-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edb9/1624822/6c5e6857663d/1472-6947-6-35-3.jpg

相似文献

1
Creating a medical English-Swedish dictionary using interactive word alignment.使用交互式词对齐创建医学英语-瑞典语词典。
BMC Med Inform Decis Mak. 2006 Oct 12;6:35. doi: 10.1186/1472-6947-6-35.
2
Creating a medical dictionary using word alignment: the influence of sources and resources.利用词对齐创建医学词典:来源与资源的影响
BMC Med Inform Decis Mak. 2007 Nov 23;7:37. doi: 10.1186/1472-6947-7-37.
3
Automatic lexeme acquisition for a multilingual medical subword thesaurus.用于多语言医学子词词典的自动词元获取。
Int J Med Inform. 2007 Feb-Mar;76(2-3):184-9. doi: 10.1016/j.ijmedinf.2006.05.032. Epub 2006 Jul 12.
4
Contribution to terminology internationalization by word alignment in parallel corpora.通过平行语料库中的词对齐对术语国际化的贡献。
AMIA Annu Symp Proc. 2006;2006:185-9.
5
Experiments with Cross-Language Information Retrieval on a Health Portal for Psychology and Psychotherapy.在一个心理学与心理治疗健康门户网站上进行跨语言信息检索的实验。
Stud Health Technol Inform. 2016;228:75-9.
6
Semi-automatic construction of the Chinese-English MeSH using Web-based term translation method.基于网络术语翻译方法的汉英医学主题词表半自动构建
AMIA Annu Symp Proc. 2005;2005:475-9.
7
Translating medical terminologies through word alignment in parallel text corpora.通过平行文本语料库中的词对齐来翻译医学术语。
J Biomed Inform. 2009 Aug;42(4):692-701. doi: 10.1016/j.jbi.2009.03.002. Epub 2009 Mar 9.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
MorphoSaurus--design and evaluation of an interlingua-based, cross-language document retrieval engine for the medical domain.形态恐龙——一种基于中间语言的医学领域跨语言文档检索引擎的设计与评估。
Methods Inf Med. 2005;44(4):537-45.
10
Adaptation of machine translation for multilingual information retrieval in the medical domain.医学领域中用于多语言信息检索的机器翻译适配
Artif Intell Med. 2014 Jul;61(3):165-85. doi: 10.1016/j.artmed.2014.01.004. Epub 2014 Feb 5.

引用本文的文献

1
Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes.用中文常用同义词丰富国际临床术语表,并在医生记录中进行概念识别。
BMC Med Inform Decis Mak. 2017 May 2;17(1):54. doi: 10.1186/s12911-017-0455-z.
2
Creating a medical dictionary using word alignment: the influence of sources and resources.利用词对齐创建医学词典:来源与资源的影响
BMC Med Inform Decis Mak. 2007 Nov 23;7:37. doi: 10.1186/1472-6947-7-37.
3
Contribution to terminology internationalization by word alignment in parallel corpora.

本文引用的文献

1
Enriching medical terminologies: an approach based on aligned corpora.丰富医学术语:一种基于对齐语料库的方法。
Stud Health Technol Inform. 2006;124:747-52.
2
Interchanging lexical information for a multilingual dictionary.为多语言词典交换词汇信息。
AMIA Annu Symp Proc. 2005;2005:31-5.
3
Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval.多语言医学术语的自动处理:在叙词表扩充和跨语言信息检索中的应用
通过平行语料库中的词对齐对术语国际化的贡献。
AMIA Annu Symp Proc. 2006;2006:185-9.
Artif Intell Med. 2005 Feb;33(2):111-24. doi: 10.1016/j.artmed.2004.07.015.
4
UMLF: a unified medical lexicon for French.UMLF:法语统一医学词汇表。
Int J Med Inform. 2005 Mar;74(2-4):119-24. doi: 10.1016/j.ijmedinf.2004.03.010.
5
UMLS language and vocabulary tools.统一医学语言系统的语言和词汇工具。
AMIA Annu Symp Proc. 2003;2003:798.
6
The German specialist lexicon.德国专业词典。
Proc AMIA Symp. 2002:884-8.
7
The connection between terms used in medical records and coding system: a study on Swedish primary health care data.医疗记录中使用的术语与编码系统之间的联系:一项关于瑞典初级卫生保健数据的研究。
Med Inform Internet Med. 2001 Apr-Jun;26(2):87-99.
8
Automatic extraction of linguistic knowledge from an international classification.从国际分类中自动提取语言知识。
Stud Health Technol Inform. 1998;52 Pt 1:581-5.
9
Desiderata for controlled medical vocabularies in the twenty-first century.21世纪受控医学词汇的必备条件。
Methods Inf Med. 1998 Nov;37(4-5):394-403.
10
Medical dictionaries for patient encoding systems: a methodology.用于患者编码系统的医学词典:一种方法
Artif Intell Med. 1998 Sep-Oct;14(1-2):201-14. doi: 10.1016/s0933-3657(98)00023-2.