Suppr超能文献

在可比的医学语料库中寻找法英翻译。

Looking for French-English translations in comparable medical corpora.

作者信息

Chiao Yun-Chuang, Zweigenbaum P

机构信息

STIM/DSI, Assistance Publique - Hôpitaux de Paris, Paris Cedex 13, 75634, France. [ycc, pz]@biomath.jussieu.fr

出版信息

Proc AMIA Symp. 2002:150-4.

Abstract

Cross-language retrieval of medical information needs to translate input queries into target language queries. It must be prepared to cope with 'new' words not yet listed in a multilingual lexicon. We address the issue of finding translational equivalents of such 'unknown' words from French to English in the medical domain. We rely on non-parallel, comparable corpora and an initial bilingual medical lexicon. We compare the distributional contexts of source and target words, testing several weighting factors and similarity measures. For the best combination (the Jaccard similarity measure with or without weighting), the correct translation is found in the top 10 candidates for more than 60% of the test words. This shows the potential of this technique to help extending bilingual medical lexicons.

摘要

医学信息的跨语言检索需要将输入查询翻译为目标语言查询。它必须准备好应对多语言词典中尚未列出的“新”词。我们解决了在医学领域中寻找从法语到英语的此类“未知”词的翻译对等词的问题。我们依赖于非平行的、可比较的语料库和一个初始的双语医学词典。我们比较源词和目标词的分布语境,测试了几个加权因子和相似度度量。对于最佳组合(带或不带加权的杰卡德相似度度量),超过60%的测试词在排名前10的候选词中找到了正确的翻译。这表明了该技术在帮助扩展双语医学词典方面的潜力。

相似文献

4
Automatic lexeme acquisition for a multilingual medical subword thesaurus.
Int J Med Inform. 2007 Feb-Mar;76(2-3):184-9. doi: 10.1016/j.ijmedinf.2006.05.032. Epub 2006 Jul 12.
5
Evaluating a pivot-based approach for bilingual lexicon extraction.
Comput Intell Neurosci. 2015;2015:434153. doi: 10.1155/2015/434153. Epub 2015 Apr 23.
8
Translating medical terminologies through word alignment in parallel text corpora.
J Biomed Inform. 2009 Aug;42(4):692-701. doi: 10.1016/j.jbi.2009.03.002. Epub 2009 Mar 9.

本文引用的文献

1
CISMeF: a structured health resource guide.
Methods Inf Med. 2000 Mar;39(1):30-5.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验