Looking for French-English translations in comparable medical corpora.

Authors

Chiao Yun-Chuang, Zweigenbaum P

Affiliation

STIM/DSI, Assistance Publique - Hôpitaux de Paris, Paris Cedex 13, 75634, France. [ycc, pz]@biomath.jussieu.fr

Publication

Proc AMIA Symp. 2002:150-4.

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2244154/

Abstract

Cross-language retrieval of medical information requires translating input queries into target-language queries. It must be prepared to cope with 'new' words not yet listed in a multilingual lexicon. We address the issue of finding translational equivalents of such 'unknown' words from French to English in the medical domain. We rely on non-parallel, comparable corpora and an initial bilingual medical lexicon. We compare the distributional contexts of source and target words, testing several weighting factors and similarity measures. For the best combination (the Jaccard similarity measure with or without weighting), the correct translation is found in the top 10 candidates for more than 60% of the test words. This shows the potential of this technique to help extend bilingual medical lexicons.
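The approach summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact formulas, so the window size, the use of raw co-occurrence counts as weights, the min/max form of the weighted Jaccard measure, the function names, and the toy corpora and seed lexicon are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def context_vectors(sentences, window=3):
    """Count, for each word, the words seen within `window` tokens of it."""
    vectors = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def translate_vector(vector, lexicon):
    """Map a source context vector into the target language via the seed lexicon.

    Context words missing from the lexicon are dropped.
    """
    translated = Counter()
    for word, count in vector.items():
        for target in lexicon.get(word, ()):
            translated[target] += count
    return translated


def weighted_jaccard(u, v):
    """Weighted Jaccard: sum of minima over sum of maxima (assumed variant)."""
    keys = set(u) | set(v)
    denom = sum(max(u.get(k, 0), v.get(k, 0)) for k in keys)
    if denom == 0:
        return 0.0
    return sum(min(u.get(k, 0), v.get(k, 0)) for k in keys) / denom


def rank_candidates(source_word, src_vectors, tgt_vectors, lexicon, top_n=10):
    """Rank target words by similarity to the translated source context."""
    translated = translate_vector(src_vectors[source_word], lexicon)
    scored = [(w, weighted_jaccard(translated, vec)) for w, vec in tgt_vectors.items()]
    scored.sort(key=lambda pair: (-pair[1], pair[0]))  # best score first, ties by word
    return scored[:top_n]


# Toy comparable corpora (hypothetical, not taken from the paper).
french = [
    ["patient", "céphalée", "sévère"],
    ["céphalée", "douleur", "traitement"],
]
english = [
    ["patient", "headache", "severe"],
    ["headache", "pain", "treatment"],
    ["patient", "fracture", "treatment"],
]
# Toy seed lexicon; "céphalée" is the 'unknown' word to translate.
lexicon = {
    "patient": ["patient"],
    "sévère": ["severe"],
    "douleur": ["pain"],
    "traitement": ["treatment"],
}

ranked = rank_candidates(
    "céphalée", context_vectors(french), context_vectors(english), lexicon
)
for word, score in ranked:
    print(f"{word}\t{score:.3f}")
```

On this toy data, "headache" ranks first because its English context matches the translated French context of "céphalée" exactly. Real comparable corpora are far noisier, which is presumably why the paper reports results over the top 10 candidates rather than the single best match.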

Similar articles

1

Looking for French-English translations in comparable medical corpora.

Proc AMIA Symp. 2002:150-4.

2

Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval.

Artif Intell Med. 2005 Feb;33(2):111-24. doi: 10.1016/j.artmed.2004.07.015.

3

The effect of a general lexicon in corpus-based identification of French-English medical word translations.

Stud Health Technol Inform. 2003;95:397-402.

4

Automatic lexeme acquisition for a multilingual medical subword thesaurus.

Int J Med Inform. 2007 Feb-Mar;76(2-3):184-9. doi: 10.1016/j.ijmedinf.2006.05.032. Epub 2006 Jul 12.

5

Evaluating a pivot-based approach for bilingual lexicon extraction.

Comput Intell Neurosci. 2015;2015:434153. doi: 10.1155/2015/434153. Epub 2015 Apr 23.

6

Aligning words in French-English non-parallel medical texts: effect of term frequency distributions.

Stud Health Technol Inform. 2004;107(Pt 1):23-7.

7

Experiments in cross-language medical information retrieval using a mixing translation module.

Stud Health Technol Inform. 2004;107(Pt 2):946-9.

8

Translating medical terminologies through word alignment in parallel text corpora.

J Biomed Inform. 2009 Aug;42(4):692-701. doi: 10.1016/j.jbi.2009.03.002. Epub 2009 Mar 9.

9

Lost in translation? A multilingual Query Builder improves the quality of PubMed queries: a randomised controlled trial.

BMC Med Inform Decis Mak. 2017 Jul 3;17(1):94. doi: 10.1186/s12911-017-0490-9.

10

MorphoSaurus--design and evaluation of an interlingua-based, cross-language document retrieval engine for the medical domain.

Methods Inf Med. 2005;44(4):537-45.

Cited by

1

Translating the Foundational Model of Anatomy into French using knowledge-based and lexical methods.

BMC Med Inform Decis Mak. 2011 Oct 26;11:65. doi: 10.1186/1472-6947-11-65.

2

A twofold strategy for translating a medical terminology into French.

AMIA Annu Symp Proc. 2010 Nov 13;2010:152-6.

3

Towards a multilingual medical lexicon.

AMIA Annu Symp Proc. 2006;2006:534-8.

4

Contribution to terminology internationalization by word alignment in parallel corpora.

AMIA Annu Symp Proc. 2006;2006:185-9.

5

Semi-automatic construction of the Chinese-English MeSH using Web-based term translation method.

AMIA Annu Symp Proc. 2005;2005:475-9.

6

Interchanging lexical information for a multilingual dictionary.

AMIA Annu Symp Proc. 2005;2005:31-5.

7

Corpus-based associations provide additional morphological variants to medical terminologies.

AMIA Annu Symp Proc. 2003;2003:768-72.

References

1

CISMeF: a structured health resource guide.

Methods Inf Med. 2000 Mar;39(1):30-5.
