Névéol Aurélie, Mork James G, Aronson Alan R, Darmoni Stéfan J
Laboratoire PSI-FRE 2645 CNRS -INSA de Rouen, France.
AMIA Annu Symp Proc. 2005;2005:565-9.
This paper presents the evaluation of two MeSH indexing systems for French and English on a parallel corpus.
We describe two automatic MeSH in-dexing systems - MTI for English, and MAIF for French. The French version of the evaluation resources has been manually indexed with MeSH keyword/qualifier pairs. This professional indexing is used as our gold standard in the evaluation of both systems on keyword retrieval.
The English system (MTI) obtains significantly better precision and recall (78% precision and 21% recall at rank 1, vs. 37%. precision and 6% recall for MAIF ). Moreover, the performance of both systems can be optimised by the break-age function used by the French system (MAIF), which selects an adaptive number of descriptors for each resource indexed.
MTI achieves better performance. However, both systems have features that can benefit each other.
本文介绍了在一个平行语料库上对两种用于法语和英语的医学主题词表(MeSH)索引系统的评估。
我们描述了两种自动MeSH索引系统——用于英语的MTI和用于法语的MAIF。评估资源的法语版本已使用MeSH关键词/限定词对进行了人工索引。这种专业索引在两个系统的关键词检索评估中用作我们的黄金标准。
英语系统(MTI)获得了显著更高的精确率和召回率(在排名第1时精确率为78%,召回率为21%,而MAIF的精确率为37%,召回率为6%)。此外,两种系统的性能都可以通过法语系统(MAIF)使用的断词函数进行优化,该函数为每个索引资源选择自适应数量的描述符。
MTI性能更佳。然而,两种系统都有可相互受益的特性。