Rocha R A, Huff S M
Department of Medical Informatics, University of Utah 84112.
Proc Annu Symp Comput Appl Med Care. 1994:172-6.
A program for matching between controlled medical vocabularies has been developed using methods from the field of information retrieval. The program combines a stemmer based on word fragments (digrams) with a similarity function. The proposed stemmer required no knowledge of word-formation rules and helped identify several kinds of word variants. The similarity function assigned the highest score to the best candidate match in 99.0% of cases.
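The general idea of digram-based matching can be illustrated with a minimal sketch. The abstract does not say which similarity function the authors used, so the Dice coefficient below is an assumption chosen for illustration. The function names (`digrams`, `dice`, `term_similarity`, `best_match`) and the sample terms are likewise hypothetical and are not taken from the paper.

```python
def digrams(word):
    """Return the set of adjacent two-letter fragments in a word."""
    w = word.lower()
    return {w[i:i + 2] for i in range(len(w) - 1)}


def dice(a, b):
    """Dice coefficient over digram sets (assumed here, not the paper's stated function)."""
    da, db = digrams(a), digrams(b)
    if not da or not db:
        # One-letter or empty words have no digrams; fall back to exact comparison.
        return 1.0 if a.lower() == b.lower() else 0.0
    return 2 * len(da & db) / (len(da) + len(db))


def term_similarity(query, candidate):
    """Average, over the query words, of the best digram score against any candidate word."""
    q_words, c_words = query.split(), candidate.split()
    if not q_words or not c_words:
        return 0.0
    return sum(max(dice(q, c) for c in c_words) for q in q_words) / len(q_words)


def best_match(query, candidates):
    """Return the candidate term with the highest similarity to the query."""
    return max(candidates, key=lambda c: term_similarity(query, c))


if __name__ == "__main__":
    # Hypothetical target vocabulary, for illustration only.
    vocabulary = ["femoral fracture", "renal failure", "fever"]
    print(best_match("fractured femur", vocabulary))
```

Because the comparison works on letter pairs rather than on suffix rules, variants such as "fracture" and "fractured" score highly without any knowledge of how the words were formed, which is the property the abstract attributes to the digram stemmer.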