Lovis C, Baud R, Rassinoux A M, Michel P A, Scherrer J R
Department of Internal Medicine, University State Hospital of Geneva, Switzerland.
Artif Intell Med. 1998 Sep-Oct;14(1-2):201-14. doi: 10.1016/s0933-3657(98)00023-2.
Medical language is highly compositional and makes extensive use of common roots, especially Latino-Greek roots. Besides words devoted to common sense, medical language presents some typical characteristics, especially on morphological and semantic aspects of word formation. Morphological decomposition and identification precedes semantic analysis. It is only when these two prerequisites are fulfilled that an attempt to grasp the meaning of a whole expression is made possible. The main aim of the proposed approach is that of coping with 'the lack of coverage of the medical lexical knowledge', in order to help physicians find the correct international classification for diseases (ICD) codes for a written diagnosis. The proposed methodology allows the development of a powerful dynamic dictionary dedicated to natural language processing in the field of diagnoses and narrative procedures. It describes the design of an analyser that can profit from a dictionary. The methods used have proved to be efficient for various classifications, s well as for multiple languages, as the system presently supports French, German, English and Dutch for ICD-9 and ICD-10 classifications.
医学语言具有高度的组合性,大量使用通用词根,尤其是拉丁 - 希腊词根。除了用于常识的词汇外,医学语言还呈现出一些典型特征,特别是在构词的形态和语义方面。形态分解和识别先于语义分析。只有当这两个前提条件都满足时,才有可能尝试理解整个表达式的含义。所提出方法的主要目的是应对“医学词汇知识覆盖不足”的问题,以帮助医生为书面诊断找到正确的国际疾病分类(ICD)代码。所提出的方法允许开发一个强大的动态词典,用于诊断和叙述程序领域的自然语言处理。它描述了一种可以从词典中受益的分析器的设计。所使用的方法已被证明对于各种分类以及多种语言都是有效的,因为该系统目前支持法语、德语、英语和荷兰语的ICD - 9和ICD - 10分类。