Baud R H, Rassinoux A M, Scherrer J R
Centre d'Informatique Hospitalière, University State Hospital of Geneva, Switzerland.
Methods Inf Med. 1992 Jun;31(2):117-25.
For medical records, the challenge for the present decade is Natural Language Processing (NLP) of texts, and the construction of an adequate Knowledge Representation. This article describes the components of an NLP system, which is currently being developed in the Geneva Hospital, and within the European Community's AIM programme. They are: a Natural Language Analyser, a Conceptual Graphs Builder, a Data Base Storage component, a Query Processor, a Natural Language Generator and, in addition, a Translator, a Diagnosis Encoding System and a Literature Indexing System. Taking advantage of a closed domain of knowledge, defined around a medical specialty, a method called proximity processing has been developed. In this situation no parser of the initial text is needed, and the system is based on semantical information of near words in sentences. The benefits are: easy implementation, portability between languages, robustness towards badly-formed sentences, and a sound representation using conceptual graphs.
对于医疗记录而言,当前十年面临的挑战是文本的自然语言处理(NLP)以及构建适当的知识表示。本文描述了一个正在日内瓦医院以及欧洲共同体的AIM计划中开发的NLP系统的组成部分。它们是:一个自然语言分析器、一个概念图构建器、一个数据库存储组件、一个查询处理器、一个自然语言生成器,此外还有一个翻译器、一个诊断编码系统和一个文献索引系统。利用围绕医学专业定义的封闭知识领域,开发了一种称为邻近处理的方法。在这种情况下,不需要对初始文本进行解析器,并且该系统基于句子中相邻单词的语义信息。其优点是:易于实现、语言之间的可移植性、对格式错误句子的鲁棒性以及使用概念图的合理表示。