Bell D S, Pattison-Gordon E, Greenes R A
Harvard Medical School, Brigham and Women's Hospital, Boston, MA, USA.
J Am Med Inform Assoc. 1994 May-Jun;1(3):249-62. doi: 10.1136/jamia.1994.95236156.
Development of methods for building concept models to support structured data entry and image retrieval in chest radiography.
An organizing model for chest-radiographic reporting was built by analyzing manually a set of natural-language chest-radiograph reports. During model building, clinician-informaticians judged alternative conceptual structures according to four criteria: content of clinically relevant detail, provision for semantic constraints, provision for canonical forms, and simplicity. The organizing model was applied in representing three sample reports in their entirety. To explore the potential for automatic model discovery, the representation of one sample report was compared with the noun phrases derived from the same report by the CLARIT natural-language processing system.
The organizing model for chest-radiographic reporting consists of 62 concept types and 17 relations, arranged in an inheritance network. The broadest types in the model include finding, anatomic locus, procedure, attribute, and status. Diagnoses are modeled as a subtype of finding. Representing three sample reports in their entirety added 79 narrower concept types. Some CLARIT noun phrases suggested valid associations among subtypes of finding, status, and anatomic locus.
A manual modeling process utilizing explicitly stated criteria for making modeling decisions produced an organizing model that showed consistency in early testing. A combination of top-down and bottom-up modeling was required. Natural-language processing may inform model building, but algorithms that would replace manual modeling were not discovered. Further progress in modeling will require methods for objective model evaluation and tools for formalizing the model-building process.
开发用于构建概念模型的方法,以支持胸部X光摄影中的结构化数据录入和图像检索。
通过人工分析一组自然语言的胸部X光报告,构建了一个用于胸部X光报告的组织模型。在模型构建过程中,临床信息学家根据四个标准判断替代概念结构:临床相关细节的内容、语义约束的提供、规范形式的提供以及简单性。该组织模型被应用于完整呈现三个示例报告。为了探索自动模型发现的潜力,将一个示例报告的表示与CLARIT自然语言处理系统从同一报告中派生的名词短语进行了比较。
胸部X光报告的组织模型由62种概念类型和17种关系组成,排列在一个继承网络中。模型中最宽泛的类型包括发现、解剖部位、程序、属性和状态。诊断被建模为发现的一个子类型。完整呈现三个示例报告增加了79种更具体的概念类型。一些CLARIT名词短语表明了发现、状态和解剖部位子类型之间的有效关联。
利用明确规定的建模决策标准进行的手动建模过程产生了一个在早期测试中显示出一致性的组织模型。需要自上而下和自下而上建模相结合。自然语言处理可能为模型构建提供信息,但尚未发现能够取代手动建模的算法。建模的进一步进展将需要客观模型评估方法和使模型构建过程形式化的工具。