Halper Michael, Gu Huanying, Perl Yehoshua, Ochs Christopher
Information Technology Department, New Jersey Institute of Technology, Newark, NJ 07102, USA.
Computer Science Department, New York Institute of Technology, New York, NY 10023, USA.
Artif Intell Med. 2015 May;64(1):1-16. doi: 10.1016/j.artmed.2015.03.005. Epub 2015 Apr 2.
Terminologies and terminological systems have assumed important roles in many medical information processing environments, giving rise to the "big knowledge" challenge when terminological content comprises tens of thousands to millions of concepts arranged in a tangled web of relationships. Use and maintenance of knowledge structures on that scale can be daunting. The notion of abstraction network is presented as a means of facilitating the usability, comprehensibility, visualization, and quality assurance of terminologies.
An abstraction network overlays a terminology's underlying network structure at a higher level of abstraction. In particular, it provides a more compact view of the terminology's content, avoiding the display of minutiae. General abstraction network characteristics are discussed. Moreover, the notion of meta-abstraction network, existing at an even higher level of abstraction than a typical abstraction network, is described for cases where even the abstraction network itself represents a case of "big knowledge." Various features in the design of abstraction networks are demonstrated in a methodological survey of some existing abstraction networks previously developed and deployed for a variety of terminologies.
The applicability of the general abstraction-network framework is shown through use-cases of various terminologies, including the Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT), the Medical Entities Dictionary (MED), and the Unified Medical Language System (UMLS). Important characteristics of the surveyed abstraction networks are provided, e.g., the magnitude of the respective size reduction referred to as the abstraction ratio. Specific benefits of these alternative terminology-network views, particularly their use in terminology quality assurance, are discussed. Examples of meta-abstraction networks are presented.
The "big knowledge" challenge constitutes the use and maintenance of terminological structures that comprise tens of thousands to millions of concepts and their attendant complexity. The notion of abstraction network has been introduced as a tool in helping to overcome this challenge, thus enhancing the usefulness of terminologies. Abstraction networks have been shown to be applicable to a variety of existing biomedical terminologies, and these alternative structural views hold promise for future expanded use with additional terminologies.
术语和术语系统在许多医学信息处理环境中发挥着重要作用,当术语内容包含数以万计到数百万个以错综复杂的关系网络排列的概念时,便引发了“大知识”挑战。在如此规模上使用和维护知识结构可能令人望而生畏。本文提出抽象网络的概念,作为一种促进术语的可用性、可理解性、可视化及质量保证的手段。
抽象网络在更高的抽象层次上覆盖术语的基础网络结构。具体而言,它提供了术语内容更简洁的视图,避免了细节的展示。讨论了抽象网络的一般特征。此外,对于即使抽象网络本身也代表“大知识”情况的案例,描述了元抽象网络的概念,其存在于比典型抽象网络更高的抽象层次。在对先前为各种术语开发和部署的一些现有抽象网络的方法学调查中,展示了抽象网络设计中的各种特征。
通过各种术语的用例,包括医学系统命名法 - 临床术语(SNOMED CT)、医学实体词典(MED)和统一医学语言系统(UMLS),展示了一般抽象网络框架的适用性。提供了所调查抽象网络的重要特征,例如被称为抽象率的各自规模缩减幅度。讨论了这些替代术语网络视图的具体益处,特别是它们在术语质量保证中的应用。给出了元抽象网络的示例。
“大知识”挑战在于使用和维护包含数以万计到数百万个概念及其伴随复杂性的术语结构。引入抽象网络的概念作为帮助克服这一挑战的工具,从而提高术语的实用性。抽象网络已被证明适用于各种现有的生物医学术语,并且这些替代结构视图有望在未来与更多术语一起得到更广泛的应用。