Conway Mike, Khojoyan Artem, Fana Fariba, Scuba William, Castine Melissa, Mowery Danielle, Chapman Wendy, Jupp Simon
Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, UT 84108, United States.
Independent software developer, Kyiv, Ukraine.
J Biomed Semantics. 2016 Apr 4;7:5. doi: 10.1186/s13326-015-0043-z. eCollection 2016.
The Simple Knowledge Organization System (SKOS) was introduced to the wider research community by a 2005 World Wide Web Consortium (W3C) working draft, and further developed and refined in a 2009 W3C recommendation. Since then, SKOS has become the de facto standard for representing and sharing thesauri, lexicons, vocabularies, taxonomies, and classification schemes. In this paper, we describe a web-based, free, open-source SKOS editor built for the development, curation, and management of small to medium-sized lexicons for health-related Natural Language Processing (NLP).
The web-based SKOS editor allows users to create, curate, version, manage, and visualise SKOS resources. We tested the system against five widely used, publicly available SKOS vocabularies of various sizes and found that the editor is suitable for the development and management of small to medium-sized lexicons. Qualitative testing has focussed on using the editor to develop lexical resources to drive NLP applications in two domains: first, a lexicon to support an Electronic Health Record-based NLP system for the automatic identification of pneumonia symptoms; second, a taxonomy of lexical cues associated with Diagnostic and Statistical Manual of Mental Disorders (DSM-5) diagnoses, with the goal of facilitating the automatic identification of symptoms associated with depression from short, informal texts.
The SKOS editor we have developed is, to the best of our knowledge, the first free, open-source, web-based SKOS editor capable of creating, curating, versioning, managing, and visualising SKOS lexicons.