Ji Xiaonan, Ritter Alan, Yen Po-Yin
Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA; Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, USA.
Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, USA.
J Biomed Inform. 2017 May;69:33-42. doi: 10.1016/j.jbi.2017.03.007. Epub 2017 Mar 14.
Systematic Reviews (SRs) are utilized to summarize evidence from high quality studies and are considered the preferred source of evidence-based practice (EBP). However, conducting SRs can be time and labor intensive due to the high cost of article screening. In previous studies, we demonstrated utilizing established (lexical) article relationships to facilitate the identification of relevant articles in an efficient and effective manner. Here we propose to enhance article relationships with background semantic knowledge derived from Unified Medical Language System (UMLS) concepts and ontologies.
We developed a pipelined semantic concepts representation process to represent articles from an SR into an optimized and enriched semantic space of UMLS concepts. Throughout the process, we leveraged concepts and concept relations encoded in biomedical ontologies (SNOMED-CT and MeSH) within the UMLS framework to prompt concept features of each article. Article relationships (similarities) were established and represented as a semantic article network, which was readily applied to assist with the article screening process. We incorporated the concept of active learning to simulate an interactive article recommendation process, and evaluated the performance on 15 completed SRs. We used work saved over sampling at 95% recall (WSS95) as the performance measure.
We compared the WSS95 performance of our ontology-based semantic approach to existing lexical feature approaches and corpus-based semantic approaches, and found that we had better WSS95 in most SRs. We also had the highest average WSS95 of 43.81% and the highest total WSS95 of 657.18%.
We demonstrated using ontology-based semantics to facilitate the identification of relevant articles for SRs. Effective concepts and concept relations derived from UMLS ontologies can be utilized to establish article semantic relationships. Our approach provided a promising performance and can easily apply to any SR topics in the biomedical domain with generalizability.
系统评价(SRs)用于总结高质量研究的证据,被认为是循证实践(EBP)的首选证据来源。然而,由于文章筛选成本高昂,进行系统评价可能耗费大量时间和人力。在之前的研究中,我们证明了利用已建立的(词汇)文章关系,能够以高效且有效的方式促进相关文章的识别。在此,我们提议利用从统一医学语言系统(UMLS)概念和本体中获取的背景语义知识来增强文章关系。
我们开发了一个流水线式语义概念表示过程,将系统评价中的文章表示到UMLS概念的优化且丰富的语义空间中。在整个过程中,我们利用UMLS框架内生物医学本体(SNOMED-CT和MeSH)中编码的概念和概念关系来突出每篇文章的概念特征。建立文章关系(相似度)并将其表示为语义文章网络,该网络可直接用于辅助文章筛选过程。我们纳入主动学习的概念来模拟交互式文章推荐过程,并在15项已完成的系统评价中评估其性能。我们使用95%召回率下节省的工作量(WSS95)作为性能指标。
我们将基于本体的语义方法与现有的词汇特征方法和基于语料库的语义方法的WSS95性能进行了比较,发现在大多数系统评价中我们具有更好的WSS95。我们还拥有最高的平均WSS95,为43.81%,以及最高的总WSS95,为657.18%。
我们证明了使用基于本体的语义来促进系统评价中相关文章的识别。从UMLS本体中得出的有效概念和概念关系可用于建立文章语义关系。我们的方法展现出了良好的性能,并且能够轻松应用于生物医学领域的任何系统评价主题,具有通用性。