Kiossoglou Philip, Borda Ann, Gray Kathleen, Martin-Sanchez Fernando, Verspoor Karin, Lopez-Campos Guillermo
Health and Biomedical Informatics Centre, The University of Melbourne, Parkville, Victoria, Australia.
Stud Health Technol Inform. 2017;245:457-461.
Scientific advancement and the development of new research fields bring uncertainties about what the current topics of research emphasis are and thus, what new knowledge might need to be represented. The exposome is an example of one such new field for which these uncertainties exist. The exposome is the analogue to the genome, from an environmental exposure perspective; research on the exposome has gained momentum only since 2011. In this work, we propose a generally applicable methodology that aims to characterise the landscape of a new research area based on linguistic analysis of its associated publications. Using abstracts of 261 exposome research articles, we illustrate a methodology that combines (1) inductive analysis based on word frequency counts, and term analysis to identify the topics, methods and applications of the new field and (2) deductive analysis using the NCBO Ontology Recommender to identify to what extent this new area is covered by current knowledge representation tools. Applying this method to the exposome literature, we uncover both the current focus of exposome research and the ontologies that are most relevant to the domain.
科学进步和新研究领域的发展带来了关于当前研究重点主题的不确定性,进而也带来了关于可能需要呈现哪些新知识的不确定性。暴露组就是存在这些不确定性的此类新领域之一。从环境暴露的角度来看,暴露组类似于基因组;对暴露组的研究自2011年才开始兴起。在这项工作中,我们提出了一种普遍适用的方法,旨在基于对相关出版物的语言分析来描绘一个新研究领域的全貌。我们使用261篇暴露组研究文章的摘要,阐述了一种结合以下两点的方法:(1)基于词频统计的归纳分析以及术语分析,以识别新领域的主题、方法和应用;(2)使用NCBO本体推荐器进行演绎分析,以确定当前知识表示工具对这一新领域的覆盖程度。将此方法应用于暴露组文献,我们揭示了暴露组研究的当前重点以及与该领域最相关的本体。