Mina Eleni, Thompson Mark, Kaliyaperumal Rajaram, Zhao Jun, der Horst van Eelke, Tatum Zuotian, Hettne Kristina M, Schultes Erik A, Mons Barend, Roos Marco
Department of Human Genetics, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands.
Department of Zoology, University of Oxford, Oxford, UK.
J Biomed Semantics. 2015 Feb 9;6:5. doi: 10.1186/2041-1480-6-5. eCollection 2015.
Data from high throughput experiments often produce far more results than can ever appear in the main text or tables of a single research article. In these cases, the majority of new associations are often archived either as supplemental information in an arbitrary format or in publisher-independent databases that can be difficult to find. These data are not only lost from scientific discourse, but are also elusive to automated search, retrieval and processing. Here, we use the nanopublication model to make scientific assertions that were concluded from a workflow analysis of Huntington's Disease data machine-readable, interoperable, and citable. We followed the nanopublication guidelines to semantically model our assertions as well as their provenance metadata and authorship. We demonstrate interoperability by linking nanopublication provenance to the Research Object model. These results indicate that nanopublications can provide an incentive for researchers to expose data that is interoperable and machine-readable for future use and preservation for which they can get credits for their effort. Nanopublications can have a leading role into hypotheses generation offering opportunities to produce large-scale data integration.
高通量实验产生的数据往往比一篇研究论文的正文或表格中所能呈现的结果多得多。在这些情况下,大多数新的关联通常会以任意格式作为补充信息存档,或者存于难以查找的非出版商专属数据库中。这些数据不仅在科学论述中消失,而且难以通过自动化搜索、检索和处理获取。在此,我们使用纳米出版物模型,使从亨廷顿舞蹈症数据工作流程分析得出的科学论断具有机器可读性、互操作性和可引用性。我们遵循纳米出版物指南,对我们的论断及其出处元数据和作者信息进行语义建模。通过将纳米出版物出处与研究对象模型相链接,我们展示了互操作性。这些结果表明,纳米出版物可以激励研究人员公开具有互操作性和机器可读性的数据,以供未来使用和保存,研究人员为此付出的努力也能得到认可。纳米出版物在假设生成方面可以发挥主导作用,为大规模数据整合提供机会。