Mathew Cherian, Güntsch Anton, Obst Matthias, Vicario Saverio, Haines Robert, Williams Alan R, de Jong Yde, Goble Carole
Freie Universität Berlin, Botanic Garden and Botanical Museum Berlin-Dahlem, Berlin, Germany.
Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden.
Biodivers Data J. 2014 Dec 11(2):e4221. doi: 10.3897/BDJ.2.e4221. eCollection 2014.
The compilation and cleaning of data needed for analyses and prediction of species distributions is a time consuming process requiring a solid understanding of data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Data Refinement Workflow which integrates taxonomic data retrieval, data cleaning, and data selection into a consistent, standards-based, and effective system hiding the complexity of underlying service infrastructures. The workflow can be freely used both locally and through a web-portal which does not require additional software installations by users.
用于分析和预测物种分布的数据编译与清理是一个耗时的过程,需要对生物多样性信息基础设施提供的数据格式和服务应用程序编程接口有扎实的理解。我们设计并实现了一个基于Taverna的数据精炼工作流程,该流程将分类数据检索、数据清理和数据选择集成到一个一致的、基于标准且有效的系统中,隐藏了底层服务基础设施的复杂性。该工作流程既可以在本地免费使用,也可以通过一个无需用户额外安装软件的网络门户使用。