Arend Daniel, Lange Matthias, Chen Jinbo, Colmsee Christian, Flemming Steffen, Hecht Denny, Scholz Uwe
Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Corrensstr, 3, 06466 Stadt Seeland, Germany.
BMC Bioinformatics. 2014 Jun 24;15:214. doi: 10.1186/1471-2105-15-214.
The life-science community faces a major challenge in handling "big data", highlighting the need for high quality infrastructures capable of sharing and publishing research data. Data preservation, analysis, and publication are the three pillars in the "big data life cycle". The infrastructures currently available for managing and publishing data are often designed to meet domain-specific or project-specific requirements, resulting in the repeated development of proprietary solutions and lower quality data publication and preservation overall.
e!DAL is a lightweight software framework for publishing and sharing research data. Its main features are version tracking, metadata management, information retrieval, registration of persistent identifiers (DOI), an embedded HTTP(S) server for public data access, access as a network file system, and a scalable storage backend. e!DAL is available as an API for local non-shared storage and as a remote API featuring distributed applications. It can be deployed "out-of-the-box" as an on-site repository.
e!DAL was developed based on experiences coming from decades of research data management at the Leibniz Institute of Plant Genetics and Crop Plant Research (IPK). Initially developed as a data publication and documentation infrastructure for the IPK's role as a data center in the DataCite consortium, e!DAL has grown towards being a general data archiving and publication infrastructure. The e!DAL software has been deployed into the Maven Central Repository. Documentation and Software are also available at: http://edal.ipk-gatersleben.de.
生命科学领域在处理“大数据”方面面临重大挑战,这凸显了对能够共享和发布研究数据的高质量基础设施的需求。数据保存、分析和发布是“大数据生命周期”的三大支柱。当前可用于管理和发布数据的基础设施通常是为满足特定领域或特定项目的要求而设计的,这导致专有解决方案的重复开发以及整体数据发布和保存质量较低。
e!DAL是一个用于发布和共享研究数据的轻量级软件框架。其主要功能包括版本跟踪、元数据管理、信息检索、持久标识符(DOI)注册、用于公共数据访问的嵌入式HTTP(S)服务器、作为网络文件系统进行访问以及可扩展的存储后端。e!DAL既可以作为用于本地非共享存储的API使用,也可以作为具有分布式应用程序的远程API使用。它可以作为现场存储库“开箱即用”地进行部署。
e!DAL是基于莱布尼茨植物遗传与作物研究所(IPK)数十年研究数据管理经验而开发的。最初作为IPK在DataCite联盟中作为数据中心的数据发布和文档基础设施而开发,e!DAL已发展成为一个通用的数据存档和发布基础设施。e!DAL软件已部署到Maven中央存储库中。文档和软件也可在以下网址获取:http://edal.ipk-gatersleben.de。