DVID: Distributed Versioned Image-Oriented Dataservice.

Affiliation

Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA, United States.

Publication information

Front Neural Circuits. 2019 Feb 5;13:5. doi: 10.3389/fncir.2019.00005. eCollection 2019.

Abstract

Open-source software development has skyrocketed in part due to community tools like github.com, which allow developers to publish code, create branches, and push accepted modifications back to the original repository. As the number and size of electron microscopy (EM)-based datasets increase, the connectomics community faces similar issues when publishing snapshot data that corresponds to a publication. Ideally, there would be a mechanism by which remote collaborators could modify branches of the data and then flexibly reintegrate results via moderated acceptance of changes. The DVID system provides a web-based connectomics API and the first steps toward such a distributed versioning approach to EM-based connectomics datasets. Through its use as the central data resource for Janelia's FlyEM team, we have integrated the concepts of distributed versioning into reconstruction workflows, supporting proofreader training and segmentation experiments through branched, versioned data. DVID also supports persistence to a variety of storage systems, from high-speed local SSDs to cloud-based object stores, which allows its deployment on laptops as well as large servers. Tailoring the backend storage to each type of connectomics data leads to efficient storage and fast queries. DVID is freely available as open-source software with an increasing number of supported storage options.
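The idea of branched, versioned data described in the abstract can be illustrated with a toy example. The sketch below is not DVID's API or its storage implementation; the `Version` class, its methods, and the key names are all hypothetical. It only assumes that each version stores its own changes and that reads fall back to the nearest ancestor holding the key, which is one common way to keep branches cheap.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Version:
    """One node in a hypothetical version tree; it stores only its own changes."""
    name: str
    parent: Optional["Version"] = None
    delta: Dict[str, str] = field(default_factory=dict)
    locked: bool = False

    def put(self, key: str, value: str) -> None:
        # Assumption of this sketch: a version becomes read-only once branched.
        if self.locked:
            raise PermissionError(f"version {self.name!r} is locked")
        self.delta[key] = value

    def get(self, key: str) -> str:
        # Walk toward the root until some ancestor has a value for the key.
        node: Optional[Version] = self
        while node is not None:
            if key in node.delta:
                return node.delta[key]
            node = node.parent
        raise KeyError(key)

    def branch(self, name: str) -> "Version":
        # Lock the parent so that children see a stable snapshot.
        self.locked = True
        return Version(name=name, parent=self)


# A published snapshot with two independent branches (all names are made up).
snapshot = Version("published-snapshot")
snapshot.put("body:1", "neuron A")
training = snapshot.branch("proofreader-training")
experiment = snapshot.branch("segmentation-experiment")
training.put("body:1", "neuron A (edited)")

print(training.get("body:1"))    # the edit is visible on its own branch
print(experiment.get("body:1"))  # the sibling branch still sees the snapshot
```

Because each branch records only a delta, a training or experimental branch costs little until it diverges, and the published snapshot stays unchanged. Reintegrating a branch would need a merge policy, which this sketch leaves out.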
