Lee Preston V, Dinu Valentin
Department of Biomedical Informatics, Arizona State University, 13212 East Shea Boulevard, Scottsdale, AZ, 85259, USA.
BMC Bioinformatics. 2014 Dec 21;15(1):424. doi: 10.1186/s12859-014-0424-9.
Centralized silos of genomic data are initially easier to design, develop and deploy than distributed models. However, as interoperability pains in EHR/EMR, HIE and other collaboration-centric life sciences domains have taught us, the core challenge of networking genomics systems lies not in the construction of individual silos, but in making those deployments interoperate in a manner that embraces the heterogeneous needs, terms and infrastructure of collaborating parties. This article demonstrates the adaptation of BitTorrent to private collaboration networks in an authenticated, authorized and encrypted manner while retaining the characteristics of standard BitTorrent.
The BitTorious portal was successfully used to manage many concurrent domestic BitTorrent clients across the United States, exchanging genomics data payloads in excess of 500 GiB using the uTorrent client software on Linux, OS X and Windows platforms. Individual nodes were sporadically interrupted to verify the resilience of the system to outages of a single client node, as well as the recovery of nodes resuming operation on intermittent Internet connections.
The authorization-based extension of BitTorrent and the accompanying BitTorious reference tracker and user management web portal provide a free, standards-based, general-purpose and extensible data distribution system for large 'omics collaborations.
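
To make the idea of an authorization-aware tracker concrete, the following is a minimal sketch and not the BitTorious implementation. It assumes a hypothetical per-user key and a hypothetical in-memory permission table (`permissions`), and it returns a peer list only to users authorized for the requested torrent. The response keys (`failure reason`, `interval`, `peers`) follow the standard BitTorrent tracker response format; the class name, method signature and 1800-second interval are illustrative assumptions. Transport encryption (for example, serving the tracker over HTTPS) is outside the scope of this sketch.

```python
from dataclasses import dataclass, field


def bencode(value) -> bytes:
    """Encode ints, strings/bytes, lists and dicts using BitTorrent bencoding."""
    if isinstance(value, bool):
        raise TypeError("bencoding has no boolean type")
    if isinstance(value, int):
        return b"i%de" % value
    if isinstance(value, str):
        value = value.encode("utf-8")
    if isinstance(value, bytes):
        return b"%d:%s" % (len(value), value)
    if isinstance(value, list):
        return b"l" + b"".join(bencode(v) for v in value) + b"e"
    if isinstance(value, dict):
        # Dictionary keys are byte strings and must appear in sorted order.
        items = sorted(
            ((k.encode("utf-8") if isinstance(k, str) else k, v)
             for k, v in value.items()),
            key=lambda kv: kv[0],
        )
        return b"d" + b"".join(bencode(k) + bencode(v) for k, v in items) + b"e"
    raise TypeError(f"cannot bencode {type(value).__name__}")


@dataclass
class PrivateTracker:
    # Hypothetical ACL layout: user key -> set of info_hashes the user may join.
    permissions: dict
    # info_hash -> {peer_id: (ip, port)}
    swarms: dict = field(default_factory=dict)

    def announce(self, user_key, info_hash, peer_id, ip, port) -> bytes:
        """Return a bencoded tracker response, refusing unauthorized users."""
        allowed = self.permissions.get(user_key)
        if allowed is None:
            return bencode({"failure reason": "unknown user key"})
        if info_hash not in allowed:
            return bencode({"failure reason": "not authorized for this torrent"})

        swarm = self.swarms.setdefault(info_hash, {})
        # List every other known peer, then register the announcing peer.
        peers = [
            {"peer id": pid, "ip": peer_ip, "port": peer_port}
            for pid, (peer_ip, peer_port) in swarm.items()
            if pid != peer_id
        ]
        swarm[peer_id] = (ip, port)
        return bencode({"interval": 1800, "peers": peers})


tracker = PrivateTracker(permissions={"alice-key": {b"h1"}, "bob-key": {b"h1"}})
first = tracker.announce("alice-key", b"h1", b"A", "10.0.0.1", 6881)
second = tracker.announce("bob-key", b"h1", b"B", "10.0.0.2", 6881)
denied = tracker.announce("mallory-key", b"h1", b"M", "10.0.0.3", 6881)
```

In this sketch an unauthorized client never learns the swarm membership and is never added to it, so standard clients can still be used unchanged once they hold a valid key.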