Lee Preston V, Dinu Valentin
Department of Biomedical Informatics, Arizona State University, 13212 East Shea Boulevard, Scottsdale, AZ, 85259, USA.
BMC Bioinformatics. 2015 Nov 4;16:364. doi: 10.1186/s12859-015-0779-6.
Our publication of the BitTorious portal [1] demonstrated the ability to create a privatized distributed data warehouse of sufficient magnitude for real-world bioinformatics studies using minimal changes to the standard BitTorrent tracker protocol. In this second phase, we release a new server-side specification to accept anonymous philantropic storage donations by the general public, wherein a small portion of each user's local disk may be used for archival of scientific data. We have implementated the server-side announcement and control portions of this BitTorrent extension into v3.0.0 of the BitTorious portal, upon which compatible clients may be built.
Automated test cases for the BitTorious Volunteer extensions have been added to the portal's v3.0.0 release, supporting validation of the "peer affinity" concept and announcement protocol introduced by this specification. Additionally, a separate reference implementation of affinity calculation has been provided in C++ for informaticians wishing to integrate into libtorrent-based projects.
The BitTorrent "affinity" extensions as provided in the BitTorious portal reference implementation allow data publishers to crowdsource the extreme storage prerequisites for research in "big data" fields. With sufficient awareness and adoption of BitTorious Volunteer-based clients by the general public, the BitTorious portal may be able to provide peta-scale storage resources to the scientific community at relatively insignificant financial cost.
我们发布的BitTorious门户[1]展示了使用对标准BitTorrent跟踪器协议进行最小更改来创建一个规模足够大的私有化分布式数据仓库以用于实际生物信息学研究的能力。在第二阶段,我们发布了一个新的服务器端规范,以接受公众的匿名慈善存储捐赠,其中每个用户的本地磁盘的一小部分可用于科学数据存档。我们已将此BitTorrent扩展的服务器端公告和控制部分实现到BitTorious门户的v3.0.0版本中,在此基础上可以构建兼容的客户端。
针对BitTorious志愿者扩展的自动化测试用例已添加到门户的v3.0.0版本中,支持对本规范引入的“对等亲和力”概念和公告协议进行验证。此外,还为希望集成到基于libtorrent的项目中的信息学家提供了一个用C++编写的亲和力计算的单独参考实现。
BitTorious门户参考实现中提供的BitTorrent“亲和力”扩展允许数据发布者众包“大数据”领域研究的极端存储需求。如果公众对基于BitTorious志愿者的客户端有足够的认知和采用,BitTorious门户或许能够以相对微不足道的财务成本为科学界提供千万亿字节规模的存储资源。