Suppr超能文献

区块链赋能的不可变、分布式、高可用的临床研究活动日志系统,用于从多个机构进行联邦 COVID-19 数据分析。

Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions.

机构信息

UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California, USA.

Department of Computer Science and Engineering, University of California San Diego, La Jolla, California, USA.

出版信息

J Am Med Inform Assoc. 2023 May 19;30(6):1167-1178. doi: 10.1093/jamia/ocad049.

Abstract

OBJECTIVE

We aimed to develop a distributed, immutable, and highly available cross-cloud blockchain system to facilitate federated data analysis activities among multiple institutions.

MATERIALS AND METHODS

We preprocessed 9166 COVID-19 Structured Query Language (SQL) code, summary statistics, and user activity logs, from the GitHub repository of the Reliable Response Data Discovery for COVID-19 (R2D2) Consortium. The repository collected local summary statistics from participating institutions and aggregated the global result to a COVID-19-related clinical query, previously posted by clinicians on a website. We developed both on-chain and off-chain components to store/query these activity logs and their associated queries/results on a blockchain for immutability, transparency, and high availability of research communication. We measured run-time efficiency of contract deployment, network transactions, and confirmed the accuracy of recorded logs compared to a centralized baseline solution.

RESULTS

The smart contract deployment took 4.5 s on an average. The time to record an activity log on blockchain was slightly over 2 s, versus 5-9 s for baseline. For querying, each query took on an average less than 0.4 s on blockchain, versus around 2.1 s for baseline.

DISCUSSION

The low deployment, recording, and querying times confirm the feasibility of our cross-cloud, blockchain-based federated data analysis system. We have yet to evaluate the system on a larger network with multiple nodes per cloud, to consider how to accommodate a surge in activities, and to investigate methods to lower querying time as the blockchain grows.

CONCLUSION

Blockchain technology can be used to support federated data analysis among multiple institutions.

摘要

目的

我们旨在开发一种分布式、不可变且高可用的跨云区块链系统,以促进多个机构之间的联合数据分析活动。

材料和方法

我们预处理了来自 COVID-19 可靠响应数据发现(R2D2)联盟的 GitHub 存储库中的 9166 个 COVID-19 SQL 代码、汇总统计信息和用户活动日志。该存储库从参与机构收集本地汇总统计信息,并将全球结果聚合到与 COVID-19 相关的临床查询中,这些查询是临床医生之前在网站上发布的。我们开发了链上和链下组件,以将这些活动日志及其相关查询/结果存储/查询到区块链上,以实现研究通信的不可变、透明性和高可用性。我们测量了合同部署、网络事务的运行时效率,并确认了与集中式基线解决方案相比记录日志的准确性。

结果

智能合约部署的平均时间为 4.5 秒。在区块链上记录活动日志的时间略超过 2 秒,而基线则需要 5-9 秒。对于查询,每个查询在区块链上平均需要不到 0.4 秒,而基线则需要大约 2.1 秒。

讨论

低部署、记录和查询时间证实了我们基于区块链的跨云联合数据分析系统的可行性。我们尚未在具有每个云多个节点的更大网络上评估该系统,以考虑如何适应活动的激增,并研究随着区块链增长降低查询时间的方法。

结论

区块链技术可用于支持多个机构之间的联合数据分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcbd/10198529/4dcb7dd874e6/ocad049f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验