von Maltitz Marcel, Ballhausen Hendrik, Kaul David, Fleischmann Daniel F, Niyazi Maximilian, Belka Claus, Carle Georg
Chair of Network Architectures and Services, Department of Informatics, Technical University of Munich, TUM, Garching, Germany.
Department of Radiation Oncology, University Hospital, Ludwig-Maximilians-Universität München, LMU, Munich, Germany.
JMIR Med Inform. 2021 Jan 18;9(1):e22158. doi: 10.2196/22158.
Patient data is considered particularly sensitive personal data. Privacy regulations strictly govern the use of patient data and restrict their exchange. However, medical research can benefit from multicentric studies in which patient data from different institutions are pooled and evaluated together. Thus, the goals of data utilization and data protection are in conflict. Secure multiparty computation (SMPC) solves this conflict because it allows direct computation on distributed proprietary data-held by different data owners-in a secure way without exchanging private data.
The objective of this work was to provide a proof-of-principle of secure and privacy-preserving multicentric computation by SMPC with real-patient data over the free internet. A privacy-preserving log-rank test for the Kaplan-Meier estimator was implemented and tested in both an experimental setting and a real-world setting between two university hospitals.
The domain of survival analysis is particularly relevant in clinical research. For the Kaplan-Meier estimator, we provided a secure version of the log-rank test. It was based on the SMPC realization SPDZ and implemented via the FRESCO framework in Java. The complexity of the algorithm was explored both for synthetic data and for real-patient data in a proof-of-principle over the internet between two clinical institutions located in Munich and Berlin, Germany.
We obtained a functional realization of an SMPC-based log-rank evaluation. This implementation was assessed with respect to performance and scaling behavior. We showed that network latency strongly influences execution time of our solution. Furthermore, we identified a lower bound of 2 Mbit/s for the transmission rate that has to be fulfilled for unimpeded communication. In contrast, performance of the participating parties have comparatively low influence on execution speed, since the peer-side processing is parallelized and the computational time only constitutes 30% to 50% even with optimal network settings. In the real-world setting, our computation between three parties over the internet, processing 100 items each, took approximately 20 minutes.
We showed that SMPC is applicable in the medical domain. A secure version of commonly used evaluation methods for clinical studies is possible with current implementations of SMPC. Furthermore, we infer that its application is practically feasible in terms of execution time.
患者数据被视为特别敏感的个人数据。隐私法规严格管控患者数据的使用并限制其交换。然而,医学研究可从多中心研究中受益,在多中心研究里,来自不同机构的患者数据被集中起来共同评估。因此,数据利用目标与数据保护目标存在冲突。安全多方计算(SMPC)解决了这一冲突,因为它允许以安全方式对由不同数据所有者持有的分布式专有数据进行直接计算,而无需交换私有数据。
本研究的目的是通过SMPC在免费互联网上使用真实患者数据,提供安全且保护隐私的多中心计算的原理证明。在两家大学医院之间的实验环境和现实环境中,实施并测试了针对Kaplan-Meier估计量的隐私保护对数秩检验。
生存分析领域在临床研究中尤为重要。对于Kaplan-Meier估计量,我们提供了对数秩检验的安全版本。它基于SMPC实现SPDZ,并通过Java中的FRESCO框架实现。在位于德国慕尼黑和柏林的两个临床机构之间通过互联网进行原理证明时,针对合成数据和真实患者数据探讨了算法的复杂性。
我们获得了基于SMPC的对数秩评估的功能实现。对该实现进行了性能和扩展行为方面的评估。我们表明网络延迟对我们解决方案的执行时间有很大影响。此外,我们确定了通信畅通所需的传输速率下限为2 Mbit/s。相比之下,参与方的性能对执行速度的影响相对较小,因为对等方处理是并行化的,即使在最佳网络设置下,计算时间也仅占30%至50%。在现实环境中,我们在互联网上三方之间进行计算,各方处理100个项目,大约需要20分钟。
我们表明SMPC适用于医学领域。利用当前SMPC实现方式,可为临床研究常用评估方法提供安全版本。此外,我们推断其在执行时间方面的应用在实际中是可行的。