Technische Universität Darmstadt, Schnittspahnstraße 10, 64287, Darmstadt, Germany.
German Cancer Research Center, Im Neuenheimer Feld 580, 69120, Heidelberg, Germany.
J Transl Med. 2022 Oct 8;20(1):458. doi: 10.1186/s12967-022-03671-6.
The low number of patients suffering from any given rare diseases poses a difficult problem for medical research: With the exception of some specialized biobanks and disease registries, potential study participants' information are disjoint and distributed over many medical institutions. Whenever some of those facilities are in close proximity, a significant overlap of patients can reasonably be expected, further complicating statistical study feasibility assessments and data gathering. Due to the sensitive nature of medical records and identifying data, data transfer and joint computations are often forbidden by law or associated with prohibitive amounts of effort. To alleviate this problem and to support rare disease research, we developed the Mainzelliste Secure EpiLinker (MainSEL) record linkage framework, a secure Multi-Party Computation based application using trusted-third-party-less cryptographic protocols to perform privacy-preserving record linkage with high security guarantees. In this work, we extend MainSEL to allow the record linkage based calculation of the number of common patients between institutions. This allows privacy-preserving statistical feasibility estimations for further analyses and data consolidation. Additionally, we created easy to deploy software packages using microservice containerization and continuous deployment/continuous integration. We performed tests with medical researchers using MainSEL in real-world medical IT environments, using synthetic patient data.
We show that MainSEL achieves practical runtimes, performing 10 000 comparisons in approximately 5 minutes. Our approach proved to be feasible in a wide range of network settings and use cases. The "lessons learned" from the real-world testing show the need to explicitly support and document the usage and deployment for both analysis pipeline integration and researcher driven ad-hoc analysis use cases, thus clarifying the wide applicability of our software. MainSEL is freely available under: https://github.com/medicalinformatics/MainSEL CONCLUSIONS: MainSEL performs well in real-world settings and is a useful tool not only for rare disease research, but medical research in general. It achieves practical runtimes, improved security guarantees compared to existing solutions, and is simple to deploy in strict clinical IT environments. Based on the "lessons learned" from the real-word testing, we hope to enable a wide range of medical researchers to meet their needs and requirements using modern privacy-preserving technologies.
患有任何给定罕见病的患者人数较少,这给医学研究带来了一个难题:除了一些专门的生物库和疾病登记处外,潜在研究参与者的信息是不相关的,分布在许多医疗机构中。每当这些设施中的一些设施相邻时,可以合理地预期会有大量的患者重叠,这进一步增加了统计研究可行性评估和数据收集的复杂性。由于医疗记录和识别数据的敏感性,数据传输和联合计算通常受到法律禁止或与大量工作相关联。为了解决这个问题并支持罕见病研究,我们开发了 Mainzelliste Secure EpiLinker(MainSEL)记录链接框架,这是一个安全的多方计算应用程序,使用无需可信第三方的加密协议来执行具有高度安全保证的隐私保护记录链接。在这项工作中,我们扩展了 MainSEL,以允许基于记录链接的机构间共同患者数量的计算。这允许进行隐私保护的统计可行性估计,以便进行进一步分析和数据合并。此外,我们使用微服务容器化和持续部署/持续集成创建了易于部署的软件包。我们使用 MainSEL 在真实的医疗 IT 环境中使用合成患者数据对医学研究人员进行了测试。
我们表明,MainSEL 可以实现实际的运行时,大约 5 分钟即可完成 10000 次比较。我们的方法在广泛的网络设置和用例中证明是可行的。从实际测试中“吸取的经验教训”表明,需要明确支持和记录分析管道集成和研究人员驱动的临时分析用例的使用和部署,从而清楚地阐明了我们软件的广泛适用性。MainSEL 可在以下网址免费获得:https://github.com/medicalinformatics/MainSEL
MainSEL 在真实环境中表现良好,不仅对罕见病研究,而且对一般医学研究都是一种有用的工具。它实现了实际的运行时,与现有解决方案相比提高了安全性保证,并且在严格的临床 IT 环境中易于部署。基于实际测试中“吸取的经验教训”,我们希望使广泛的医学研究人员能够使用现代隐私保护技术满足他们的需求和要求。