Netherlands Comprehensive Cancer Organization (IKNL), Eindhoven, NL.
Maastricht University Medical Centre+, Maastricht, NL.
AMIA Annu Symp Proc. 2021 Jan 25;2020:870-877. eCollection 2020.
Answering many of the research questions in the field of cancer informatics requires incorporating and centralizing data that are hosted by different parties. Federated Learning (FL) has emerged as a new approach in which a global model can be generated without disclosing private patient data by keeping them at their original location. Flexible, user-friendly, and robust infrastructures are crucial for bringing FL solutions to the day-to-day work of the cancer epidemiologist. In this paper, we present an open source priVAcy preserviNg federaTed leArninG infrastructurE for Secure Insight eXchange, VANTAGE6. We provide a detailed description of its conceptual design, modular architecture, and components. We also show a few examples where VANTAGE6 has been successfully used in research on observational cancer data. Developing and deploying technology to support federated analyses - such as VANTAGE6 - will pave the way for the adoption and mainstream practice of this new approach for analyzing decentralized data.
要回答癌症信息学领域的许多研究问题,需要整合和集中由不同方托管的数据。联邦学习 (FL) 是一种新方法,它可以通过保持数据在原始位置而不披露私人患者数据来生成全局模型。灵活、用户友好且强大的基础架构对于将 FL 解决方案应用于癌症流行病学家的日常工作至关重要。在本文中,我们提出了一个用于安全洞察交换的开源隐私保护联邦学习基础设施,即 VANTAGE6。我们详细描述了其概念设计、模块化架构和组件。我们还展示了 VANTAGE6 在观察性癌症数据研究中成功应用的几个示例。开发和部署支持联邦分析的技术(如 VANTAGE6)将为采用和主流实践这种用于分析分散数据的新方法铺平道路。