University Hospital Cologne, Cologne, Germany.
School of Medicine, Technical University of Munich, Munich, Germany.
Sci Data. 2020 Dec 10;7(1):435. doi: 10.1038/s41597-020-00773-y.
The Lean European Open Survey on SARS-CoV-2 Infected Patients (LEOSS) is a European registry for studying the epidemiology and clinical course of COVID-19. To support evidence-generation at the rapid pace required in a pandemic, LEOSS follows an Open Science approach, making data available to the public in real-time. To protect patient privacy, quantitative anonymization procedures are used to protect the continuously published data stream consisting of 16 variables on the course and therapy of COVID-19 from singling out, inference and linkage attacks. We investigated the bias introduced by this process and found that it has very little impact on the quality of output data. Current laws do not specify requirements for the application of formal anonymization methods, there is a lack of guidelines with clear recommendations and few real-world applications of quantitative anonymization procedures have been described in the literature. We therefore believe that our work can help others with developing urgently needed anonymization pipelines for their projects.
LEOSS(欧洲严重急性呼吸综合征冠状病毒 2 感染患者精益开放式调查)是一个研究 COVID-19 流行病学和临床过程的欧洲注册处。为了在大流行期间所需的快速速度支持证据生成,LEOSS 采用开放科学方法,实时向公众提供数据。为了保护患者隐私,采用定量匿名化程序来保护由 16 个变量组成的关于 COVID-19 病程和治疗的连续发布数据流,以防止单独识别、推断和链接攻击。我们调查了该过程引入的偏差,发现它对输出数据的质量几乎没有影响。现行法律没有规定正式匿名化方法的应用要求,缺乏明确建议的指南,文献中也很少描述定量匿名化程序的实际应用。因此,我们认为我们的工作可以帮助其他人开发其项目急需的匿名化管道。