Suppr超能文献

Anonymization of Health Insurance Claims Data for Medication Safety Assessments.

作者信息

Halilovic Mehmed, Otte Karen, Meurers Thierry, Alibone Marco, Ludwig Marion, Riedel Nico, Wolter Steven, Kühnel Lisa, Hess Steffen, Prasser Fabian

机构信息

Medical Informatics Group, Berlin Institute of Health atCharité, Germany.

InGef - Institut für angewandte Gesundheitsforschung Berlin GmbH, Berlin, Deutschland.

出版信息

Stud Health Technol Inform. 2025 Sep 3;331:283-291. doi: 10.3233/SHTI251407.

Abstract

INTRODUCTION

The re-use of health insurance claims data for research purposes can provide valuable insights to improve patient care. However, as health data is often highly sensitive and subject to strict regulatory frameworks, the privacy of individuals must be protected. Anonymization is a common approach to do so, but finding an effective strategy is challenging due to an inherent trade-off between privacy protection and data utility. A structured approach is needed to balance these objectives and guide the selection of appropriate anonymization strategies.

METHODS

In this paper, we present a systematic evaluation of twelve anonymization strategies applied to German health insurance claims data that has previously been used in a drug safety study. The dataset consisted of 1727 records and 45 variables. Based on a structured threat modeling, we compare a conservative and a threat modeling-based approach, each with six different privacy models and risk thresholds using the ARX Data Anonymization Tool. We assess general data utility and empirically evaluate residual privacy risks using both the Anonymeter framework and a membership inference attack.

RESULTS

Our results show that conservative anonymization ensures strong privacy protection but reduces data utility. In contrast, threat modeling retains more utility while still providing acceptable privacy under moderate thresholds.

CONCLUSION

The proposed process enables a systematic comparison of privacy-utility trade-offs and can be adapted to other medical datasets. Our findings highlight the importance of context-specific anonymization strategies and empirical risk evaluation to guide anonymized data sharing in healthcare.

摘要

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验