Department of Health Services Research and Policy, London School of Hygiene and Tropical Medicine, 15-17 Tavistock Place, London, WC1H 9SH, UK; Clinical Effectiveness Unit, Royal College of Surgeons of England, London WC2A 3PE, UK.
Department of Medical Statistics, London School of Hygiene and Tropical Medicine, Keppel Street, London, WC1E 7HT, UK.
J Clin Epidemiol. 2021 Aug;136:136-145. doi: 10.1016/j.jclinepi.2021.04.015. Epub 2021 Apr 28.
Probabilistic linkage can link patients from different clinical databases without the need for personal information. If accurate linkage can be achieved, it would accelerate the use of linked datasets to address important clinical and public health questions.
We developed a step-by-step process for probabilistic linkage of national clinical and administrative datasets without personal information, and validated it against deterministic linkage using patient identifiers.
We used electronic health records from the National Bowel Cancer Audit and Hospital Episode Statistics databases for 10,566 bowel cancer patients undergoing emergency surgery in the English National Health Service.
Probabilistic linkage linked 81.4% of National Bowel Cancer Audit records to Hospital Episode Statistics, vs. 82.8% using deterministic linkage. No systematic differences were seen between patients that were and were not linked, and regression models for mortality and length of hospital stay according to patient and tumour characteristics were not sensitive to the linkage approach.
Probabilistic linkage was successful in linking national clinical and administrative datasets for patients undergoing a major surgical procedure. It allows analysts outside highly secure data environments to undertake linkage while minimizing costs and delays, protecting data security, and maintaining linkage quality.
概率链接可以在不需要个人信息的情况下将来自不同临床数据库的患者进行链接。如果能够实现准确的链接,那么它将加速使用链接数据集来解决重要的临床和公共卫生问题。
我们开发了一种无需个人信息即可对国家临床和管理数据集进行概率链接的逐步流程,并使用患者标识符对其进行了确定性链接的验证。
我们使用了英国国家肠癌审计和医院入院统计数据库中的电子健康记录,其中包括 10566 名在英国国家卫生服务体系中接受紧急手术的肠癌患者。
概率链接将 81.4%的国家肠癌审计记录与医院入院统计数据进行了链接,而确定性链接的比例为 82.8%。未链接和已链接患者之间没有观察到系统差异,并且根据患者和肿瘤特征的死亡率和住院时间的回归模型对链接方法不敏感。
概率链接成功地将接受主要手术的患者的国家临床和管理数据集进行了链接。它允许在高度安全的数据环境之外的分析师进行链接,同时最小化成本和延迟,保护数据安全,并保持链接质量。