连体神经网络增强型心电图可重新识别匿名医疗数据。

Siamese neural network-enhanced electrocardiography can re-identify anonymized healthcare data.

作者信息

Macierzanka Krzysztof, Sau Arunashis, Patlatzoglou Konstantinos, Pastika Libor, Sieliwonczyk Ewa, Gurnani Mehak, Peters Nicholas S, Waks Jonathan W, Kramer Daniel B, Ng Fu Siong

机构信息

National Heart and Lung Institute, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, UK.

Department of Cardiology, Hammersmith Hospital, Imperial College Healthcare NHS Trust, Du Cane Road, London W12 0NN, UK.

出版信息

Eur Heart J Digit Health. 2025 Feb 25;6(3):417-426. doi: 10.1093/ehjdh/ztaf011. eCollection 2025 May.

DOI:10.1093/ehjdh/ztaf011

PMID:40395429

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12088719/

Abstract

AIMS

Many research databases with anonymized patient data contain electrocardiograms (ECGs) from which traditional identifiers have been removed. We evaluated the ability of artificial intelligence (AI) methods to determine the similarity between ECGs and assessed whether they have the potential to be misused to re-identify individuals from anonymized datasets.

METHODS AND RESULTS

We utilized a convolutional Siamese neural network (SNN) architecture, which derives a Euclidean distance similarity metric between two input ECGs. A secondary care dataset of 864 283 ECGs (72 455 subjects) was used. Siamese neural network-electrocardiogram (SNN-ECG) achieves an accuracy of 91.68% when classifying between 2 689 124 same-subject pairs and 2 689 124 different-subject pairs. This performance increases to 93.61% and 95.97% in outpatient and normal ECG subsets. In a simulated 'motivated intruder' test, SNN-ECG can identify individuals from large datasets. In datasets of 100, 1000, 10 000, and 20 000 ECGs, where only one ECG is also from the reference individual, it achieves success rates of 79.2%, 62.6%, 45.0%, and 40.0%, respectively. If this was random, the success would be 1%, 0.1%, 0.01%, and 0.005%, respectively. Additional basic information, like subject sex or age-range, enhances performance further. We also found that, on the subject level, ECG pair similarity is clinically relevant; greater ECG dissimilarity associates with all-cause mortality [hazard ratio, 1.22 (1.21-1.23), < 0.0001] and is additive to an AI-ECG model trained for mortality prediction.

CONCLUSION

Anonymized ECGs retain information that may facilitate subject re-identification, raising privacy and data protection concerns. However, SNN-ECG models also have positive uses and can enhance risk prediction of cardiovascular disease.

摘要

目的

许多包含匿名患者数据的研究数据库中都有已去除传统标识符的心电图（ECG）。我们评估了人工智能（AI）方法确定心电图之间相似性的能力，并评估了它们是否有可能被滥用，以便从匿名数据集中重新识别个体。

方法与结果

我们使用了一种卷积连体神经网络（SNN）架构，该架构可得出两个输入心电图之间的欧几里得距离相似性度量。使用了一个包含864283份心电图（72455名受试者）的二级护理数据集。连体神经网络心电图（SNN-ECG）在对2689124对同受试者对和2689124对不同受试者对进行分类时，准确率达到91.68%。在门诊和正常心电图子集中，这一性能分别提高到93.61%和95.97%。在模拟的“有动机的入侵者”测试中，SNN-ECG可以从大型数据集中识别个体。在分别包含100、1000、10000和20000份心电图的数据集中，其中只有一份心电图也来自参考个体，其成功率分别为79.2%、62.6%、45.0%和40.0%。如果是随机的，成功率分别为1%、0.1%、0.01%和0.005%。额外的基本信息，如受试者性别或年龄范围，可进一步提高性能。我们还发现，在个体层面上，心电图对的相似性具有临床相关性；心电图差异越大与全因死亡率相关[风险比，1.22（1.21-1.23），<0.0001]，并且是用于死亡率预测的AI-ECG模型的附加因素。

结论

匿名心电图保留了可能有助于个体重新识别的信息，引发了隐私和数据保护方面的担忧。然而，SNN-ECG模型也有积极用途，并且可以增强心血管疾病的风险预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ec0/12088719/aa19246f93b3/ztaf011_ga.jpg

相似文献

Siamese neural network-enhanced electrocardiography can re-identify anonymized healthcare data.

Eur Heart J Digit Health. 2025 Feb 25;6(3):417-426. doi: 10.1093/ehjdh/ztaf011. eCollection 2025 May.

Artificial intelligence-enhanced electrocardiography for the identification of a sex-related cardiovascular risk continuum: a retrospective cohort study.

Lancet Digit Health. 2025 Mar;7(3):e184-e194. doi: 10.1016/j.landig.2024.12.003.

An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction.

Lancet. 2019 Sep 7;394(10201):861-867. doi: 10.1016/S0140-6736(19)31721-0. Epub 2019 Aug 1.

A comparison of artificial intelligence-enhanced electrocardiography approaches for the prediction of time to mortality using electrocardiogram images.

Eur Heart J Digit Health. 2024 Nov 18;6(2):180-189. doi: 10.1093/ehjdh/ztae090. eCollection 2025 Mar.

Artificial intelligence-enhanced electrocardiography for accurate diagnosis and management of cardiovascular diseases.

J Electrocardiol. 2024 Mar-Apr;83:30-40. doi: 10.1016/j.jelectrocard.2024.01.006. Epub 2024 Jan 28.

Artificial intelligence electrocardiogram-predicted biological age gap and mortality: Capturing dynamic risk with multiple electrocardiograms.

Heart Rhythm. 2025 May 12. doi: 10.1016/j.hrthm.2025.05.009.

Artificial Intelligence-Enhanced Electrocardiography for Prediction of Incident Hypertension.

JAMA Cardiol. 2025 Mar 1;10(3):214-223. doi: 10.1001/jamacardio.2024.4796.

Use of Artificial Intelligence and Deep Neural Networks in Evaluation of Patients With Electrocardiographically Concealed Long QT Syndrome From the Surface 12-Lead Electrocardiogram.

JAMA Cardiol. 2021 May 1;6(5):532-538. doi: 10.1001/jamacardio.2020.7422.

Artificial intelligence age prediction using electrocardiogram data: Exploring biological age differences.

Heart Rhythm. 2024 Sep 27. doi: 10.1016/j.hrthm.2024.09.046.

Artificial intelligence-estimated biological heart age using a 12-lead electrocardiogram predicts mortality and cardiovascular outcomes.

Front Cardiovasc Med. 2023 Apr 13;10:1137892. doi: 10.3389/fcvm.2023.1137892. eCollection 2023.

引用本文的文献

Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century.

Philos Trans A Math Phys Eng Sci. 2025 May 8;383(2296):20230384. doi: 10.1098/rsta.2023.0384.

本文引用的文献

Prognostic Significance and Associations of Neural Network-Derived Electrocardiographic Features.

Circ Cardiovasc Qual Outcomes. 2024 Dec;17(12):e010602. doi: 10.1161/CIRCOUTCOMES.123.010602. Epub 2024 Nov 14.

Artificial intelligence-enabled electrocardiogram for mortality and cardiovascular risk estimation: a model development and validation study.

Lancet Digit Health. 2024 Nov;6(11):e791-e802. doi: 10.1016/S2589-7500(24)00172-9.

Few-shot transfer learning for personalized atrial fibrillation detection using patient-based siamese network with single-lead ECG records.

Artif Intell Med. 2023 Oct;144:102644. doi: 10.1016/j.artmed.2023.102644. Epub 2023 Sep 1.

BRAVEHEART: Open-source software for automated electrocardiographic and vectorcardiographic analysis.

Comput Methods Programs Biomed. 2023 Dec;242:107798. doi: 10.1016/j.cmpb.2023.107798. Epub 2023 Sep 12.

Ensemble Siamese Network (ESN) Using ECG Signals for Human Authentication in Smart Healthcare System.

Sensors (Basel). 2023 May 13;23(10):4727. doi: 10.3390/s23104727.

Using deep learning-derived image features in radiologic time series to make personalised predictions: proof of concept in colonic transit data.

Eur Radiol. 2023 Nov;33(11):8376-8386. doi: 10.1007/s00330-023-09769-9. Epub 2023 Jun 7.

Artificial intelligence-enabled electrocardiogram to distinguish atrioventricular re-entrant tachycardia from atrioventricular nodal re-entrant tachycardia.

Cardiovasc Digit Health J. 2023 Jan 31;4(2):60-67. doi: 10.1016/j.cvdhj.2023.01.004. eCollection 2023 Apr.

Convolutional Neural Network for Individual Identification Using Phase Space Reconstruction of Electrocardiogram.

Sensors (Basel). 2023 Mar 16;23(6):3164. doi: 10.3390/s23063164.

Artificial intelligence-enabled electrocardiogram to distinguish cavotricuspid isthmus dependence from other atrial tachycardia mechanisms.

Eur Heart J Digit Health. 2022 Aug 17;3(3):405-414. doi: 10.1093/ehjdh/ztac042. eCollection 2022 Sep.

Feature fusion Siamese network for breast cancer detection comparing current and prior mammograms.

Med Phys. 2022 Jun;49(6):3654-3669. doi: 10.1002/mp.15598. Epub 2022 Apr 22.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

连体神经网络增强型心电图可重新识别匿名医疗数据。

Siamese neural network-enhanced electrocardiography can re-identify anonymized healthcare data.

作者信息

Macierzanka Krzysztof, Sau Arunashis, Patlatzoglou Konstantinos, Pastika Libor, Sieliwonczyk Ewa, Gurnani Mehak, Peters Nicholas S, Waks Jonathan W, Kramer Daniel B, Ng Fu Siong

机构信息

National Heart and Lung Institute, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, UK.

Department of Cardiology, Hammersmith Hospital, Imperial College Healthcare NHS Trust, Du Cane Road, London W12 0NN, UK.