Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:1969-1975. doi: 10.1109/EMBC46164.2021.9630880.
ECGs analysis is an important tool in cardiac diagnosis. ECG data also have the potential to be used as a biometric source that allows precise person identification similar to the widely used fingerprint and iris recognition techniques. However, this phenomenon also raises serious privacy concerns. In this study, we provide a complete, multi-step ECG identification algorithm using a private database of ECG recordings. We train and validate our AI model on approximately 40k patients which makes this study by far the largest research project in this field. Moreover, our best model attained an exceptionally high accuracy of 94.56%. In addition to discussing the general implications of the deployment of such systems related to privacy, for the first time, we also assess the accuracy of ECG-based identification for distinct heart condition groups (and combinations of such conditions) and the corresponding privacy implications. For instance, we discovered that in contrast to initial expectation that identification accuracy for healthy normal sinus rhythm should be the highest, the identification accuracy is higher for patients with sinus tachycardia or patients who are diagnosed with both ST changes and supraventricular tachycardia. This puts these patients at a higher risk of privacy issues due to re-identification. On the other hand, we observed that patients with premature ventricular contractions have an identification accuracy as low as 78.54%. The identification rate for patients with a pacemaker is 80.2%.Clinical relevance-While ECG as a biometric can be a potentially useful technology, it also raises serious concerns regarding the privacy of cardiac patients. Especially, the ECG Identification algorithms empowered by deep learning can increase the risk of re-identification.
心电图分析是心脏诊断的重要工具。心电图数据也有可能被用作生物识别源,类似于广泛使用的指纹和虹膜识别技术,可以进行精确的人员识别。然而,这种现象也引发了严重的隐私问题。在这项研究中,我们使用私人的心电图记录数据库提供了一个完整的、多步骤的心电图识别算法。我们在大约 40000 名患者身上训练和验证我们的人工智能模型,这使得这项研究迄今为止是该领域最大的研究项目。此外,我们的最佳模型达到了异常高的 94.56%的准确率。除了讨论与隐私相关的此类系统部署的一般影响外,我们还是首次评估了基于心电图的识别对于不同心脏状况组(以及这些状况的组合)的准确性以及相应的隐私影响。例如,我们发现,与最初的预期相反,即健康正常窦性心律的识别准确率应该最高,窦性心动过速或同时患有 ST 段改变和室上性心动过速的患者的识别准确率更高。这使得这些患者由于重新识别而面临更高的隐私问题风险。另一方面,我们观察到患有室性期前收缩的患者的识别准确率低至 78.54%。装有起搏器的患者的识别率为 80.2%。临床意义-虽然心电图作为一种生物识别技术可能是一种潜在有用的技术,但它也引发了关于心脏患者隐私的严重问题。特别是,深度学习支持的心电图识别算法可能会增加重新识别的风险。