Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA.
Department of Biomedical Engineering, Duke University, Durham, NC, USA.
Lancet Digit Health. 2023 Apr;5(4):e239-e247. doi: 10.1016/S2589-7500(22)00234-5. Epub 2023 Feb 14.
Wearable devices have made it easier to generate and share data collected on individuals. This systematic review seeks to investigate whether deidentifying data from wearable devices is sufficient to protect the privacy of individuals in datasets. We searched Web of Science, IEEE Xplore Digital Library, PubMed, Scopus, and the ACM Digital Library on Dec 6, 2021 (PROSPERO registration number CRD42022312922). We also performed manual searches in journals of interest until April 12, 2022. Although our search strategy had no language restrictions, all retrieved studies were in English. We included studies showing reidentification, identification, or authentication with data from wearable devices. Our search retrieved 17 625 studies, and 72 studies met our inclusion criteria. We designed a custom assessment tool for study quality and risk of bias assessments. 64 studies were classified as high quality and eight as moderate quality, and we did not detect any bias in any of the included studies. Correct identification rates were typically 86-100%, indicating a high risk of reidentification. Additionally, as little as 1-300 s of recording were required to enable reidentification from sensors that are generally not thought to generate identifiable information, such as electrocardiograms. These findings call for concerted efforts to rethink methods for data sharing to promote advances in research innovation while preventing the loss of individual privacy.
可穿戴设备使得生成和共享个人数据变得更加容易。本系统评价旨在调查从可穿戴设备中去除身份识别数据是否足以保护数据集中个人的隐私。我们于 2021 年 12 月 6 日在 Web of Science、IEEE Xplore Digital Library、PubMed、Scopus 和 ACM Digital Library 进行了检索(PROSPERO 注册号:CRD42022312922)。我们还在感兴趣的期刊中进行了手动检索,直到 2022 年 4 月 12 日。尽管我们的检索策略没有语言限制,但所有检索到的研究均为英文。我们纳入了展示了可穿戴设备数据重识别、识别或认证的研究。我们的检索共获取了 17625 项研究,其中 72 项符合纳入标准。我们设计了一个自定义评估工具,用于评估研究质量和偏倚风险。64 项研究被归类为高质量,8 项为中等质量,我们未在任何纳入研究中发现偏倚。正确识别率通常为 86-100%,表明重识别风险较高。此外,只需 1-300 秒的记录即可从通常被认为不会生成可识别信息的传感器(如心电图)中进行重识别。这些发现呼吁各方共同努力,重新思考数据共享方法,在促进研究创新的同时,防止个人隐私的丧失。