大数据时代的患者隐私
Patient Privacy in the Era of Big Data.
机构信息
National Library of Medicine, National Institutes of Health, Maryland, ABD.
出版信息
Balkan Med J. 2018 Jan 20;35(1):8-17. doi: 10.4274/balkanmedj.2017.0966. Epub 2017 Sep 13.
Privacy was defined as a fundamental human right in the Universal Declaration of Human Rights at the 1948 United Nations General Assembly. However, there is still no consensus on what constitutes privacy. In this review, we look at the evolution of privacy as a concept from the era of Hippocrates to the era of social media and big data. To appreciate the modern measures of patient privacy protection and correctly interpret the current regulatory framework in the United States, we need to analyze and understand the concepts of individually identifiable information, individually identifiable health information, protected health information, and de-identification. The Privacy Rule of the Health Insurance Portability and Accountability Act defines the regulatory framework and casts a balance between protective measures and access to health information for secondary (scientific) use. The rule defines the conditions when health information is protected by law and how protected health information can be de-identified for secondary use. With the advents of artificial intelligence and computational linguistics, computational text de-identification algorithms produce de-identified results nearly as well as those produced by human experts, but much faster, more consistently and basically for free. Modern clinical text de-identification systems now pave the road to big data and enable scientists to access de-identified clinical information while firmly protecting patient privacy. However, clinical text de-identification is not a perfect process. In order to maximize the protection of patient privacy and to free clinical and scientific information from the confines of electronic healthcare systems, all stakeholders, including patients, health institutions and institutional review boards, scientists and the scientific communities, as well as regulatory and law enforcement agencies must collaborate closely. On the one hand, public health laws and privacy regulations define rules and responsibilities such as requesting and granting only the amount of health information that is necessary for the scientific study. On the other hand, developers of de-identification systems provide guidelines to use different modes of operations to maximize the effectiveness of their tools and the success of de-identification. Institutions with clinical repositories need to follow these rules and guidelines closely to successfully protect patient privacy. To open the gates of big data to scientific communities, healthcare institutions need to be supported in their de-identification and data sharing efforts by the public, scientific communities, and local, state, and federal legislators and government agencies.
隐私在 1948 年联合国大会的《世界人权宣言》中被定义为一项基本人权。然而,对于什么构成隐私,仍然没有共识。在这篇综述中,我们回顾了从希波克拉底时代到社交媒体和大数据时代隐私概念的演变。为了欣赏现代患者隐私保护措施,并正确解读美国当前的监管框架,我们需要分析和理解可识别个人信息、可识别个人健康信息、受保护健康信息和去识别化的概念。《健康保险流通与责任法案》的隐私规则定义了监管框架,并在保护措施和获取医疗信息以进行二次(科学)使用之间取得平衡。该规则定义了健康信息受法律保护的条件,以及如何对受保护健康信息进行去识别化以进行二次使用。随着人工智能和计算语言学的出现,计算文本去识别算法生成的去识别结果几乎与人类专家一样好,但速度更快、更一致,而且基本上是免费的。现代临床文本去识别系统现在为大数据铺平了道路,使科学家能够访问去识别的临床信息,同时牢牢保护患者隐私。然而,临床文本去识别并不是一个完美的过程。为了最大限度地保护患者隐私,并使临床和科学信息从电子医疗保健系统的限制中解放出来,所有利益相关者,包括患者、医疗机构和机构审查委员会、科学家和科学界,以及监管和执法机构,都必须密切合作。一方面,公共卫生法和隐私法规定义了规则和责任,例如仅请求和授予进行科学研究所需的健康信息量。另一方面,去识别系统的开发者提供了使用不同操作模式的指南,以最大限度地提高其工具的有效性和去识别的成功率。拥有临床存储库的机构需要密切遵循这些规则和指南,以成功保护患者隐私。为了向科学界开放大数据的大门,医疗机构需要得到公众、科学界以及地方、州和联邦立法者和政府机构的支持,以进行去识别和数据共享。