• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Strategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.用于多中心研究的电子健康记录数据去识别和匿名化策略。
Med Care. 2012 Jul;50 Suppl(Suppl):S82-101. doi: 10.1097/MLR.0b013e3182585355.
2
De-identification of free text data containing personal health information: a scoping review of reviews.去标识化包含个人健康信息的自由文本数据:综述的综述。
Int J Popul Data Sci. 2023 Dec 12;8(1):2153. doi: 10.23889/ijpds.v8i1.2153. eCollection 2023.
3
Automatic de-identification of textual documents in the electronic health record: a review of recent research.电子健康记录中文本文件的自动去识别:近期研究综述。
BMC Med Res Methodol. 2010 Aug 2;10:70. doi: 10.1186/1471-2288-10-70.
4
Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.生物医学文献中匿名化和去识别化的使用与理解:范围综述
J Med Internet Res. 2019 May 31;21(5):e13484. doi: 10.2196/13484.
5
Patient Privacy in the Era of Big Data.大数据时代的患者隐私
Balkan Med J. 2018 Jan 20;35(1):8-17. doi: 10.4274/balkanmedj.2017.0966. Epub 2017 Sep 13.
6
Nonspecific deidentification of date-like text in deidentified clinical notes enables reidentification of dates.去识别化的临床记录中类似日期的非特定信息的去识别化处理可使日期被重新识别。
J Am Med Inform Assoc. 2022 Oct 7;29(11):1967-1971. doi: 10.1093/jamia/ocac147.
7
deidentify.去识别化
AMIA Annu Symp Proc. 2018 Apr 16;2017:485-494. eCollection 2017.
8
Development and evaluation of an open source software tool for deidentification of pathology reports.用于病理报告去识别化的开源软件工具的开发与评估
BMC Med Inform Decis Mak. 2006 Mar 6;6:12. doi: 10.1186/1472-6947-6-12.
9
De-identification of Address, Date, and Alphanumeric Identifiers in Narrative Clinical Reports.病历叙述报告中地址、日期及字母数字标识符的去识别化处理
AMIA Annu Symp Proc. 2014 Nov 14;2014:767-76. eCollection 2014.
10
De-Identification of Medical Narrative Data.医学叙事数据的去识别化
Stud Health Technol Inform. 2017;244:23-27.

引用本文的文献

1
Predicting Emergency Severity Index (ESI) level, hospital admission, and admitting ward in an emergency department using data-driven machine learning.使用数据驱动的机器学习预测急诊科的急诊严重程度指数(ESI)级别、住院情况及收治科室。
BMC Med Inform Decis Mak. 2025 Jul 28;25(1):281. doi: 10.1186/s12911-025-02941-9.
2
Medical laboratory data-based models: opportunities, obstacles, and solutions.基于医学实验室数据的模型:机遇、障碍与解决方案。
J Transl Med. 2025 Jul 24;23(1):823. doi: 10.1186/s12967-025-06802-x.
3
DIRI: Adversarial Patient Reidentification with Large Language Models for Evaluating Clinical Text Anonymization.DIRI:使用大语言模型进行对抗性患者重新识别以评估临床文本匿名化
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:355-364. eCollection 2025.
4
Automated anonymization of radiology reports: comparison of publicly available natural language processing and large language models.放射学报告的自动匿名化:公开可用的自然语言处理与大语言模型的比较
Eur Radiol. 2025 May;35(5):2634-2641. doi: 10.1007/s00330-024-11148-x. Epub 2024 Oct 31.
5
TARGETING UNDERREPRESENTED POPULATIONS IN PRECISION MEDICINE: A FEDERATED TRANSFER LEARNING APPROACH.精准医学中针对代表性不足人群:一种联邦迁移学习方法。
Ann Appl Stat. 2023 Dec;17(4):2970-2992. doi: 10.1214/23-AOAS1747. Epub 2023 Oct 30.
6
Radiology and multi-scale data integration for precision oncology.用于精准肿瘤学的放射学与多尺度数据整合
NPJ Precis Oncol. 2024 Jul 26;8(1):158. doi: 10.1038/s41698-024-00656-0.
7
What Do We Mean by Sharing of Patient Data? DaSH: A Data Sharing Hierarchy of Privacy and Ethical Challenges.我们所说的患者数据共享是什么意思?DaSH:数据共享的隐私和伦理挑战层次结构。
Appl Clin Inform. 2024 Oct;15(5):833-841. doi: 10.1055/a-2373-3291. Epub 2024 Jul 25.
8
Summary of the National Cancer Institute 2023 Virtual Workshop on Medical Image De-identification-Part 1: Report of the MIDI Task Group - Best Practices and Recommendations, Tools for Conventional Approaches to De-identification, International Approaches to De-identification, and Industry Panel on Image De-identification.美国国立癌症研究所2023年医学图像去识别化虚拟研讨会总结 - 第1部分:MIDI任务组报告 - 最佳实践与建议、传统去识别化方法的工具、国际去识别化方法以及图像去识别化行业小组
J Imaging Inform Med. 2025 Feb;38(1):1-15. doi: 10.1007/s10278-024-01182-y. Epub 2024 Jul 12.
9
Symptom-BERT: Enhancing Cancer Symptom Detection in EHR Clinical Notes.症状-BERT:增强电子健康记录临床记录中的癌症症状检测
J Pain Symptom Manage. 2024 Aug;68(2):190-198.e1. doi: 10.1016/j.jpainsymman.2024.05.015. Epub 2024 May 23.
10
Regulatory Issues in Electronic Health Records for Adolescent HIV Research: Strategies and Lessons Learned.青少年艾滋病病毒研究电子健康记录中的监管问题:策略与经验教训
JMIR Form Res. 2024 May 2;8:e46420. doi: 10.2196/46420.

本文引用的文献

1
Deidentification of facial images using composites.使用合成图像对面部图像进行去识别处理。
J Oral Maxillofac Surg. 2011 Dec;69(12):3026-31. doi: 10.1016/j.joms.2011.01.011. Epub 2011 May 20.
2
The MITRE Identification Scrubber Toolkit: design, training, and assessment.MITRE 识别清理工具包:设计、培训和评估。
Int J Med Inform. 2010 Dec;79(12):849-59. doi: 10.1016/j.ijmedinf.2010.09.007. Epub 2010 Oct 14.
3
De-identification procedure and sample quality of the post-clinical test samples at the bio-repository of the National Cancer Center Hospital (NCCH) in Tokyo.东京国家癌症中心医院(NCCH)生物库中临床后测试样本的去识别程序和样本质量。
Jpn J Clin Oncol. 2011 Feb;41(2):295-8. doi: 10.1093/jjco/hyq172. Epub 2010 Sep 18.
4
De-identifying Swedish clinical text - refinement of a gold standard and experiments with Conditional random fields.对瑞典临床文本进行去识别处理——完善金标准并进行条件随机场实验。
J Biomed Semantics. 2010 Apr 12;1(1):6. doi: 10.1186/2041-1480-1-6.
5
De-identification of primary care electronic medical records free-text data in Ontario, Canada.加拿大安大略省初级保健电子病历自由文本数据的去识别化。
BMC Med Inform Decis Mak. 2010 Jun 18;10:35. doi: 10.1186/1472-6947-10-35.
6
Embedding a hiding function in a portable electronic health record for privacy preservation.在便携式电子健康记录中嵌入隐藏功能以保护隐私。
J Med Syst. 2010 Jun;34(3):313-20. doi: 10.1007/s10916-008-9243-8.
7
Using a pipeline to improve de-identification performance.使用管道来提高去识别性能。
AMIA Annu Symp Proc. 2009 Nov 14;2009:447-51.
8
Inductive creation of an annotation schema and a reference standard for de-identification of VA electronic clinical notes.归纳创建用于对退伍军人事务部电子临床记录进行去识别处理的注释模式和参考标准。
AMIA Annu Symp Proc. 2009 Nov 14;2009:416-20.
9
An open source toolkit for medical imaging de-identification.一个用于医学影像去标识化的开源工具包。
Eur Radiol. 2010 Aug;20(8):1896-904. doi: 10.1007/s00330-010-1745-3. Epub 2010 Mar 4.
10
Effects of personal identifier resynthesis on clinical text de-identification.个人标识符再合成对临床文本去识别的影响。
J Am Med Inform Assoc. 2010 Mar-Apr;17(2):159-68. doi: 10.1136/jamia.2009.002212.

用于多中心研究的电子健康记录数据去识别和匿名化策略。

Strategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.

机构信息

Stanford Sleep Medicine Center, Redwood City, CA 94063-5704, USA.

出版信息

Med Care. 2012 Jul;50 Suppl(Suppl):S82-101. doi: 10.1097/MLR.0b013e3182585355.

DOI:10.1097/MLR.0b013e3182585355
PMID:22692265
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6502465/
Abstract

BACKGROUND

De-identification and anonymization are strategies that are used to remove patient identifiers in electronic health record data. The use of these strategies in multicenter research studies is paramount in importance, given the need to share electronic health record data across multiple environments and institutions while safeguarding patient privacy.

METHODS

Systematic literature search using keywords of de-identify, deidentify, de-identification, deidentification, anonymize, anonymization, data scrubbing, and text scrubbing. Search was conducted up to June 30, 2011 and involved 6 different common literature databases. A total of 1798 prospective citations were identified, and 94 full-text articles met the criteria for review and the corresponding articles were obtained. Search results were supplemented by review of 26 additional full-text articles; a total of 120 full-text articles were reviewed.

RESULTS

A final sample of 45 articles met inclusion criteria for review and discussion. Articles were grouped into text, images, and biological sample categories. For text-based strategies, the approaches were segregated into heuristic, lexical, and pattern-based systems versus statistical learning-based systems. For images, approaches that de-identified photographic facial images and magnetic resonance image data were described. For biological samples, approaches that managed the identifiers linked with these samples were discussed, particularly with respect to meeting the anonymization requirements needed for Institutional Review Board exemption under the Common Rule.

CONCLUSIONS

Current de-identification strategies have their limitations, and statistical learning-based systems have distinct advantages over other approaches for the de-identification of free text. True anonymization is challenging, and further work is needed in the areas of de-identification of datasets and protection of genetic information.

摘要

背景

去识别和匿名化是用于去除电子健康记录数据中患者标识符的策略。鉴于需要在多个环境和机构之间共享电子健康记录数据,同时保护患者隐私,因此在多中心研究中使用这些策略至关重要。

方法

使用去识别、去标识、去识别、去标识、匿名化、匿名化、数据清洗和文本清洗等关键词进行系统文献检索。搜索截止日期为 2011 年 6 月 30 日,涉及 6 个不同的常用文献数据库。共确定了 1798 条前瞻性引用,有 94 篇全文文章符合审查标准,并获得了相应的文章。通过对 26 篇额外全文文章的回顾,补充了搜索结果;共审查了 120 篇全文文章。

结果

最终有 45 篇文章符合审查和讨论的纳入标准。文章分为文本、图像和生物样本类别。对于基于文本的策略,方法分为启发式、词汇和基于模式的系统与基于统计学习的系统。对于图像,描述了用于去识别摄影面部图像和磁共振图像数据的方法。对于生物样本,讨论了管理与这些样本相关联的标识符的方法,特别是在满足机构审查委员会豁免的常见规则下的匿名化要求方面。

结论

当前的去识别策略存在其局限性,基于统计学习的系统在去识别自由文本方面具有明显优于其他方法的优势。真正的匿名化具有挑战性,需要在数据集去识别和保护遗传信息方面进一步开展工作。