• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A Comparative Analysis of Speed and Accuracy for Three Off-the-Shelf De-Identification Tools.三种现成去识别工具的速度与准确性比较分析
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:241-250. eCollection 2020.
2
An Extensible Evaluation Framework Applied to Clinical Text Deidentification Natural Language Processing Tools: Multisystem and Multicorpus Study.应用于临床文本去标识化自然语言处理工具的可扩展评估框架:多系统和多语料库研究。
J Med Internet Res. 2024 May 28;26:e55676. doi: 10.2196/55676.
3
An evaluation of existing text de-identification tools for use with patient progress notes from Australian general practice.对澳大利亚全科医疗中用于患者病程记录的现有文本去识别工具的评估。
Int J Med Inform. 2023 May;173:105021. doi: 10.1016/j.ijmedinf.2023.105021. Epub 2023 Feb 11.
4
A study of deep learning methods for de-identification of clinical notes in cross-institute settings.深度学习方法在跨机构环境下对临床记录进行去识别的研究。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):232. doi: 10.1186/s12911-019-0935-4.
5
Text de-identification for privacy protection: a study of its impact on clinical text information content.用于隐私保护的文本去识别化:对其对临床文本信息内容影响的一项研究
J Biomed Inform. 2014 Aug;50:142-50. doi: 10.1016/j.jbi.2014.01.011. Epub 2014 Feb 3.
6
Patient Privacy in the Era of Big Data.大数据时代的患者隐私
Balkan Med J. 2018 Jan 20;35(1):8-17. doi: 10.4274/balkanmedj.2017.0966. Epub 2017 Sep 13.
7
Customization scenarios for de-identification of clinical notes.临床记录去识别的定制化场景。
BMC Med Inform Decis Mak. 2020 Jan 30;20(1):14. doi: 10.1186/s12911-020-1026-2.
8
The machine giveth and the machine taketh away: a parrot attack on clinical text deidentified with hiding in plain sight.机器给予,机器又夺走:隐藏在明处的鹦鹉攻击对临床文本去识别。
J Am Med Inform Assoc. 2019 Dec 1;26(12):1536-1544. doi: 10.1093/jamia/ocz114.
9
Challenges and Insights in Using HIPAA Privacy Rule for Clinical Text Annotation.使用《健康保险流通与责任法案》隐私规则进行临床文本注释的挑战与见解。
AMIA Annu Symp Proc. 2015 Nov 5;2015:707-16. eCollection 2015.
10
De-identification of Address, Date, and Alphanumeric Identifiers in Narrative Clinical Reports.病历叙述报告中地址、日期及字母数字标识符的去识别化处理
AMIA Annu Symp Proc. 2014 Nov 14;2014:767-76. eCollection 2014.

引用本文的文献

1
Exploring Freely Available Data Tools to Support Open Data and Open Science.探索免费可用的数据工具以支持开放数据和开放科学。
J Hosp Librariansh. 2024;24(2):104-111. doi: 10.1080/15323269.2024.2326787. Epub 2024 Apr 9.
2
An Extensible Evaluation Framework Applied to Clinical Text Deidentification Natural Language Processing Tools: Multisystem and Multicorpus Study.应用于临床文本去标识化自然语言处理工具的可扩展评估框架:多系统和多语料库研究。
J Med Internet Res. 2024 May 28;26:e55676. doi: 10.2196/55676.
3
Web-Based Application Based on Human-in-the-Loop Deep Learning for Deidentifying Free-Text Data in Electronic Medical Records: Development and Usability Study.基于人在回路深度学习的电子病历自由文本数据去识别化的网络应用程序:开发与可用性研究
Interact J Med Res. 2023 Aug 25;12:e46322. doi: 10.2196/46322.
4
Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes.拓扑和重描述可从临床表型中检测到多种替代的生物学途径。
Exp Biol Med (Maywood). 2022 Nov;247(22):2015-2024. doi: 10.1177/15353702221126671. Epub 2022 Nov 18.
5
Investigation of the Utility of Features in a Clinical De-identification Model: A Demonstration Using EHR Pathology Reports for Advanced NSCLC Patients.临床去识别模型中特征效用的研究:使用晚期非小细胞肺癌患者的电子健康记录病理报告进行的示范
Front Digit Health. 2022 Feb 16;4:728922. doi: 10.3389/fdgth.2022.728922. eCollection 2022.
6
Ensuring a safe(r) harbor: Excising personally identifiable information from structured electronic health record data.确保更安全的避风港:从结构化电子健康记录数据中删除个人身份信息。
J Clin Transl Sci. 2021 Dec 9;6(1):e10. doi: 10.1017/cts.2021.880. eCollection 2022.
7
Building a best-in-class automated de-identification tool for electronic health records through ensemble learning.通过集成学习构建用于电子健康记录的一流自动去识别工具。
Patterns (N Y). 2021 May 12;2(6):100255. doi: 10.1016/j.patter.2021.100255. eCollection 2021 Jun 11.
8
An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature.两种用于 COVID-19 文献的商业深度学习信息检索系统的评估。
J Am Med Inform Assoc. 2021 Jan 15;28(1):132-137. doi: 10.1093/jamia/ocaa271.

本文引用的文献

1
A survey of practices for the use of electronic health records to support research recruitment.一项关于使用电子健康记录支持研究招募的实践调查。
J Clin Transl Sci. 2017 Aug;1(4):246-252. doi: 10.1017/cts.2017.301.
2
De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.去识别精神科入院记录:2016 年 CEGS N-GRID 共享任务跟踪 1 概述。
J Biomed Inform. 2017 Nov;75S:S4-S18. doi: 10.1016/j.jbi.2017.06.011. Epub 2017 Jun 11.
3
Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1.用于纵向临床记录去识别化的自动化系统:2014年i2b2/德克萨斯大学健康科学中心共享任务赛道1概述
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S11-S19. doi: 10.1016/j.jbi.2015.06.007. Epub 2015 Jul 28.
4
A review of approaches to identifying patient phenotype cohorts using electronic health records.利用电子健康记录识别患者表型队列的方法综述。
J Am Med Inform Assoc. 2014 Mar-Apr;21(2):221-30. doi: 10.1136/amiajnl-2013-001935. Epub 2013 Nov 7.
5
Evaluating current automatic de-identification methods with Veteran's health administration clinical documents.评估退伍军人健康管理局临床文档中当前的自动去识别方法。
BMC Med Res Methodol. 2012 Jul 27;12:109. doi: 10.1186/1471-2288-12-109.
6
Automatic de-identification of textual documents in the electronic health record: a review of recent research.电子健康记录中文本文件的自动去识别:近期研究综述。
BMC Med Res Methodol. 2010 Aug 2;10:70. doi: 10.1186/1471-2288-10-70.
7
Extracting information from textual documents in the electronic health record: a review of recent research.从电子健康记录中的文本文件提取信息:近期研究综述
Yearb Med Inform. 2008:128-44.
8
Rapidly retargetable approaches to de-identification in medical records.医疗记录中快速可重新定位的去识别方法。
J Am Med Inform Assoc. 2007 Sep-Oct;14(5):564-73. doi: 10.1197/jamia.M2435. Epub 2007 Jun 28.

三种现成去识别工具的速度与准确性比较分析

A Comparative Analysis of Speed and Accuracy for Three Off-the-Shelf De-Identification Tools.

作者信息

Heider Paul M, Obeid Jihad S, Meystre Stéphane M

机构信息

Biomedical Informatics Center, Medical University of South Carolina, Charleston, SC.

出版信息

AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:241-250. eCollection 2020.

PMID:32477643
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7233098/
Abstract

A growing quantity of health data is being stored in Electronic Health Records (EHR). The free-text section of these clinical notes contains important patient and treatment information for research but also contains Personally Identifiable Information (PII), which cannot be freely shared within the research community without compromising patient confidentiality and privacy rights. Significant work has been invested in investigating automated approaches to text de-identification, the process of removing or redacting PII. Few studies have examined the performance of existing de-identification pipelines in a controlled comparative analysis. In this study, we use publicly available corpora to analyze speed and accuracy differences between three de-identification systems that can be run off-the-shelf: Amazon Comprehend Medical PHId, Clinacuity's CliniDeID, and the National Library of Medicine's Scrubber. No single system dominated all the compared metrics. NLM Scrubber was the fastest while CliniDeID generally had the highest accuracy.

摘要

越来越多的健康数据被存储在电子健康记录(EHR)中。这些临床记录的自由文本部分包含了用于研究的重要患者和治疗信息,但也包含个人身份信息(PII),在不损害患者保密性和隐私权的情况下,这些信息不能在研究社区内自由共享。人们已经投入了大量工作来研究文本去识别化的自动化方法,即去除或编辑PII的过程。很少有研究在受控的比较分析中检验现有去识别化流程的性能。在本研究中,我们使用公开可用的语料库来分析三种现成的去识别化系统之间的速度和准确性差异:亚马逊理解医疗PHId、Clinacuity的CliniDeID以及美国国立医学图书馆的Scrubber。没有一个系统在所有比较指标上都占主导地位。NLM Scrubber速度最快,而CliniDeID通常准确性最高。