• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Classifying Cyber-Risky Clinical Notes by Employing Natural Language Processing.利用自然语言处理对具有网络风险的临床记录进行分类
Proc Annu Hawaii Int Conf Syst Sci. 2022;2022:4140-4146. doi: 10.24251/hicss.2022.505. Epub 2022 Jan 4.
2
Text de-identification for privacy protection: a study of its impact on clinical text information content.用于隐私保护的文本去识别化:对其对临床文本信息内容影响的一项研究
J Biomed Inform. 2014 Aug;50:142-50. doi: 10.1016/j.jbi.2014.01.011. Epub 2014 Feb 3.
3
Natural Language Processing for Enterprise-scale De-identification of Protected Health Information in Clinical Notes.自然语言处理在临床记录中用于企业级的保护健康信息去识别。
AMIA Jt Summits Transl Sci Proc. 2022 May 23;2022:92-101. eCollection 2022.
4
De-identification of patient notes with recurrent neural networks.使用递归神经网络对患者记录进行去识别化处理。
J Am Med Inform Assoc. 2017 May 1;24(3):596-606. doi: 10.1093/jamia/ocw156.
5
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
6
A study of deep learning methods for de-identification of clinical notes in cross-institute settings.深度学习方法在跨机构环境下对临床记录进行去识别的研究。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):232. doi: 10.1186/s12911-019-0935-4.
7
Preparing an annotated gold standard corpus to share with extramural investigators for de-identification research.准备一个带注释的金标准语料库,以便与校外研究人员共享用于去识别化研究。
J Biomed Inform. 2014 Aug;50:173-183. doi: 10.1016/j.jbi.2014.01.014. Epub 2014 Feb 17.
8
An Accurate Deep Learning Model for Clinical Entity Recognition From Clinical Notes.一种从临床笔记中识别临床实体的精确深度学习模型。
IEEE J Biomed Health Inform. 2021 Oct;25(10):3804-3811. doi: 10.1109/JBHI.2021.3099755. Epub 2021 Oct 5.
9
Building a tobacco user registry by extracting multiple smoking behaviors from clinical notes.从临床记录中提取多种吸烟行为来建立一个烟草使用者登记册。
BMC Med Inform Decis Mak. 2019 Jul 25;19(1):141. doi: 10.1186/s12911-019-0863-3.
10
Classifying clinical notes with pain assessment using machine learning.使用机器学习对临床记录进行疼痛评估分类。
Med Biol Eng Comput. 2018 Jul;56(7):1285-1292. doi: 10.1007/s11517-017-1772-1. Epub 2017 Dec 26.

引用本文的文献

1
Improving Clinical Documentation with Artificial Intelligence: A Systematic Review.利用人工智能改善临床文档记录:一项系统综述。
Perspect Health Inf Manag. 2024 Jun 1;21(2):1d. eCollection 2024 Summer-Fall.

本文引用的文献

1
De-Identification of Medical Narrative Data.医学叙事数据的去识别化
Stud Health Technol Inform. 2017;244:23-27.
2
A cascaded approach for Chinese clinical text de-identification with less annotation effort.一种用于中文临床文本去识别的级联方法,所需标注工作量较少。
J Biomed Inform. 2017 Sep;73:76-83. doi: 10.1016/j.jbi.2017.07.017. Epub 2017 Jul 26.
3
Entity recognition from clinical texts via recurrent neural network.基于循环神经网络的临床文本实体识别。
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67. doi: 10.1186/s12911-017-0468-7.
4
De-identification of clinical notes via recurrent neural network and conditional random field.通过递归神经网络和条件随机场对临床记录进行去识别。
J Biomed Inform. 2017 Nov;75S:S34-S42. doi: 10.1016/j.jbi.2017.05.023. Epub 2017 Jun 1.
5
Creation of a new longitudinal corpus of clinical narratives.创建一个新的临床叙事纵向语料库。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S6-S10. doi: 10.1016/j.jbi.2015.09.018. Epub 2015 Oct 1.
6
Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.用于去识别化的纵向临床记录标注:2014年i2b2/德克萨斯大学健康科学中心语料库
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S20-S29. doi: 10.1016/j.jbi.2015.07.020. Epub 2015 Aug 28.
7
CRFs based de-identification of medical records.基于病例报告表的医疗记录去识别化处理。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S39-S46. doi: 10.1016/j.jbi.2015.08.012. Epub 2015 Aug 24.
8
Text de-identification for privacy protection: a study of its impact on clinical text information content.用于隐私保护的文本去识别化:对其对临床文本信息内容影响的一项研究
J Biomed Inform. 2014 Aug;50:142-50. doi: 10.1016/j.jbi.2014.01.011. Epub 2014 Feb 3.
9
Bootstrapping a de-identification system for narrative patient records: cost-performance tradeoffs.为叙事性患者记录构建去识别系统:成本效益权衡。
Int J Med Inform. 2013 Sep;82(9):821-31. doi: 10.1016/j.ijmedinf.2013.03.005. Epub 2013 Apr 30.
10
Building gold standard corpora for medical natural language processing tasks.为医学自然语言处理任务构建金标准语料库。
AMIA Annu Symp Proc. 2012;2012:144-53. Epub 2012 Nov 3.

利用自然语言处理对具有网络风险的临床记录进行分类

Classifying Cyber-Risky Clinical Notes by Employing Natural Language Processing.

作者信息

Schmeelk Suzanna, Dogo Martins Samuel, Peng Yifan, Patra Braja Gopal

机构信息

St. John's University, Queens, New York.

Queen's University Belfast, United Kingdom.

出版信息

Proc Annu Hawaii Int Conf Syst Sci. 2022;2022:4140-4146. doi: 10.24251/hicss.2022.505. Epub 2022 Jan 4.

DOI:10.24251/hicss.2022.505
PMID:35528964
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9076271/
Abstract

Clinical notes, which can be embedded into electronic medical records, document patient care delivery and summarize interactions between healthcare providers and patients. These clinical notes directly inform patient care and can also indirectly inform research and quality/safety metrics, among other indirect metrics. Recently, some states within the United States of America require patients to have open access to their clinical notes to improve the exchange of patient information for patient care. Thus, developing methods to assess the cyber risks of clinical notes before sharing and exchanging data is critical. While existing natural language processing techniques are geared to de-identify clinical notes, to the best of our knowledge, few have focused on classifying sensitive-information risk, which is a fundamental step toward developing effective, widespread protection of patient health information. To bridge this gap, this research investigates methods for identifying security/privacy risks within clinical notes. The classification either can be used upstream to identify areas within notes that likely contain sensitive information or downstream to improve the identification of clinical notes that have not been entirely de-identified. We develop several models using unigram and word2vec features with different classifiers to categorize sentence risk. Experiments on i2b2 de-identification dataset show that the SVM classifier using word2vec features obtained a maximum F1-score of 0.792. Future research involves articulation and differentiation of risk in terms of different global regulatory requirements.

摘要

临床记录可嵌入电子病历中,记录患者护理情况,并总结医疗服务提供者与患者之间的互动。这些临床记录直接为患者护理提供信息,也可间接为研究以及质量/安全指标等其他间接指标提供信息。最近,美国的一些州要求患者能够公开获取自己的临床记录,以改善患者护理中患者信息的交换。因此,在共享和交换数据之前开发评估临床记录网络风险的方法至关重要。虽然现有的自然语言处理技术旨在对临床记录进行去识别处理,但据我们所知,很少有技术专注于对敏感信息风险进行分类,而这是朝着有效、广泛地保护患者健康信息迈出的关键一步。为了弥补这一差距,本研究调查了识别临床记录中安全/隐私风险的方法。这种分类既可以在流程上游用于识别记录中可能包含敏感信息的区域,也可以在流程下游用于改进对尚未完全去识别的临床记录的识别。我们使用一元语法和词向量特征以及不同的分类器开发了几种模型,对句子风险进行分类。在i2b2去识别数据集上进行的实验表明,使用词向量特征的支持向量机分类器获得的最大F1分数为0.792。未来的研究包括根据不同的全球监管要求阐明和区分风险。