Accelerated curation of checkpoint inhibitor-induced colitis cases from electronic health records.

Authors

Rahman Protiva, Ye Cheng, Mittendorf Kathleen F, Lenoue-Newton Michele, Micheel Christine, Wolber Jan, Osterman Travis, Fabbri Daniel

Affiliations

Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA.

Vanderbilt Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA.

Publication

JAMIA Open. 2023 Apr 1;6(1):ooad017. doi: 10.1093/jamiaopen/ooad017. eCollection 2023 Apr.

DOI:10.1093/jamiaopen/ooad017
PMID:37012912
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10066800/
Abstract

OBJECTIVE

Automatically identifying patients at risk of immune checkpoint inhibitor (ICI)-induced colitis allows physicians to improve patient care. However, predictive models require training data curated from electronic health records (EHR). Our objective is to automatically identify notes documenting ICI-colitis cases to accelerate data curation.

MATERIALS AND METHODS

We present a data pipeline to automatically identify ICI-colitis from EHR notes, accelerating chart review. The pipeline relies on BERT, a state-of-the-art natural language processing (NLP) model. The first stage of the pipeline segments long notes using keywords identified through a logistic classifier and applies BERT to identify ICI-colitis notes. The next stage uses a second BERT model tuned to identify false positives, removing notes that were likely flagged as positive only because they mention colitis as a side effect. The final stage further accelerates curation by highlighting the colitis-relevant portions of notes. Specifically, we use BERT's attention scores to find high-density regions describing colitis.
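The segmentation and highlighting steps can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `keyword_segments` and `densest_window`, the fixed token window, and the sliding-window definition of a "high-density region" are all hypothetical. In the paper the keywords come from a logistic classifier and the scores from BERT's attention; here both are supplied by hand.

```python
from typing import List, Set, Tuple


def keyword_segments(tokens: List[str], keywords: Set[str],
                     window: int = 5) -> List[Tuple[int, int]]:
    """Return merged [start, end) token spans around each keyword hit.

    Assumption: a segment is simply `window` tokens on either side of a
    keyword; overlapping or touching segments are merged into one.
    """
    spans: List[Tuple[int, int]] = []
    for i, tok in enumerate(tokens):
        if tok.lower().strip(".,;:") in keywords:
            start = max(0, i - window)
            end = min(len(tokens), i + window + 1)
            if spans and start <= spans[-1][1]:
                # Extend the previous span instead of opening a new one.
                spans[-1] = (spans[-1][0], max(spans[-1][1], end))
            else:
                spans.append((start, end))
    return spans


def densest_window(scores: List[float], width: int) -> Tuple[int, int]:
    """Return the [start, end) window of `width` tokens with the largest
    total score (a stand-in for per-token attention scores)."""
    if not scores:
        return (0, 0)
    width = min(width, len(scores))
    current = sum(scores[:width])
    best_start, best_sum = 0, current
    for start in range(1, len(scores) - width + 1):
        # Slide the window one token to the right.
        current += scores[start + width - 1] - scores[start - 1]
        if current > best_sum:
            best_start, best_sum = start, current
    return (best_start, best_start + width)


# Hypothetical note text and keyword list, for illustration only.
note = ("Patient started nivolumab last month . Now reports diarrhea "
        "and colitis confirmed on biopsy . Plan steroids .").split()
segments = keyword_segments(note, {"colitis", "diarrhea"}, window=2)
highlight = densest_window([0.1, 0.0, 0.9, 0.8, 0.1], width=2)
```

Each segment would then be passed to a classifier, and the highlighted window shown to the curator first. A real pipeline would also need to map BERT subword tokens back to note text, which this sketch ignores.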

RESULTS

The overall pipeline identified colitis notes with 84% precision and reduced the curator note review load by 75%. The segment BERT classifier had a high recall of 0.98, which is crucial given the low incidence (<10%) of colitis.
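For readers less familiar with these metrics, the arithmetic behind precision, recall, and review-load reduction can be sketched as follows. The counts are invented toy numbers, not the study's data, and defining review-load reduction as the share of notes the curator no longer has to read is an assumption about how the figure is computed.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of flagged notes that are true colitis notes."""
    return tp / (tp + fp)


def recall(tp: int, fn: int) -> float:
    """Fraction of true colitis notes that the pipeline flagged."""
    return tp / (tp + fn)


def review_load_reduction(flagged: int, total: int) -> float:
    """Share of notes the curator can skip if only flagged notes are read
    (assumed definition, not taken from the paper)."""
    return 1 - flagged / total


# Toy counts, purely illustrative.
tp, fp, fn, total = 42, 8, 3, 200
p = precision(tp, fp)                        # 42 / 50
r = recall(tp, fn)                           # 42 / 45
saved = review_load_reduction(tp + fp, total)  # 1 - 50 / 200
```

High recall matters most when positives are rare: each missed note removes a larger share of an already small training set, while false positives only cost some extra review time.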

DISCUSSION

Curation from EHR notes is a burdensome task, especially when the curation topic is complicated. Methods described in this work are not only useful for ICI colitis but can also be adapted for other domains.

CONCLUSION

Our extraction pipeline reduces manual note review load and makes EHR data more accessible for research.
