• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用词嵌入和机器学习技术从临床记录中自动识别免疫相关不良事件患者

Automated Identification of Patients With Immune-Related Adverse Events From Clinical Notes Using Word Embedding and Machine Learning.

机构信息

Innovation Center for Biomedical Informatics (ICBI), Georgetown University, Washington, DC.

Memorial Sloan Kettering Cancer Center, Manhattan, New York, NY.

出版信息

JCO Clin Cancer Inform. 2021 May;5:541-549. doi: 10.1200/CCI.20.00109.

DOI:10.1200/CCI.20.00109
PMID:33989017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8462565/
Abstract

PURPOSE

Although immune checkpoint inhibitors (ICIs) have substantially improved survival in patients with advanced malignancies, they are associated with a unique spectrum of side effects termed immune-related adverse events (irAEs). To ensure treatment safety, research efforts are needed to comprehensively detect and understand irAEs. Retrospective analysis of data from electronic health records can provide knowledge to characterize these toxicities. However, such information is not captured in a structured format within the electronic health record and requires manual chart review.

MATERIALS AND METHODS

In this work, we propose a natural language processing pipeline that can automatically annotate clinical notes and determine whether there is evidence that a patient developed an irAE. Seven hundred eighty-one cases were manually reviewed by clinicians and annotated for irAEs at the patient level. A dictionary of irAEs keywords was used to perform text reduction on clinical notes belonging to each patient; only sentences with relevant expressions were kept. Word embeddings were then used to generate vector representations over the reduced text, which served as input for the machine learning classifiers. The output of the models was presence or absence of any irAEs. Additional models were built to classify skin-related toxicities, endocrine toxicities, and colitis.

RESULTS

The model for any irAE achieved an average F1-score = 0.75 and area under the receiver operating characteristic curve = 0.85. This outperformed a basic keyword filtering approach. Although the classifier of any irAEs achieved good accuracy, individual irAE classification still has room for improvement.

CONCLUSION

We demonstrate that patient-level annotations combined with a machine learning approach using keywords filtering and word embeddings can achieve promising accuracy in classifying irAEs in clinical notes. This model may facilitate annotation and analysis of large irAEs data sets.

摘要

目的

尽管免疫检查点抑制剂(ICIs)显著改善了晚期恶性肿瘤患者的生存,但它们与一种称为免疫相关不良事件(irAEs)的独特副作用谱有关。为了确保治疗安全,需要进行研究工作以全面检测和了解 irAEs。电子健康记录中数据的回顾性分析可以提供知识来描述这些毒性。然而,此类信息在电子健康记录中未以结构化格式捕获,需要进行手动图表审查。

材料和方法

在这项工作中,我们提出了一种自然语言处理管道,可以自动注释临床记录并确定患者是否有发生 irAE 的证据。781 例病例由临床医生手动审查并对患者进行 irAE 注释。使用 irAE 关键字词典对每位患者的临床记录进行文本缩减;仅保留具有相关表达的句子。然后使用词嵌入生成简化文本的向量表示,作为机器学习分类器的输入。模型的输出为是否存在任何 irAE。还构建了其他模型来分类皮肤毒性、内分泌毒性和结肠炎。

结果

任何 irAE 的模型平均 F1 得分为 0.75,接收器操作特征曲线下面积为 0.85。这优于基本的关键字过滤方法。虽然任何 irAE 的分类器都具有良好的准确性,但个别 irAE 分类仍有改进的空间。

结论

我们证明了患者级注释与使用关键字过滤和词嵌入的机器学习方法相结合,可以在对临床记录中的 irAE 进行分类时达到有希望的准确性。该模型可以促进 irAEs 大数据集的注释和分析。

相似文献

1
Automated Identification of Patients With Immune-Related Adverse Events From Clinical Notes Using Word Embedding and Machine Learning.使用词嵌入和机器学习技术从临床记录中自动识别免疫相关不良事件患者
JCO Clin Cancer Inform. 2021 May;5:541-549. doi: 10.1200/CCI.20.00109.
2
Electronic patient-reported outcomes and machine learning in predicting immune-related adverse events of immune checkpoint inhibitor therapies.电子患者报告结局与机器学习在预测免疫检查点抑制剂治疗相关免疫不良事件中的应用。
BMC Med Inform Decis Mak. 2021 Jun 30;21(1):205. doi: 10.1186/s12911-021-01564-0.
3
Identification and Characterization of Immune Checkpoint Inhibitor-Induced Toxicities From Electronic Health Records Using Natural Language Processing.利用自然语言处理从电子健康记录中识别和描述免疫检查点抑制剂诱导的毒性。
JCO Clin Cancer Inform. 2024 Apr;8:e2300151. doi: 10.1200/CCI.23.00151.
4
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
5
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.基于机器学习的自然语言处理方法对临床笔记进行医学子域分类。
BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.
6
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
7
Weakly supervised natural language processing for assessing patient-centered outcome following prostate cancer treatment.用于评估前列腺癌治疗后以患者为中心的结果的弱监督自然语言处理
JAMIA Open. 2019 Apr;2(1):150-159. doi: 10.1093/jamiaopen/ooy057. Epub 2019 Jan 4.
8
Looking for low vision: Predicting visual prognosis by fusing structured and free-text data from electronic health records.寻找低视力:通过融合电子健康记录中的结构化和自由文本数据来预测视觉预后。
Int J Med Inform. 2022 Mar;159:104678. doi: 10.1016/j.ijmedinf.2021.104678. Epub 2021 Dec 30.
9
Mining fall-related information in clinical notes: Comparison of rule-based and novel word embedding-based machine learning approaches.挖掘临床记录中与跌倒相关的信息:基于规则和基于新颖词嵌入的机器学习方法的比较。
J Biomed Inform. 2019 Feb;90:103103. doi: 10.1016/j.jbi.2019.103103. Epub 2019 Jan 9.
10
Classifying early infant feeding status from clinical notes using natural language processing and machine learning.使用自然语言处理和机器学习对临床记录进行早期婴儿喂养状态分类。
Sci Rep. 2024 Apr 3;14(1):7831. doi: 10.1038/s41598-024-58299-x.

引用本文的文献

1
Elucidating Celecoxib's Preventive Effect in Capecitabine-Induced Hand-Foot Syndrome Using Medical Natural Language Processing.利用医学自然语言处理阐明塞来昔布在卡培他滨诱导的手足综合征中的预防作用。
JCO Clin Cancer Inform. 2025 Aug;9:e2500096. doi: 10.1200/CCI-25-00096. Epub 2025 Aug 12.
2
Validation of Immune-Related Adverse Event (irAE) Case Definitions in a Real-World Lung Cancer Population.真实世界肺癌人群中免疫相关不良事件(irAE)病例定义的验证
Pharmacoepidemiol Drug Saf. 2025 Feb;34(2):e70100. doi: 10.1002/pds.70100.
3
Artificial intelligence-enabled safety monitoring in Alzheimer's disease clinical trials.阿尔茨海默病临床试验中基于人工智能的安全监测
J Prev Alzheimers Dis. 2025 Jan;12(1):100002. doi: 10.1016/j.tjpad.2024.100002. Epub 2025 Jan 1.
4
Detection of Patient-Level Immunotherapy-Related Adverse Events (irAEs) from Clinical Narratives of Electronic Health Records: A High-Sensitivity Artificial Intelligence Model.从电子健康记录的临床叙述中检测患者层面的免疫治疗相关不良事件(irAEs):一种高灵敏度人工智能模型。
Pragmat Obs Res. 2024 Dec 20;15:243-252. doi: 10.2147/POR.S468253. eCollection 2024.
5
Cancer and treatment specific incidence rates of immune-related adverse events induced by immune checkpoint inhibitors: a systematic review.免疫检查点抑制剂诱导的免疫相关不良事件的癌症及治疗特异性发病率:一项系统综述
Br J Cancer. 2025 Jan;132(1):51-57. doi: 10.1038/s41416-024-02887-1. Epub 2024 Nov 3.
6
Enhancing Precision in Detecting Severe Immune-Related Adverse Events: Comparative Analysis of Large Language Models and International Classification of Disease Codes in Patient Records.提高严重免疫相关不良事件检测的精准度:大型语言模型与患者记录中疾病国际分类代码的比较分析
J Clin Oncol. 2024 Dec 10;42(35):4134-4144. doi: 10.1200/JCO.24.00326. Epub 2024 Sep 3.
7
Navigating the Complexities of Artificial Intelligence-Enabled Real-World Data Collection for Oncology Pharmacovigilance.人工智能驱动的肿瘤药物警戒真实世界数据采集的复杂性探讨。
JCO Clin Cancer Inform. 2024 May;8:e2400051. doi: 10.1200/CCI.24.00051.
8
Identification and Characterization of Immune Checkpoint Inhibitor-Induced Toxicities From Electronic Health Records Using Natural Language Processing.利用自然语言处理从电子健康记录中识别和描述免疫检查点抑制剂诱导的毒性。
JCO Clin Cancer Inform. 2024 Apr;8:e2300151. doi: 10.1200/CCI.23.00151.
9
Promise and Perils of Large Language Models for Cancer Survivorship and Supportive Care.大语言模型在癌症生存和支持性护理中的前景与挑战。
J Clin Oncol. 2024 May 10;42(14):1607-1611. doi: 10.1200/JCO.23.02439. Epub 2024 Mar 7.
10
Adverse drug event detection using natural language processing: A scoping review of supervised learning methods.基于自然语言处理的药物不良反应检测:监督学习方法的范围综述。
PLoS One. 2023 Jan 3;18(1):e0279842. doi: 10.1371/journal.pone.0279842. eCollection 2023.

本文引用的文献

1
Long-Term Outcomes and Responses to Retreatment in Patients With Melanoma Treated With PD-1 Blockade.接受 PD-1 阻断治疗的黑色素瘤患者的长期结果和再治疗应答。
J Clin Oncol. 2020 May 20;38(15):1655-1663. doi: 10.1200/JCO.19.01464. Epub 2020 Feb 13.
2
BioWordVec, improving biomedical word embeddings with subword information and MeSH.BioWordVec,利用子词信息和 MeSH 改进生物医学词向量。
Sci Data. 2019 May 10;6(1):52. doi: 10.1038/s41597-019-0055-0.
3
Weakly supervised natural language processing for assessing patient-centered outcome following prostate cancer treatment.用于评估前列腺癌治疗后以患者为中心的结果的弱监督自然语言处理
JAMIA Open. 2019 Apr;2(1):150-159. doi: 10.1093/jamiaopen/ooy057. Epub 2019 Jan 4.
4
Immunotherapy-related adverse events (irAEs): extraction from FDA drug labels and comparative analysis.免疫疗法相关不良事件(irAEs):从美国食品药品监督管理局(FDA)药品标签中提取及对比分析
JAMIA Open. 2019 Apr;2(1):173-178. doi: 10.1093/jamiaopen/ooy045. Epub 2018 Oct 15.
5
A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。
BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.
6
Extraction of Information Related to Adverse Drug Events from Electronic Health Record Notes: Design of an End-to-End Model Based on Deep Learning.从电子健康记录笔记中提取与药物不良事件相关的信息:基于深度学习的端到端模型设计
JMIR Med Inform. 2018 Nov 26;6(4):e12159. doi: 10.2196/12159.
7
Immune-related adverse events of immune checkpoint inhibitors: a brief review.免疫检查点抑制剂的免疫相关不良事件:简要综述。
Curr Oncol. 2018 Oct;25(5):342-347. doi: 10.3747/co.25.4235. Epub 2018 Oct 31.
8
Clinical assessment of immune-related adverse events.免疫相关不良事件的临床评估
Ther Adv Med Oncol. 2018 Mar 30;10:1758835918764628. doi: 10.1177/1758835918764628. eCollection 2018.
9
Management of Immune-Related Adverse Events in Patients Treated With Immune Checkpoint Inhibitor Therapy: American Society of Clinical Oncology Clinical Practice Guideline Summary.接受免疫检查点抑制剂治疗患者免疫相关不良事件的管理:美国临床肿瘤学会临床实践指南摘要
J Oncol Pract. 2018 Apr;14(4):247-249. doi: 10.1200/JOP.18.00005. Epub 2018 Mar 8.
10
Immune-Related Adverse Events Associated with Immune Checkpoint Blockade.与免疫检查点阻断相关的免疫相关不良事件。
N Engl J Med. 2018 Jan 11;378(2):158-168. doi: 10.1056/NEJMra1703481.