Active learning for extracting rare adverse events from electronic health records: A study in pediatric cardiology.

Authors

Quennelle Sophie, Malekzadeh-Milani Sophie, Garcelon Nicolas, Faour Hassan, Burgun Anita, Faviez Carole, Tsopra Rosy, Bonnet Damien, Neuraz Antoine

Affiliations

Inserm, UMR_S1138, Centre de Recherche des Cordeliers, Sorbonne Université, Paris, France; Inria, équipe HeKA, PariSantéCampus, Paris, France; M3C-Necker, Hôpital Universitaire Necker-Enfants malades, Assistance Publique-Hôpitaux de Paris, Paris, France; Université Paris Cité, Paris, France.

M3C-Necker, Hôpital Universitaire Necker-Enfants malades, Assistance Publique-Hôpitaux de Paris, Paris, France.

Publication information

Int J Med Inform. 2025 Mar;195:105761. doi: 10.1016/j.ijmedinf.2024.105761. Epub 2024 Dec 12.

DOI:10.1016/j.ijmedinf.2024.105761
PMID:39689449
Abstract

OBJECTIVE

Automate the extraction of adverse events from the text of electronic medical records of patients hospitalized for cardiac catheterization.

METHODS

We focused on events related to cardiac catheterization as defined by the NCDR-IMPACT registry. These events were extracted from the Necker Children's Hospital data warehouse. Electronic health records were pre-screened using regular expressions. The resulting datasets contained numerous false-positive sentences, which were annotated by a cardiologist using an active learning process. A deep learning text classifier was then trained on this active learning-annotated dataset to accurately identify patients who have suffered a serious adverse event.
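The pre-screening and annotation steps described above can be illustrated with a minimal sketch. The abstract gives neither the actual regular expressions nor the active learning query strategy, so everything below is an assumption: the patterns, the English example sentences, the `toy_proba` stand-in for a classifier, and the choice of uncertainty sampling are all hypothetical and are not the authors' implementation.

```python
import re

# Hypothetical trigger patterns; the study's actual expressions are not given in the abstract.
PATTERNS = [
    re.compile(p, re.IGNORECASE)
    for p in (r"\btamponade\b", r"\barrhythmia\b", r"\bembol\w*")
]


def prescreen(sentences):
    """Keep every sentence matching at least one pattern.

    This favors recall: negated or historical mentions also match,
    so the output contains false positives that still need annotation.
    """
    return [s for s in sentences if any(p.search(s) for p in PATTERNS)]


def most_uncertain(candidates, predict_proba, k=1):
    """Uncertainty sampling (an assumed strategy): return the k candidates
    whose predicted probability of being a true event is closest to 0.5."""
    return sorted(candidates, key=lambda s: abs(predict_proba(s) - 0.5))[:k]


def toy_proba(sentence):
    """Hypothetical stand-in for a trained classifier's probability output."""
    s = sentence.lower()
    if s.startswith("no "):
        return 0.1
    if "none" in s:
        return 0.45
    return 0.9


sentences = [
    "No tamponade was observed after the procedure.",
    "Patient developed arrhythmia requiring cardioversion.",
    "Routine follow-up scheduled in six months.",
    "History of embolism, none during this stay.",
]

candidates = prescreen(sentences)
# The sentence the annotator would be asked to label next.
next_to_label = most_uncertain(candidates, toy_proba, k=1)
```

In a real loop, the annotator's label for `next_to_label` would be added to the training set, the classifier retrained, and the query repeated, so that annotation effort concentrates on the sentences the model finds hardest.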

RESULTS

The dataset included 2,980 patients. Regular expression-based extraction of adverse events related to cardiac catheterization achieved perfect recall. Due to the rarity of adverse events, the dataset obtained from this initial pre-screening step was imbalanced, containing a significant number of false positives. The active learning annotation enabled the acquisition of a representative dataset suitable for training a deep learning model. The deep learning text classifier identified patients who experienced adverse events after cardiac catheterization with a recall of 0.78 and a specificity of 0.94.
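For readers less familiar with the two reported metrics, the following sketch shows how recall and specificity are computed from binary labels. The labels are toy values chosen for illustration and are not the study's data; the function name `recall_specificity` is likewise hypothetical.

```python
def recall_specificity(y_true, y_pred):
    """Compute recall (sensitivity) and specificity for binary labels (1 = adverse event)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    recall = tp / (tp + fn)        # share of true events that were detected
    specificity = tn / (tn + fp)   # share of non-events correctly ruled out
    return recall, specificity


# Toy labels only: 4 patients with an event, 6 without.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]

recall, specificity = recall_specificity(y_true, y_pred)
```

With rare events, specificity matters as much as recall: even a small false-positive rate over thousands of event-free patients can outnumber the true events.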

CONCLUSION

Our model effectively identified patients who experienced adverse events related to cardiac catheterization using real clinical data. Enabled by an active learning annotation process, it shows promise for large language model applications in clinical research, especially for rare diseases with limited annotated databases. Our model's strength lies in its development by physicians for physicians, ensuring its relevance and applicability in clinical practice.

Similar articles

1. Active learning for extracting rare adverse events from electronic health records: A study in pediatric cardiology.
   Int J Med Inform. 2025 Mar;195:105761. doi: 10.1016/j.ijmedinf.2024.105761. Epub 2024 Dec 12.
2. MultiADE: A Multi-domain benchmark for Adverse Drug Event extraction.
   J Biomed Inform. 2024 Dec;160:104744. doi: 10.1016/j.jbi.2024.104744. Epub 2024 Nov 12.
3. Regular expression-based learning to extract bodyweight values from clinical notes.
   J Biomed Inform. 2015 Apr;54:186-90. doi: 10.1016/j.jbi.2015.02.009. Epub 2015 Mar 5.
4. Domain transformation on biological event extraction by learning methods.
   J Biomed Inform. 2019 Jul;95:103236. doi: 10.1016/j.jbi.2019.103236. Epub 2019 Jun 18.
5. Using Synthetic Health Care Data to Leverage Large Language Models for Named Entity Recognition: Development and Validation Study.
   J Med Internet Res. 2025 Mar 18;27:e66279. doi: 10.2196/66279.
6. Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.
   J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
7. Improving entity recognition using ensembles of deep learning and fine-tuned large language models: A case study on adverse event extraction from VAERS and social media.
   J Biomed Inform. 2025 Mar;163:104789. doi: 10.1016/j.jbi.2025.104789. Epub 2025 Feb 7.
8. Extracting adverse drug events from clinical Notes: A systematic review of approaches used.
   J Biomed Inform. 2024 Mar;151:104603. doi: 10.1016/j.jbi.2024.104603. Epub 2024 Feb 6.
9. [A customized method for information extraction from unstructured text data in the electronic medical records].
   Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.
10. Deep Learning-based detection of psychiatric attributes from German mental health records.
    Int J Med Inform. 2022 May;161:104724. doi: 10.1016/j.ijmedinf.2022.104724. Epub 2022 Feb 22.

Cited by

1. The Case for the Pediatric Cardiologist-Informaticist.
   Pediatr Cardiol. 2025 Aug 26. doi: 10.1007/s00246-025-04001-5.
2. COPD-MMDDxNet: a multimodal deep learning framework for accurate COPD diagnosis using electronic medical records.
   Front Med (Lausanne). 2025 Jul 11;12:1601736. doi: 10.3389/fmed.2025.1601736. eCollection 2025.