• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从药品不良事件记录中识别中文命名实体:基于激进嵌入与动态嵌入相结合的BERT的双向长短期条件随机场(Bi-LSTM-CRF)模型

Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.

作者信息

Wu Hong, Ji Jiatong, Tian Haimei, Chen Yao, Ge Weihong, Zhang Haixia, Yu Feng, Zou Jianjun, Nakamura Mitsuhiro, Liao Jun

机构信息

School of Science, China Pharmaceutical University, Nanjing, China.

School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China.

出版信息

JMIR Med Inform. 2021 Dec 1;9(12):e26407. doi: 10.2196/26407.

DOI:10.2196/26407
PMID:34855616
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8686410/
Abstract

BACKGROUND

With the increasing variety of drugs, the incidence of adverse drug events (ADEs) is increasing year by year. Massive numbers of ADEs are recorded in electronic medical records and adverse drug reaction (ADR) reports, which are important sources of potential ADR information. Meanwhile, it is essential to make latent ADR information automatically available for better postmarketing drug safety reevaluation and pharmacovigilance.

OBJECTIVE

This study describes how to identify ADR-related information from Chinese ADE reports.

METHODS

Our study established an efficient automated tool, named BBC-Radical. BBC-Radical is a model that consists of 3 components: Bidirectional Encoder Representations from Transformers (BERT), bidirectional long short-term memory (bi-LSTM), and conditional random field (CRF). The model identifies ADR-related information from Chinese ADR reports. Token features and radical features of Chinese characters were used to represent the common meaning of a group of words. BERT and Bi-LSTM-CRF were novel models that combined these features to conduct named entity recognition (NER) tasks in the free-text section of 24,890 ADR reports from the Jiangsu Province Adverse Drug Reaction Monitoring Center from 2010 to 2016. Moreover, the man-machine comparison experiment on the ADE records from Drum Tower Hospital was designed to compare the NER performance between the BBC-Radical model and a manual method.

RESULTS

The NER model achieved relatively high performance, with a precision of 96.4%, recall of 96.0%, and F1 score of 96.2%. This indicates that the performance of the BBC-Radical model (precision 87.2%, recall 85.7%, and F1 score 86.4%) is much better than that of the manual method (precision 86.1%, recall 73.8%, and F1 score 79.5%) in the recognition task of each kind of entity.

CONCLUSIONS

The proposed model was competitive in extracting ADR-related information from ADE reports, and the results suggest that the application of our method to extract ADR-related information is of great significance in improving the quality of ADR reports and postmarketing drug safety evaluation.

摘要

背景

随着药物种类的不断增加,药物不良事件(ADEs)的发生率逐年上升。大量的ADEs记录在电子病历和药物不良反应(ADR)报告中,这些是潜在ADR信息的重要来源。同时,使潜在的ADR信息自动可用对于更好地进行上市后药物安全性再评估和药物警戒至关重要。

目的

本研究描述了如何从中文ADE报告中识别与ADR相关的信息。

方法

我们的研究建立了一个高效的自动化工具,名为BBC-Radical。BBC-Radical是一个由3个组件组成的模型:来自变换器的双向编码器表示(BERT)、双向长短期记忆(bi-LSTM)和条件随机场(CRF)。该模型从中文ADR报告中识别与ADR相关的信息。汉字的词元特征和部首特征被用来表示一组词的共同含义。BERT和Bi-LSTM-CRF是新颖的模型,它们结合这些特征在2010年至2016年江苏省药品不良反应监测中心的24890份ADR报告的自由文本部分进行命名实体识别(NER)任务。此外,设计了对鼓楼医院ADE记录的人机对比实验,以比较BBC-Radical模型和人工方法之间的NER性能。

结果

NER模型取得了相对较高的性能,精确率为96.4%,召回率为96.0%,F1分数为96.2%。这表明在各类实体的识别任务中,BBC-Radical模型(精确率87.2%,召回率85.7%,F1分数86.4%)的性能比人工方法(精确率86.1%,召回率73.8%,F1分数79.5%)要好得多。

结论

所提出的模型在从ADE报告中提取与ADR相关的信息方面具有竞争力,结果表明我们的方法在提取与ADR相关的信息方面的应用对于提高ADR报告的质量和上市后药物安全性评估具有重要意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/b997f984f0ef/medinform_v9i12e26407_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/a2ae3445b6e8/medinform_v9i12e26407_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/2b282c5e535a/medinform_v9i12e26407_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/0c3db8368d49/medinform_v9i12e26407_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/b997f984f0ef/medinform_v9i12e26407_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/a2ae3445b6e8/medinform_v9i12e26407_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/2b282c5e535a/medinform_v9i12e26407_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/0c3db8368d49/medinform_v9i12e26407_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e036/8686410/b997f984f0ef/medinform_v9i12e26407_fig4.jpg

相似文献

1
Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.从药品不良事件记录中识别中文命名实体:基于激进嵌入与动态嵌入相结合的BERT的双向长短期条件随机场(Bi-LSTM-CRF)模型
JMIR Med Inform. 2021 Dec 1;9(12):e26407. doi: 10.2196/26407.
2
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
3
Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training.基于词汇特征的 BiLSTM-CRF 和三训练的中药不良事件报告命名实体识别。
J Biomed Inform. 2019 Aug;96:103252. doi: 10.1016/j.jbi.2019.103252. Epub 2019 Jul 16.
4
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
5
Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods.基于机器学习方法的中文电子健康记录临床命名实体识别
JMIR Med Inform. 2018 Dec 17;6(4):e50. doi: 10.2196/medinform.9965.
6
Deep learning approaches for extracting adverse events and indications of dietary supplements from clinical text.深度学习方法从临床文本中提取膳食补充剂的不良事件和适应证。
J Am Med Inform Assoc. 2021 Mar 1;28(3):569-577. doi: 10.1093/jamia/ocaa218.
7
MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records.MF-MNER:中文临床电子病历中的多模型融合命名实体识别。
Interdiscip Sci. 2024 Jun;16(2):489-502. doi: 10.1007/s12539-024-00624-z. Epub 2024 Apr 5.
8
Korean clinical entity recognition from diagnosis text using BERT.基于 BERT 的韩语文本临床实体识别。
BMC Med Inform Decis Mak. 2020 Sep 30;20(Suppl 7):242. doi: 10.1186/s12911-020-01241-8.
9
An imConvNet-based deep learning model for Chinese medical named entity recognition.基于 imConvNet 的深度学习模型在中文医疗命名实体识别中的应用。
BMC Med Inform Decis Mak. 2022 Nov 21;22(1):303. doi: 10.1186/s12911-022-02049-4.
10
Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework.基于 BERT(来自 Transformers 的双向编码器表示)的深度学习方法在提取中文放射学报告证据中的应用:计算机辅助肝癌诊断框架的开发。
J Med Internet Res. 2021 Jan 12;23(1):e19689. doi: 10.2196/19689.

引用本文的文献

1
Identifying Adverse Drug Events in Clinical Text Using Fine-Tuned Clinical Language Models: Machine Learning Study.使用微调临床语言模型识别临床文本中的药物不良事件:机器学习研究
JMIR Form Res. 2025 Sep 11;9:e71949. doi: 10.2196/71949.
2
Large Language Models for Adverse Drug Events: A Clinical Perspective.用于药物不良事件的大语言模型:临床视角
J Clin Med. 2025 Aug 4;14(15):5490. doi: 10.3390/jcm14155490.
3
A review of recent artificial intelligence for traditional medicine.近期人工智能在传统医学中的应用综述。

本文引用的文献

1
Ontological Organization and Bioinformatic Analysis of Adverse Drug Reactions From Package Inserts: Development and Usability Study.从药品说明书中提取药物不良反应的本体组织和生物信息学分析:开发和可用性研究。
J Med Internet Res. 2020 Jul 20;22(7):e20443. doi: 10.2196/20443.
2
Chinese clinical named entity recognition with variant neural structures based on BERT methods.基于 BERT 方法的中文临床命名实体识别与变体神经结构。
J Biomed Inform. 2020 Jul;107:103422. doi: 10.1016/j.jbi.2020.103422. Epub 2020 Apr 28.
3
Extracting comprehensive clinical information for breast cancer using deep learning methods.
J Tradit Complement Med. 2025 Feb 21;15(3):215-228. doi: 10.1016/j.jtcme.2025.02.009. eCollection 2025 May.
4
Natural language processing of electronic medical records identifies cardioprotective agents for anthracycline induced cardiotoxicity.电子病历的自然语言处理可识别用于蒽环类药物诱导心脏毒性的心脏保护剂。
Sci Rep. 2025 Feb 24;15(1):6678. doi: 10.1038/s41598-025-91187-6.
5
Chinese Clinical Named Entity Recognition With Segmentation Synonym Sentence Synthesis Mechanism: Algorithm Development and Validation.基于分词、同义词和句子合成机制的中文临床命名实体识别:算法开发与验证
JMIR Med Inform. 2024 Nov 21;12:e60334. doi: 10.2196/60334.
6
Post-marketing surveillance of anticancer drugs using natural language processing of electronic medical records.利用电子病历的自然语言处理技术对抗癌药物进行上市后监测。
NPJ Digit Med. 2024 Nov 9;7(1):315. doi: 10.1038/s41746-024-01323-1.
7
Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.评估医疗保健中的实体识别:实体模型定量研究。
JMIR Med Inform. 2024 Oct 17;12:e59782. doi: 10.2196/59782.
8
Automated System to Capture Patient Symptoms From Multitype Japanese Clinical Texts: Retrospective Study.从多类型日本临床文本中自动捕获患者症状的系统:回顾性研究。
JMIR Med Inform. 2024 Sep 24;12:e58977. doi: 10.2196/58977.
9
exKidneyBERT: a language model for kidney transplant pathology reports and the crucial role of extended vocabularies.exKidneyBERT:一种用于肾移植病理报告的语言模型及扩展词汇表的关键作用。
PeerJ Comput Sci. 2024 Feb 28;10:e1888. doi: 10.7717/peerj-cs.1888. eCollection 2024.
10
A Deep Learning Model for the Normalization of Institution Names by Multisource Literature Feature Fusion: Algorithm Development Study.一种基于多源文献特征融合的机构名称规范化深度学习模型:算法开发研究
JMIR Form Res. 2023 Aug 18;7:e47434. doi: 10.2196/47434.
利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
4
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
5
Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training.基于词汇特征的 BiLSTM-CRF 和三训练的中药不良事件报告命名实体识别。
J Biomed Inform. 2019 Aug;96:103252. doi: 10.1016/j.jbi.2019.103252. Epub 2019 Jul 16.
6
The Adverse Drug Reactions From Patient Reports in Social Media Project: Protocol for an Evaluation Against a Gold Standard.社交媒体项目中患者报告的药物不良反应:针对金标准的评估方案
JMIR Res Protoc. 2019 May 7;8(5):e11448. doi: 10.2196/11448.
7
An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition.基于注意力机制的 BiLSTM-CRF 方法在文档级化学命名实体识别中的应用。
Bioinformatics. 2018 Apr 15;34(8):1381-1388. doi: 10.1093/bioinformatics/btx761.
8
Prescription extraction using CRFs and word embeddings.使用条件随机场和词嵌入进行处方提取。
J Biomed Inform. 2017 Aug;72:60-66. doi: 10.1016/j.jbi.2017.07.002. Epub 2017 Jul 4.
9
Impact of Medicine Withdrawal on Reporting of Adverse Events Involving Therapeutic Alternatives: A Study from the French Spontaneous Reporting Database.停药对涉及治疗替代药物不良事件报告的影响:来自法国自发报告数据库的一项研究。
Drug Saf. 2017 Nov;40(11):1099-1107. doi: 10.1007/s40264-017-0561-y.
10
Mining e-cigarette adverse events in social media using Bi-LSTM recurrent neural network with word embedding representation.利用带有词嵌入表示的 Bi-LSTM 递归神经网络挖掘社交媒体中的电子烟不良事件。
J Am Med Inform Assoc. 2018 Jan 1;25(1):72-80. doi: 10.1093/jamia/ocx045.