• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用动态注意力网络增强中医命名实体识别:一种动态注意力方法

Enhancing traditional Chinese medical named entity recognition with Dyn-Att Net: a dynamic attention approach.

作者信息

Hou Jingming, Saad Saidah, Omar Nazlia

机构信息

Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia.

出版信息

PeerJ Comput Sci. 2024 May 31;10:e2022. doi: 10.7717/peerj-cs.2022. eCollection 2024.

DOI:10.7717/peerj-cs.2022
PMID:39669466
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11636975/
Abstract

Our study focuses on Traditional Chinese Medical (TCM) named entity recognition (NER), which involves identifying and extracting specific entity names from TCM record. This task has significant implications for doctors and researchers, as it enables the automated identification of relevant TCM terms, ultimately enhancing research efficiency and accuracy. However, the current Bidirectional Encoder Representations from Transformers-Long Short Term Memory-Conditional Random Fields (BERT-LSTM-CRF) model for TCM NER is constrained by a traditional structure, limiting its capacity to fully harness the advantages provided by Bidirectional Encoder Representations from Transformers (BERT) and long short term memory (LSTM) models. Through comparative experiments, we also observed that the straightforward superimposition of models actually leads to a decrease in recognition results. To optimize the structure of the traditional BERT-BiLSTM-CRF model and obtain more effective text representations, we propose the Dyn-Att Net model, which introduces dynamic attention and a parallel structure. By integrating BERT and LSTM models with the dynamic attention mechanism, our model effectively captures semantic, contextual, and sequential relations within text sequences, resulting in high accuracy. To validate the effectiveness of our model, we compared it with nine other models in TCM dataset namely the publicly available PaddlePaddle dataset. Our Dyn-Att Net model, based on BERT, outperforms the other models, achieving an F1 score of 81.91%, accuracy of 92.06%, precision of 80.26%, and recall of 83.76%. Furthermore, its robust generalization capability is substantiated through validation on the APTNER, MSRA, and EduNER datasets. Overall, the Dyn-Att Net model not only enhances NER accuracy within the realm of traditional Chinese medicine, but also showcases considerable potential for cross-domain generalization. Moreover, the Dyn-Att Net model's parallel architecture facilitates efficient computation, contributing to time-saving efforts in NER tasks.

摘要

我们的研究聚焦于中医命名实体识别(NER),即从中医记录中识别并提取特定实体名称。这项任务对医生和研究人员具有重要意义,因为它能实现对相关中医术语的自动识别,最终提高研究效率和准确性。然而,当前用于中医NER的双向编码器表征来自变换器-长短期记忆-条件随机场(BERT-LSTM-CRF)模型受到传统结构的限制,限制了其充分利用双向编码器表征来自变换器(BERT)和长短期记忆(LSTM)模型优势的能力。通过对比实验,我们还观察到模型的直接叠加实际上会导致识别结果下降。为了优化传统BERT-BiLSTM-CRF模型的结构并获得更有效的文本表征,我们提出了动态注意力网络(Dyn-Att Net)模型,该模型引入了动态注意力和并行结构。通过将BERT和LSTM模型与动态注意力机制相结合,我们的模型有效地捕捉了文本序列中的语义、上下文和顺序关系,从而实现了高精度。为了验证我们模型的有效性,我们将其与中医数据集中的其他九个模型进行了比较,即公开可用的飞桨数据集。我们基于BERT的Dyn-Att Net模型优于其他模型,F1分数达到81.91%,准确率为92.06%,精确率为80.26%,召回率为83.76%。此外,通过在APTNER、MSRA和EduNER数据集上的验证,证实了其强大的泛化能力。总体而言,Dyn-Att Net模型不仅提高了中医领域内的NER准确性,还展现出了相当大的跨域泛化潜力。此外,Dyn-Att Net模型的并行架构有助于高效计算,在NER任务中节省了时间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/38cffd94cf9c/peerj-cs-10-2022-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/34eaa7a7efb6/peerj-cs-10-2022-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/69c5cdfe5b31/peerj-cs-10-2022-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/ff8b19c59419/peerj-cs-10-2022-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/ffa0011189b6/peerj-cs-10-2022-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/df6b7dd4c8f9/peerj-cs-10-2022-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/38cffd94cf9c/peerj-cs-10-2022-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/34eaa7a7efb6/peerj-cs-10-2022-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/69c5cdfe5b31/peerj-cs-10-2022-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/ff8b19c59419/peerj-cs-10-2022-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/ffa0011189b6/peerj-cs-10-2022-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/df6b7dd4c8f9/peerj-cs-10-2022-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0849/11636975/38cffd94cf9c/peerj-cs-10-2022-g006.jpg

相似文献

1
Enhancing traditional Chinese medical named entity recognition with Dyn-Att Net: a dynamic attention approach.利用动态注意力网络增强中医命名实体识别:一种动态注意力方法
PeerJ Comput Sci. 2024 May 31;10:e2022. doi: 10.7717/peerj-cs.2022. eCollection 2024.
2
Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.从药品不良事件记录中识别中文命名实体:基于激进嵌入与动态嵌入相结合的BERT的双向长短期条件随机场(Bi-LSTM-CRF)模型
JMIR Med Inform. 2021 Dec 1;9(12):e26407. doi: 10.2196/26407.
3
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
4
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
5
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.
6
Adversarial active learning for the identification of medical concepts and annotation inconsistency.对抗式主动学习在医学概念识别和标注不一致性中的应用。
J Biomed Inform. 2020 Aug;108:103481. doi: 10.1016/j.jbi.2020.103481. Epub 2020 Jul 18.
7
An imConvNet-based deep learning model for Chinese medical named entity recognition.基于 imConvNet 的深度学习模型在中文医疗命名实体识别中的应用。
BMC Med Inform Decis Mak. 2022 Nov 21;22(1):303. doi: 10.1186/s12911-022-02049-4.
8
Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT.基于混合神经网络和医学 MC-BERT 的中文电子病历命名实体识别。
BMC Med Inform Decis Mak. 2022 Dec 1;22(1):315. doi: 10.1186/s12911-022-02059-2.
9
Research on named entity recognition of Traditional Chinese Medicine chest discomfort cases incorporating domain vocabulary features.基于领域词汇特征的中医胸痹病案命名实体识别研究。
Comput Biol Med. 2023 Nov;166:107466. doi: 10.1016/j.compbiomed.2023.107466. Epub 2023 Sep 9.
10
Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.评估医疗保健中的实体识别:实体模型定量研究。
JMIR Med Inform. 2024 Oct 17;12:e59782. doi: 10.2196/59782.

引用本文的文献

1
Chinese crop diseases and pests named entity recognition based on variational information bottleneck and feature enhancement.基于变分信息瓶颈和特征增强的中国农作物病虫害命名实体识别
Sci Rep. 2025 Aug 27;15(1):31573. doi: 10.1038/s41598-025-04252-5.
2
CMedRAGBot: A Chinese Medical Chatbot Based on Graph RAG and Large Language Models.CMedRAGBot:一种基于图结构检索增强生成(Graph RAG)和大语言模型的中文医学聊天机器人。
Interdiscip Sci. 2025 Jun 5. doi: 10.1007/s12539-025-00715-5.
3
MBLSTM is a contextual interaction refined method for time series prediction.

本文引用的文献

1
Research on named entity recognition of Traditional Chinese Medicine chest discomfort cases incorporating domain vocabulary features.基于领域词汇特征的中医胸痹病案命名实体识别研究。
Comput Biol Med. 2023 Nov;166:107466. doi: 10.1016/j.compbiomed.2023.107466. Epub 2023 Sep 9.
2
Advances in the Application of Traditional Chinese Medicine Using Artificial Intelligence: A Review.人工智能在中医药应用中的进展:综述。
Am J Chin Med. 2023;51(5):1067-1083. doi: 10.1142/S0192415X23500490. Epub 2023 Jul 7.
3
EduNER: a Chinese named entity recognition dataset for education research.
门控循环单元长短期记忆网络(MBLSTM)是一种用于时间序列预测的上下文交互优化方法。
Sci Rep. 2025 May 27;15(1):18563. doi: 10.1038/s41598-025-03243-w.
4
Artificial intelligence in traditional Chinese medicine: advances in multi-metabolite multi-target interaction modeling.人工智能在中医领域的应用:多代谢物多靶点相互作用建模的进展
Front Pharmacol. 2025 Apr 15;16:1541509. doi: 10.3389/fphar.2025.1541509. eCollection 2025.
EduNER:一个用于教育研究的中文命名实体识别数据集。
Neural Comput Appl. 2023 May 20:1-15. doi: 10.1007/s00521-023-08635-5.
4
Healthcare knowledge graph construction: A systematic review of the state-of-the-art, open issues, and opportunities.医疗保健知识图谱构建:对最新技术水平、开放问题及机遇的系统综述。
J Big Data. 2023;10(1):81. doi: 10.1186/s40537-023-00774-9. Epub 2023 May 28.
5
TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition.TCMNER 和 PubMed:一种基于汉字的新型模型和中医命名实体识别数据集。
J Healthc Eng. 2021 Aug 7;2021:3544281. doi: 10.1155/2021/3544281. eCollection 2021.
6
Incorporating multi-level CNN and attention mechanism for Chinese clinical named entity recognition.基于多层 CNN 和注意力机制的中文临床命名实体识别。
J Biomed Inform. 2021 Apr;116:103737. doi: 10.1016/j.jbi.2021.103737. Epub 2021 Mar 15.
7
Attention in Natural Language Processing.自然语言处理中的注意力机制。
IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4291-4308. doi: 10.1109/TNNLS.2020.3019893. Epub 2021 Oct 5.
8
Combating COVID-19 with integrated traditional Chinese and Western medicine in China.中国中西医结合抗击新冠肺炎疫情。
Acta Pharm Sin B. 2020 Jul;10(7):1149-1162. doi: 10.1016/j.apsb.2020.06.009. Epub 2020 Jun 27.
9
Quantitative knowledge presentation models of traditional Chinese medicine (TCM): A review.中医(TCM)的定量知识表示模型:综述。
Artif Intell Med. 2020 Mar;103:101810. doi: 10.1016/j.artmed.2020.101810. Epub 2020 Jan 24.
10
An ontological framework for the formalization, organization and usage of TCM-Knowledge.用于中医知识形式化、组织和使用的本体论框架。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):53. doi: 10.1186/s12911-019-0760-9.