• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于精确实体识别的扩散增强条件随机场和双向长短期记忆网络框架。

A diffusion enhanced CRF and BiLSTM framework for accurate entity recognition.

作者信息

Qiu Yunfei, Dong Libo, Zhang Wenwen, Xing Haoran, Huang Junwei

机构信息

Software Engineering, Liaoning Technical University, Huludao, 123105, Liaoning, China.

出版信息

Sci Rep. 2025 Jun 4;15(1):19670. doi: 10.1038/s41598-025-04036-x.

DOI:10.1038/s41598-025-04036-x
PMID:40467889
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12137753/
Abstract

In Named Entity Recognition tasks, the diffusion model effectively processes discrete data. However, the original model struggles with capturing long-distance dependencies and integrating contextual information, making it difficult to recognize related entities and handle complex syntactic structures. These issues result in ambiguity and uncertainty in entity boundary recognition, affecting overall accuracy and stability. To solve this, we suggest a diffusion model with Conditional Random Fields and Bidirectional Long Short-Term Memory layers. Firstly, the BiLSTM-CRF model captures long-distance dependencies and contextual information, enhancing entity boundary recognition accuracy. Secondly, the Tversky and CRF loss functions select optimal label predictions from the probability distribution, integrating these through weighted summation to enhance sequence dependency processing and label accuracy. Thirdly, we introduce self-attention and graph attention mechanisms to handle complex data structures by processing attention probabilities, integrating with the adjacency matrix, and improving the recognition of key entity relationships. Finally, an automatic noise adjustment mechanism modifies noise levels based on performance, enhancing stability and robustness in inconsistent environments. Experiments demonstrate that this approach improves performance on several NER datasets, with significant gains in recall, accuracy, and F1 scores, making the model more robust in handling noisy and complex environments.

摘要

在命名实体识别任务中,扩散模型有效地处理离散数据。然而,原始模型在捕捉长距离依赖关系和整合上下文信息方面存在困难,导致难以识别相关实体并处理复杂的句法结构。这些问题导致实体边界识别中的模糊性和不确定性,影响整体准确性和稳定性。为了解决这个问题,我们提出了一种带有条件随机场和双向长短期记忆层的扩散模型。首先,双向长短期记忆 - 条件随机场(BiLSTM - CRF)模型捕捉长距离依赖关系和上下文信息,提高实体边界识别的准确性。其次,特沃斯基(Tversky)和条件随机场损失函数从概率分布中选择最优标签预测,通过加权求和将这些预测整合起来,以增强序列依赖关系处理和标签准确性。第三,我们引入自注意力和图注意力机制,通过处理注意力概率、与邻接矩阵整合以及改善关键实体关系的识别来处理复杂的数据结构。最后,一种自动噪声调整机制根据性能修改噪声水平,在不一致的环境中增强稳定性和鲁棒性。实验表明,这种方法在几个命名实体识别数据集上提高了性能,在召回率、准确率和F1分数方面有显著提升,使模型在处理噪声和复杂环境时更具鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/df8481d1143d/41598_2025_4036_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/b1bf9b9768c3/41598_2025_4036_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/91ed0a20e1db/41598_2025_4036_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/cfba0d8d2686/41598_2025_4036_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/99e81575045d/41598_2025_4036_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/c4a0781cab94/41598_2025_4036_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/a474621e28ca/41598_2025_4036_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/1b07b40fbce4/41598_2025_4036_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/73f372418791/41598_2025_4036_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/c117d7710d5b/41598_2025_4036_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/c8333774eb59/41598_2025_4036_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/dae7203acc41/41598_2025_4036_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/f6aeef947660/41598_2025_4036_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/8cd1f31fce57/41598_2025_4036_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/6093daed9f37/41598_2025_4036_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/15359364f9bc/41598_2025_4036_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/571c8b2fe4e8/41598_2025_4036_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/fccbd8411916/41598_2025_4036_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/df8481d1143d/41598_2025_4036_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/b1bf9b9768c3/41598_2025_4036_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/91ed0a20e1db/41598_2025_4036_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/cfba0d8d2686/41598_2025_4036_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/99e81575045d/41598_2025_4036_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/c4a0781cab94/41598_2025_4036_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/a474621e28ca/41598_2025_4036_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/1b07b40fbce4/41598_2025_4036_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/73f372418791/41598_2025_4036_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/c117d7710d5b/41598_2025_4036_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/c8333774eb59/41598_2025_4036_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/dae7203acc41/41598_2025_4036_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/f6aeef947660/41598_2025_4036_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/8cd1f31fce57/41598_2025_4036_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/6093daed9f37/41598_2025_4036_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/15359364f9bc/41598_2025_4036_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/571c8b2fe4e8/41598_2025_4036_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/fccbd8411916/41598_2025_4036_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ec/12137753/df8481d1143d/41598_2025_4036_Fig16_HTML.jpg

相似文献

1
A diffusion enhanced CRF and BiLSTM framework for accurate entity recognition.一种用于精确实体识别的扩散增强条件随机场和双向长短期记忆网络框架。
Sci Rep. 2025 Jun 4;15(1):19670. doi: 10.1038/s41598-025-04036-x.
2
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
3
Chinese clinical named entity recognition with radical-level feature and self-attention mechanism.基于词干级特征和自注意力机制的中文临床命名实体识别。
J Biomed Inform. 2019 Oct;98:103289. doi: 10.1016/j.jbi.2019.103289. Epub 2019 Sep 18.
4
Enhancing traditional Chinese medical named entity recognition with Dyn-Att Net: a dynamic attention approach.利用动态注意力网络增强中医命名实体识别:一种动态注意力方法
PeerJ Comput Sci. 2024 May 31;10:e2022. doi: 10.7717/peerj-cs.2022. eCollection 2024.
5
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
6
Entity recognition of Chinese medical text based on multi-head self-attention combined with BILSTM-CRF.基于多头自注意力机制结合 BiLSTM-CRF 的中文医疗文本实体识别
Math Biosci Eng. 2022 Jan 4;19(3):2206-2218. doi: 10.3934/mbe.2022103.
7
Research on named entity recognition of Traditional Chinese Medicine chest discomfort cases incorporating domain vocabulary features.基于领域词汇特征的中医胸痹病案命名实体识别研究。
Comput Biol Med. 2023 Nov;166:107466. doi: 10.1016/j.compbiomed.2023.107466. Epub 2023 Sep 9.
8
An imConvNet-based deep learning model for Chinese medical named entity recognition.基于 imConvNet 的深度学习模型在中文医疗命名实体识别中的应用。
BMC Med Inform Decis Mak. 2022 Nov 21;22(1):303. doi: 10.1186/s12911-022-02049-4.
9
Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training.基于词汇特征的 BiLSTM-CRF 和三训练的中药不良事件报告命名实体识别。
J Biomed Inform. 2019 Aug;96:103252. doi: 10.1016/j.jbi.2019.103252. Epub 2019 Jul 16.
10
Relationship extraction between entities with long distance dependencies and noise based on semantic and syntactic features.基于语义和句法特征的具有长距离依存关系和噪声的实体之间的关系提取
Sci Rep. 2025 May 6;15(1):15750. doi: 10.1038/s41598-025-00915-5.

本文引用的文献

1
TaughtNet: Learning Multi-Task Biomedical Named Entity Recognition From Single-Task Teachers.TaughtNet:从单任务教师那里学习多任务生物医学命名实体识别。
IEEE J Biomed Health Inform. 2023 May;27(5):2512-2523. doi: 10.1109/JBHI.2023.3244044. Epub 2023 May 4.
2
Chinese clinical named entity recognition via multi-head self-attention based BiLSTM-CRF.基于多头自注意力机制的 BiLSTM-CRF 的中文临床命名实体识别。
Artif Intell Med. 2022 May;127:102282. doi: 10.1016/j.artmed.2022.102282. Epub 2022 Mar 18.
3
Candidate region aware nested named entity recognition.
候选区域感知嵌套命名实体识别。
Neural Netw. 2021 Oct;142:340-350. doi: 10.1016/j.neunet.2021.02.019. Epub 2021 Mar 16.
4
Biomedical named entity recognition using BERT in the machine reading comprehension framework.基于机器阅读理解框架的 BERT 在生物医学命名实体识别中的应用。
J Biomed Inform. 2021 Jun;118:103799. doi: 10.1016/j.jbi.2021.103799. Epub 2021 May 6.
5
Improving biomedical named entity recognition with syntactic information.利用句法信息提高生物医学命名实体识别。
BMC Bioinformatics. 2020 Nov 25;21(1):539. doi: 10.1186/s12859-020-03834-6.
6
Incorporating dictionaries into deep neural networks for the Chinese clinical named entity recognition.将词典纳入深度神经网络以进行中文临床命名实体识别。
J Biomed Inform. 2019 Apr;92:103133. doi: 10.1016/j.jbi.2019.103133. Epub 2019 Feb 25.
7
An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition.基于注意力机制的 BiLSTM-CRF 方法在文档级化学命名实体识别中的应用。
Bioinformatics. 2018 Apr 15;34(8):1381-1388. doi: 10.1093/bioinformatics/btx761.
8
NCBI disease corpus: a resource for disease name recognition and concept normalization.NCBI疾病语料库:一种用于疾病名称识别和概念规范化的资源。
J Biomed Inform. 2014 Feb;47:1-10. doi: 10.1016/j.jbi.2013.12.006. Epub 2014 Jan 3.