• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 ERNIE-Doc-BiLSTM-CRF 的特定金融领域未知实体识别。

Recognition of Unknown Entities in Specific Financial Field Based on ERNIE-Doc-BiLSTM-CRF.

机构信息

College of Information and Computer, Taiyuan University of Technology, Taiyuan, Shanxi 030000, China.

出版信息

Comput Intell Neurosci. 2022 May 14;2022:3139898. doi: 10.1155/2022/3139898. eCollection 2022.

DOI:10.1155/2022/3139898
PMID:35607461
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9124090/
Abstract

The Internet is rich in information related to the financial field. The financial entity information text containing new internet vocabulary has a certain impact on the results of existing recognition algorithms. How to solve the problems of new vocabulary and polysemy is a problem to be solved in the current field. This paper proposes an ERNIE-Doc-BiLSTM-CRF named entity recognition model based on the pretrained language model. Compared with the traditional model, the ERNIE-Doc pretrained language model constructs a unique word vector from the word vector and combines the location coding, which solves polysemy problem well. The intensive skimming mechanism realizes the long text processing well and captures the context information effectively. The experimental results show that the accuracy of this model is 86.72%, the recall rate is 83.39%, and the F1 value is 85.02%, which is 13.36% higher than other models; the recall rate is increased by 13.05%, and the F1 value is increased by 13.21%.

摘要

互联网上充斥着大量与金融领域相关的信息。含有新网络词汇的金融实体信息文本对现有识别算法的结果有一定影响。如何解决新词和一词多义问题是当前领域需要解决的问题。本文提出了一种基于预训练语言模型的 ERNIE-Doc-BiLSTM-CRF 命名实体识别模型。与传统模型相比,ERNIE-Doc 预训练语言模型从词向量构建独特的词向量,并结合位置编码,很好地解决了一词多义问题。密集式略读机制很好地实现了长文本处理,并有效地捕获了上下文信息。实验结果表明,该模型的准确率为 86.72%,召回率为 83.39%,F1 值为 85.02%,比其他模型高出 13.36%;召回率提高了 13.05%,F1 值提高了 13.21%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/97e66fb13c80/CIN2022-3139898.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/a3abd0bd2f42/CIN2022-3139898.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/2b24ad409f82/CIN2022-3139898.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/a88c8d72e963/CIN2022-3139898.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/97e66fb13c80/CIN2022-3139898.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/a3abd0bd2f42/CIN2022-3139898.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/2b24ad409f82/CIN2022-3139898.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/a88c8d72e963/CIN2022-3139898.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dbe/9124090/97e66fb13c80/CIN2022-3139898.004.jpg

相似文献

1
Recognition of Unknown Entities in Specific Financial Field Based on ERNIE-Doc-BiLSTM-CRF.基于 ERNIE-Doc-BiLSTM-CRF 的特定金融领域未知实体识别。
Comput Intell Neurosci. 2022 May 14;2022:3139898. doi: 10.1155/2022/3139898. eCollection 2022.
2
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
3
A hybrid approach for named entity recognition in Chinese electronic medical record.中文电子病历命名实体识别的混合方法。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):64. doi: 10.1186/s12911-019-0767-2.
4
Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training.基于词汇特征的 BiLSTM-CRF 和三训练的中药不良事件报告命名实体识别。
J Biomed Inform. 2019 Aug;96:103252. doi: 10.1016/j.jbi.2019.103252. Epub 2019 Jul 16.
5
Deep learning-based methods for natural hazard named entity recognition.基于深度学习的自然灾害命名实体识别方法。
Sci Rep. 2022 Mar 17;12(1):4598. doi: 10.1038/s41598-022-08667-2.
6
Research on named entity recognition of Traditional Chinese Medicine chest discomfort cases incorporating domain vocabulary features.基于领域词汇特征的中医胸痹病案命名实体识别研究。
Comput Biol Med. 2023 Nov;166:107466. doi: 10.1016/j.compbiomed.2023.107466. Epub 2023 Sep 9.
7
Entity recognition of Chinese medical text based on multi-head self-attention combined with BILSTM-CRF.基于多头自注意力机制结合 BiLSTM-CRF 的中文医疗文本实体识别
Math Biosci Eng. 2022 Jan 4;19(3):2206-2218. doi: 10.3934/mbe.2022103.
8
An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records.基于注意力的深度学习模型在中文电子病历临床命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):235. doi: 10.1186/s12911-019-0933-6.
9
Chinese clinical named entity recognition with radical-level feature and self-attention mechanism.基于词干级特征和自注意力机制的中文临床命名实体识别。
J Biomed Inform. 2019 Oct;98:103289. doi: 10.1016/j.jbi.2019.103289. Epub 2019 Sep 18.
10
Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT.基于混合神经网络和医学 MC-BERT 的中文电子病历命名实体识别。
BMC Med Inform Decis Mak. 2022 Dec 1;22(1):315. doi: 10.1186/s12911-022-02059-2.

引用本文的文献

1
Retracted: Recognition of Unknown Entities in Specific Financial Field Based on ERNIE-Doc-BiLSTM-CRF.撤回:基于ERNIE-Doc-BiLSTM-CRF的特定金融领域未知实体识别
Comput Intell Neurosci. 2023 Jul 12;2023:9761350. doi: 10.1155/2023/9761350. eCollection 2023.

本文引用的文献

1
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.