• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于生物医学命名实体识别的字符级神经网络。

Character-level neural network for biomedical named entity recognition.

作者信息

Gridach Mourad

机构信息

High Institute of Technology, Ibn Zohr University, Agadir, Morocco.

出版信息

J Biomed Inform. 2017 Jun;70:85-91. doi: 10.1016/j.jbi.2017.05.002. Epub 2017 May 11.

DOI:10.1016/j.jbi.2017.05.002
PMID:28502909
Abstract

Biomedical named entity recognition (BNER), which extracts important named entities such as genes and proteins, is a challenging task in automated systems that mine knowledge in biomedical texts. The previous state-of-the-art systems required large amounts of task-specific knowledge in the form of feature engineering, lexicons and data pre-processing to achieve high performance. In this paper, we introduce a novel neural network architecture that benefits from both word- and character-level representations automatically, by using a combination of bidirectional long short-term memory (LSTM) and conditional random field (CRF) eliminating the need for most feature engineering tasks. We evaluate our system on two datasets: JNLPBA corpus and the BioCreAtIvE II Gene Mention (GM) corpus. We obtained state-of-the-art performance by outperforming the previous systems. To the best of our knowledge, we are the first to investigate the combination of deep neural networks, CRF, word embeddings and character-level representation in recognizing biomedical named entities.

摘要

生物医学命名实体识别(BNER)是一项在生物医学文本中挖掘知识的自动化系统中的具有挑战性的任务,它用于提取诸如基因和蛋白质等重要命名实体。以前的最先进系统需要大量以特征工程、词典和数据预处理形式存在的特定任务知识才能实现高性能。在本文中,我们介绍了一种新颖的神经网络架构,该架构通过结合双向长短期记忆(LSTM)和条件随机场(CRF)自动受益于单词级和字符级表示,从而无需进行大多数特征工程任务。我们在两个数据集上评估我们的系统:JNLPBA语料库和BioCreAtIvE II基因提及(GM)语料库。我们通过超越以前的系统获得了最先进的性能。据我们所知,我们是第一个在识别生物医学命名实体中研究深度神经网络、CRF、词嵌入和字符级表示相结合的方法的。

相似文献

1
Character-level neural network for biomedical named entity recognition.用于生物医学命名实体识别的字符级神经网络。
J Biomed Inform. 2017 Jun;70:85-91. doi: 10.1016/j.jbi.2017.05.002. Epub 2017 May 11.
2
Long short-term memory RNN for biomedical named entity recognition.用于生物医学命名实体识别的长短期记忆循环神经网络
BMC Bioinformatics. 2017 Oct 30;18(1):462. doi: 10.1186/s12859-017-1868-5.
3
Character level and word level embedding with bidirectional LSTM - Dynamic recurrent neural network for biomedical named entity recognition from literature.基于字符和词的双向 LSTM 嵌入 - 用于从文献中识别生物医学命名实体的动态递归神经网络。
J Biomed Inform. 2020 Dec;112:103609. doi: 10.1016/j.jbi.2020.103609. Epub 2020 Oct 26.
4
Entity recognition from clinical texts via recurrent neural network.基于循环神经网络的临床文本实体识别。
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67. doi: 10.1186/s12911-017-0468-7.
5
Evaluating word representation features in biomedical named entity recognition tasks.评估生物医学命名实体识别任务中的词表示特征。
Biomed Res Int. 2014;2014:240403. doi: 10.1155/2014/240403. Epub 2014 Mar 6.
6
Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.用于健康领域命名实体识别的具有专用词嵌入的递归神经网络。
J Biomed Inform. 2017 Dec;76:102-109. doi: 10.1016/j.jbi.2017.11.007. Epub 2017 Nov 13.
7
Clinical Named Entity Recognition Using Deep Learning Models.使用深度学习模型的临床命名实体识别
AMIA Annu Symp Proc. 2018 Apr 16;2017:1812-1819. eCollection 2017.
8
Integrating Language Model and Reading Control Gate in BLSTM-CRF for Biomedical Named Entity Recognition.在 BLSTM-CRF 中集成语言模型和阅读控制门进行生物医学命名实体识别。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):841-846. doi: 10.1109/TCBB.2018.2868346. Epub 2018 Sep 3.
9
Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition.基于 CNN 和 LSTM 的组合特征嵌入的生物医学命名实体识别。
J Biomed Inform. 2020 Mar;103:103381. doi: 10.1016/j.jbi.2020.103381. Epub 2020 Jan 28.
10
Transfer learning for biomedical named entity recognition with neural networks.基于神经网络的生物医学命名实体识别的迁移学习。
Bioinformatics. 2018 Dec 1;34(23):4087-4094. doi: 10.1093/bioinformatics/bty449.

引用本文的文献

1
Biomedical named entity recognition using improved green anaconda-assisted Bi-GRU-based hierarchical ResNet model.使用改进的绿色蟒蛇辅助的基于双向门控循环单元的分层残差神经网络模型进行生物医学命名实体识别。
BMC Bioinformatics. 2025 Jan 30;26(1):34. doi: 10.1186/s12859-024-06008-w.
2
Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.评估医疗保健中的实体识别:实体模型定量研究。
JMIR Med Inform. 2024 Oct 17;12:e59782. doi: 10.2196/59782.
3
Automated Information Extraction from Thyroid Operation Narrative: A Comparative Study of GPT-4 and Fine-tuned KoELECTRA.
从甲状腺手术记录中自动提取信息:GPT-4与微调后的KoELECTRA的比较研究
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:249-257. eCollection 2024.
4
Large language model based framework for automated extraction of genetic interactions from unstructured data.基于大型语言模型的框架,用于从非结构化数据中自动提取遗传相互作用。
PLoS One. 2024 May 21;19(5):e0303231. doi: 10.1371/journal.pone.0303231. eCollection 2024.
5
BioBBC: a multi-feature model that enhances the detection of biomedical entities.生物 BBC:一种增强生物医学实体检测的多特征模型。
Sci Rep. 2024 Apr 2;14(1):7697. doi: 10.1038/s41598-024-58334-x.
6
A comprehensive large scale biomedical knowledge graph for AI powered data driven biomedical research.一个用于人工智能驱动的数据驱动型生物医学研究的综合性大规模生物医学知识图谱。
bioRxiv. 2025 Mar 4:2023.10.13.562216. doi: 10.1101/2023.10.13.562216.
7
Protein-Protein Interaction Network Extraction Using Text Mining Methods Adds Insight into Autism Spectrum Disorder.使用文本挖掘方法提取蛋白质-蛋白质相互作用网络可深入了解自闭症谱系障碍。
Biology (Basel). 2023 Oct 18;12(10):1344. doi: 10.3390/biology12101344.
8
Early Predicting Tribocorrosion Rate of Dental Implant Titanium Materials Using Random Forest Machine Learning Models.使用随机森林机器学习模型早期预测牙科种植体钛材料的摩擦腐蚀速率
Tribol Int. 2023 Sep;187. doi: 10.1016/j.triboint.2023.108735. Epub 2023 Jun 26.
9
A Joint Extraction System Based on Conditional Layer Normalization for Health Monitoring.基于条件层归一化的健康监测联合提取系统。
Sensors (Basel). 2023 May 16;23(10):4812. doi: 10.3390/s23104812.
10
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.