• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合结构特征从中国文献中获得手术程序的长期认知。

Surgical procedure long terms recognition from Chinese literature incorporating structural feature.

作者信息

Jiale Nan, Gao Dongping, Sun Yuanyuan, Li Xiaoying, Shen Xifeng, Li Meiting, Zhang Weining, Ren Huiling, Qin Yi

机构信息

Institute of Medical Information, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, 100020, China.

出版信息

Heliyon. 2022 Oct 29;8(11):e11291. doi: 10.1016/j.heliyon.2022.e11291. eCollection 2022 Nov.

DOI:10.1016/j.heliyon.2022.e11291
PMID:36387477
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9640963/
Abstract

With rapid development of technologies in medical diagnosis and treatment, the novel and complicated concepts and usages of clinical terms especially of surgical procedures have become common in daily routine. Expected to be performed in an operating room and accompanied by an incision based on expert discretion, surgical procedures imply clinical understanding of diagnosis, examination, testing, equipment, drugs and symptoms, etc., but terms expressing surgical procedures are difficult to recognize since the terms are highly distinctive due to long morphological length and complex linguistics phenomena. To achieve higher recognition performance and overcome the challenge of the absence of natural delimiters in Chinese sentences, we propose a Named Entity Recognition (NER) model named Structural-SoftLexicon-Bi-LSTM-CRF (SSBC) empowered by pre-trained model BERT. In particular, we pre-trained a lexicon embedding over large-scale medical corpus to better leverage domain-specific structural knowledge. With input additionally augmented by BERT, rich multigranular information and structural term information is transferred from Structural-SoftLexicon to downstream model Bi-LSTM-CRF. Therefore, we could get a global optimal prediction of input sequence. We evaluate our model on a self-built corpus and results show that SSBC with pre-trained model outperforms other state-of-the-art benchmarks, surpassing at most 3.77% in F1 score. This study hopefully would benefit Diagnostic Related Groups (DRGs) and Diagnosis Intervention Package (DIP) grouping system, medical records statistics and analysis, Medicare payment system, etc.

摘要

随着医学诊断和治疗技术的快速发展,临床术语尤其是外科手术的新颖复杂概念和用法在日常工作中已变得很常见。外科手术预期在手术室进行,并根据专家判断进行切口操作,这意味着要对诊断、检查、测试、设备、药物和症状等有临床理解,但表示外科手术的术语却难以识别,因为这些术语由于形态长度长和语言现象复杂而具有高度独特性。为了实现更高的识别性能并克服中文句子中缺乏自然分隔符的挑战,我们提出了一种名为Structural-SoftLexicon-Bi-LSTM-CRF(SSBC)的命名实体识别(NER)模型,该模型由预训练模型BERT赋能。具体而言,我们在大规模医学语料库上预训练了词嵌入,以更好地利用特定领域的结构知识。通过BERT对输入进行额外增强,丰富的多粒度信息和结构术语信息从Structural-SoftLexicon转移到下游模型Bi-LSTM-CRF。因此,我们可以得到输入序列的全局最优预测。我们在自建语料库上评估了我们的模型,结果表明,带有预训练模型的SSBC优于其他最先进的基准模型,F1分数最多提高了3.77%。本研究有望使诊断相关组(DRGs)和诊断干预包(DIP)分组系统、病历统计与分析、医疗保险支付系统等受益。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/e6f46374b1e7/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/021df261df27/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/c061b66feae3/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/676dccfdb5eb/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/e6f46374b1e7/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/021df261df27/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/c061b66feae3/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/676dccfdb5eb/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19d5/9640963/e6f46374b1e7/gr4.jpg

相似文献

1
Surgical procedure long terms recognition from Chinese literature incorporating structural feature.结合结构特征从中国文献中获得手术程序的长期认知。
Heliyon. 2022 Oct 29;8(11):e11291. doi: 10.1016/j.heliyon.2022.e11291. eCollection 2022 Nov.
2
Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.从药品不良事件记录中识别中文命名实体:基于激进嵌入与动态嵌入相结合的BERT的双向长短期条件随机场(Bi-LSTM-CRF)模型
JMIR Med Inform. 2021 Dec 1;9(12):e26407. doi: 10.2196/26407.
3
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
4
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
5
Korean clinical entity recognition from diagnosis text using BERT.基于 BERT 的韩语文本临床实体识别。
BMC Med Inform Decis Mak. 2020 Sep 30;20(Suppl 7):242. doi: 10.1186/s12911-020-01241-8.
6
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
7
Adversarial active learning for the identification of medical concepts and annotation inconsistency.对抗式主动学习在医学概念识别和标注不一致性中的应用。
J Biomed Inform. 2020 Aug;108:103481. doi: 10.1016/j.jbi.2020.103481. Epub 2020 Jul 18.
8
Chinese clinical named entity recognition with variant neural structures based on BERT methods.基于 BERT 方法的中文临床命名实体识别与变体神经结构。
J Biomed Inform. 2020 Jul;107:103422. doi: 10.1016/j.jbi.2020.103422. Epub 2020 Apr 28.
9
A multi-layer soft lattice based model for Chinese clinical named entity recognition.基于多层软晶格的中文临床命名实体识别模型。
BMC Med Inform Decis Mak. 2022 Jul 30;22(1):201. doi: 10.1186/s12911-022-01924-4.
10
DeIDNER Model: A Neural Network Named Entity Recognition Model for Use in the De-identification of Clinical Notes.DeIDNER模型:一种用于临床记录去识别化的神经网络命名实体识别模型。
Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap. 2022 Feb;5:640-647. doi: 10.5220/0010884500003123.

引用本文的文献

1
Named Entity Recognition in Electronic Health Records: A Methodological Review.电子健康记录中的命名实体识别:方法学综述
Healthc Inform Res. 2023 Oct;29(4):286-300. doi: 10.4258/hir.2023.29.4.286. Epub 2023 Oct 31.

本文引用的文献

1
Automatic data extraction to support meta-analysis statistical analysis: a case study on breast cancer.自动数据提取以支持荟萃分析统计分析:以乳腺癌为例。
BMC Med Inform Decis Mak. 2022 Jun 18;22(1):158. doi: 10.1186/s12911-022-01897-4.
2
How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?你的生物医学命名实体识别模型如何推广到新实体?
IEEE Access. 2022 Mar 8;10:31513-31523. doi: 10.1109/ACCESS.2022.3157854. eCollection 2022.
3
TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition.
TCMNER 和 PubMed:一种基于汉字的新型模型和中医命名实体识别数据集。
J Healthc Eng. 2021 Aug 7;2021:3544281. doi: 10.1155/2021/3544281. eCollection 2021.
4
Towards Chinese clinical named entity recognition by dynamic embedding using domain-specific knowledge.通过使用领域特定知识的动态嵌入实现中文临床命名实体识别
J Biomed Inform. 2020 Jun;106:103435. doi: 10.1016/j.jbi.2020.103435. Epub 2020 Apr 29.
5
Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition.基于 CNN 和 LSTM 的组合特征嵌入的生物医学命名实体识别。
J Biomed Inform. 2020 Mar;103:103381. doi: 10.1016/j.jbi.2020.103381. Epub 2020 Jan 28.
6
Terminologies augmented recurrent neural network model for clinical named entity recognition.基于扩充术语的循环神经网络模型在临床命名实体识别中的应用。
J Biomed Inform. 2020 Feb;102:103356. doi: 10.1016/j.jbi.2019.103356. Epub 2019 Dec 16.
7
Adversarial training based lattice LSTM for Chinese clinical named entity recognition.基于对抗训练的格 lattice LSTM 进行中文临床命名实体识别。
J Biomed Inform. 2019 Nov;99:103290. doi: 10.1016/j.jbi.2019.103290. Epub 2019 Sep 23.
8
DrugMetab: An Integrated Machine Learning and Lexicon Mapping Named Entity Recognition Method for Drug Metabolite.药物代谢:一种集成机器学习和词典映射的药物代谢物命名实体识别方法。
CPT Pharmacometrics Syst Pharmacol. 2018 Nov;7(11):709-717. doi: 10.1002/psp4.12340. Epub 2018 Sep 29.
9
GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text.GRAM-CNN:一种基于局部上下文的深度学习方法,用于生物医学文本中的命名实体识别。
Bioinformatics. 2018 May 1;34(9):1547-1554. doi: 10.1093/bioinformatics/btx815.
10
A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text.中文临床文本中医疗实体识别的新方法。
J Healthc Eng. 2017;2017:4898963. doi: 10.1155/2017/4898963. Epub 2017 Jul 5.