• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于特征词的中医临床术语抽取的半监督方法。

A semi-supervised approach for extracting TCM clinical terms based on feature words.

机构信息

School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai, 201620, China.

China National Institute of Standardization, Beijing, China.

出版信息

BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):118. doi: 10.1186/s12911-020-1108-1.

DOI:10.1186/s12911-020-1108-1
PMID:32646408
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7477860/
Abstract

BACKGROUND

A semi-supervised model is proposed for extracting clinical terms of Traditional Chinese Medicine using feature words.

METHODS

The extraction model is based on BiLSTM-CRF and combined with semi-supervised learning and feature word set, which reduces the cost of manual annotation and leverage extraction results.

RESULTS

Experiment results show that the proposed model improves the extraction of five types of TCM clinical terms, including traditional Chinese medicine, symptoms, patterns, diseases and formulas. The best F1-value of the experiment reaches 78.70% on the test dataset.

CONCLUSIONS

This method can reduce the cost of manual labeling and improve the result in the NER research of TCM clinical terms.

摘要

背景

提出了一种基于特征词的半监督模型,用于提取中医药临床术语。

方法

该提取模型基于 BiLSTM-CRF,并结合了半监督学习和特征词集,降低了人工标注的成本,并利用了提取结果。

结果

实验结果表明,所提出的模型提高了五种类型的中医药临床术语的提取效果,包括中药、症状、证候、疾病和方剂。实验的最佳 F1 值在测试数据集上达到 78.70%。

结论

该方法可以降低人工标注的成本,并提高中医药临床术语的命名实体识别研究中的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fd2/7646366/45a930b1522f/12911_2020_1108_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fd2/7646366/7f3a6af8cb15/12911_2020_1108_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fd2/7646366/d9316f8e2a11/12911_2020_1108_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fd2/7646366/45a930b1522f/12911_2020_1108_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fd2/7646366/7f3a6af8cb15/12911_2020_1108_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fd2/7646366/d9316f8e2a11/12911_2020_1108_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fd2/7646366/45a930b1522f/12911_2020_1108_Fig3_HTML.jpg

相似文献

1
A semi-supervised approach for extracting TCM clinical terms based on feature words.基于特征词的中医临床术语抽取的半监督方法。
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):118. doi: 10.1186/s12911-020-1108-1.
2
Research on named entity recognition of Traditional Chinese Medicine chest discomfort cases incorporating domain vocabulary features.基于领域词汇特征的中医胸痹病案命名实体识别研究。
Comput Biol Med. 2023 Nov;166:107466. doi: 10.1016/j.compbiomed.2023.107466. Epub 2023 Sep 9.
3
Extraction of Traditional Chinese Medicine Entity: Design of a Novel Span-Level Named Entity Recognition Method With Distant Supervision.中药实体提取:一种基于远程监督的新型跨度级命名实体识别方法的设计
JMIR Med Inform. 2021 Jun 14;9(6):e28219. doi: 10.2196/28219.
4
Extracting relations from traditional Chinese medicine literature via heterogeneous entity networks.通过异构实体网络从中医文献中提取关系
J Am Med Inform Assoc. 2016 Mar;23(2):356-65. doi: 10.1093/jamia/ocv092. Epub 2015 Jul 29.
5
Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training.基于词汇特征的 BiLSTM-CRF 和三训练的中药不良事件报告命名实体识别。
J Biomed Inform. 2019 Aug;96:103252. doi: 10.1016/j.jbi.2019.103252. Epub 2019 Jul 16.
6
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
7
Diagnosis knowledge constrained network based on first-order logic for syndrome differentiation.基于一阶逻辑的辨证诊断知识约束网络
Artif Intell Med. 2024 Jan;147:102739. doi: 10.1016/j.artmed.2023.102739. Epub 2023 Nov 30.
8
Deep semi-supervised learning ensemble framework for classifying co-mentions of human proteins and phenotypes.深度半监督学习集成框架,用于分类人类蛋白质和表型的共提及。
BMC Bioinformatics. 2021 Oct 16;22(1):500. doi: 10.1186/s12859-021-04421-z.
9
Automated information extraction model enhancing traditional Chinese medicine RCT evidence extraction (Evi-BERT): algorithm development and validation.增强中医药随机对照试验证据提取的自动化信息提取模型(Evi-BERT):算法开发与验证
Front Artif Intell. 2024 Aug 15;7:1454945. doi: 10.3389/frai.2024.1454945. eCollection 2024.
10
A Multigranularity Text Driven Named Entity Recognition CGAN Model for Traditional Chinese Medicine Literatures.一种基于多粒度文本驱动的中医药文献命名实体识别 CGAN 模型。
Comput Intell Neurosci. 2022 Sep 24;2022:1495841. doi: 10.1155/2022/1495841. eCollection 2022.

引用本文的文献

1
Digital intelligence technology: new quality productivity for precision traditional Chinese medicine.数字智能技术:精准中医的新型质量生产力。
Front Pharmacol. 2025 Apr 8;16:1526187. doi: 10.3389/fphar.2025.1526187. eCollection 2025.
2
Semantic-enhanced graph neural network for named entity recognition in ancient Chinese books.基于语义增强图神经网络的古籍命名实体识别
Sci Rep. 2024 Jul 30;14(1):17488. doi: 10.1038/s41598-024-68561-x.
3
Natural Language Processing Applications for Computer-Aided Diagnosis in Oncology.

本文引用的文献

1
Deep learning with word embeddings improves biomedical named entity recognition.使用词嵌入的深度学习可改善生物医学命名实体识别。
Bioinformatics. 2017 Jul 15;33(14):i37-i48. doi: 10.1093/bioinformatics/btx228.
2
Knowledge graph for TCM health preservation: Design, construction, and applications.中医养生知识图谱:设计、构建与应用
Artif Intell Med. 2017 Mar;77:48-52. doi: 10.1016/j.artmed.2017.04.001. Epub 2017 Apr 21.
3
Character-level neural network for biomedical named entity recognition.用于生物医学命名实体识别的字符级神经网络。
用于肿瘤学计算机辅助诊断的自然语言处理应用
Diagnostics (Basel). 2023 Jan 12;13(2):286. doi: 10.3390/diagnostics13020286.
4
Identifying infected patients using semi-supervised and transfer learning.使用半监督和迁移学习识别感染患者。
J Am Med Inform Assoc. 2022 Sep 12;29(10):1696-1704. doi: 10.1093/jamia/ocac109.
5
Information Extraction from the Text Data on Traditional Chinese Medicine: A Review on Tasks, Challenges, and Methods from 2010 to 2021.从中医文本数据中提取信息:2010年至2021年任务、挑战及方法综述
Evid Based Complement Alternat Med. 2022 May 13;2022:1679589. doi: 10.1155/2022/1679589. eCollection 2022.
6
TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition.TCMNER 和 PubMed:一种基于汉字的新型模型和中医命名实体识别数据集。
J Healthc Eng. 2021 Aug 7;2021:3544281. doi: 10.1155/2021/3544281. eCollection 2021.
J Biomed Inform. 2017 Jun;70:85-91. doi: 10.1016/j.jbi.2017.05.002. Epub 2017 May 11.
4
Current Status of Standardization of Traditional Chinese Medicine in China.中国中医药标准化的现状
Evid Based Complement Alternat Med. 2016;2016:9123103. doi: 10.1155/2016/9123103. Epub 2016 Mar 27.
5
A study of active learning methods for named entity recognition in clinical text.临床文本中命名实体识别的主动学习方法研究
J Biomed Inform. 2015 Dec;58:11-18. doi: 10.1016/j.jbi.2015.09.010. Epub 2015 Sep 15.
6
Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network.基于深度神经网络的中文临床文本命名实体识别
Stud Health Technol Inform. 2015;216:624-8.
7
A comprehensive study of named entity recognition in Chinese clinical text.中文临床文本命名实体识别的综合研究。
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):808-14. doi: 10.1136/amiajnl-2013-002381. Epub 2013 Dec 17.
8
Data processing and analysis in real-world traditional Chinese medicine clinical data: challenges and approaches.真实世界中医药临床数据的数据处理与分析:挑战与方法。
Stat Med. 2012 Mar 30;31(7):653-60. doi: 10.1002/sim.4417. Epub 2011 Dec 9.
9
Traditional Chinese medicine.传统中医
Med Clin North Am. 2002 Jan;86(1):63-73. doi: 10.1016/s0025-7125(03)00072-5.