• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积和 BiGRU 双通道机制融合的中文医学文本分类模型。

A model of integrating convolution and BiGRU dual-channel mechanism for Chinese medical text classifications.

机构信息

School of Software, Henan University, Kaifeng, China.

School of Digital Arts and Communication, Shandong University of Art & Design, Jinan, China.

出版信息

PLoS One. 2023 Mar 16;18(3):e0282824. doi: 10.1371/journal.pone.0282824. eCollection 2023.

DOI:10.1371/journal.pone.0282824
PMID:36928266
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10019650/
Abstract

Recently, a lot of Chinese patients consult treatment plans through social networking platforms, but the Chinese medical text contains rich information, including a large number of medical nomenclatures and symptom descriptions. How to build an intelligence model to automatically classify the text information consulted by patients and recommend the correct department for patients is very important. In order to address the problem of insufficient feature extraction from Chinese medical text and low accuracy, this paper proposes a dual channel Chinese medical text classification model. The model extracts feature of Chinese medical text at different granularity, comprehensively and accurately obtains effective feature information, and finally recommends departments for patients according to text classification. One channel of the model focuses on medical nomenclatures, symptoms and other words related to hospital departments, gives different weights, calculates corresponding feature vectors with convolution kernels of different sizes, and then obtains local text representation. The other channel uses the BiGRU network and attention mechanism to obtain text representation, highlighting the important information of the whole sentence, that is, global text representation. Finally, the model uses full connection layer to combine the representation vectors of the two channels, and uses Softmax classifier for classification. The experimental results show that the accuracy, recall and F1-score of the model are improved by 10.65%, 8.94% and 11.62% respectively compared with the baseline models in average, which proves that our model has better performance and robustness.

摘要

最近,很多中国患者通过社交网络平台咨询治疗方案,但中文医疗文本包含丰富的信息,包括大量的医学术语和症状描述。如何构建一个智能模型,自动对患者咨询的文本信息进行分类,并为患者推荐正确的科室,这非常重要。为了解决中文医疗文本特征提取不足、准确率低的问题,本文提出了一种双通道中文医疗文本分类模型。该模型在不同粒度上提取中文医疗文本的特征,全面准确地获取有效的特征信息,最后根据文本分类为患者推荐科室。模型的一个通道侧重于与医院科室相关的医学术语、症状等词汇,给予不同的权重,用不同大小的卷积核计算相应的特征向量,从而得到局部文本表示。另一个通道使用 BiGRU 网络和注意力机制获取文本表示,突出整句的重要信息,即全局文本表示。最后,模型使用全连接层将两个通道的表示向量进行组合,并使用 Softmax 分类器进行分类。实验结果表明,与基线模型相比,该模型在平均准确率、召回率和 F1 分数上分别提高了 10.65%、8.94%和 11.62%,证明了我们的模型具有更好的性能和鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/47bbaf4461b6/pone.0282824.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/ecd87f4ee9e3/pone.0282824.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/ca039ca1fc1b/pone.0282824.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/504000647706/pone.0282824.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/cd4311f3f59d/pone.0282824.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/5302b3431666/pone.0282824.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/3c845b7e8a75/pone.0282824.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/2626f4e07039/pone.0282824.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/0c2f646d4ace/pone.0282824.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/55a48e6a0ac7/pone.0282824.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/aef59f84602d/pone.0282824.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/5e92b3c67d11/pone.0282824.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/47bbaf4461b6/pone.0282824.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/ecd87f4ee9e3/pone.0282824.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/ca039ca1fc1b/pone.0282824.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/504000647706/pone.0282824.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/cd4311f3f59d/pone.0282824.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/5302b3431666/pone.0282824.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/3c845b7e8a75/pone.0282824.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/2626f4e07039/pone.0282824.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/0c2f646d4ace/pone.0282824.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/55a48e6a0ac7/pone.0282824.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/aef59f84602d/pone.0282824.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/5e92b3c67d11/pone.0282824.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d23/10019650/47bbaf4461b6/pone.0282824.g012.jpg

相似文献

1
A model of integrating convolution and BiGRU dual-channel mechanism for Chinese medical text classifications.基于卷积和 BiGRU 双通道机制融合的中文医学文本分类模型。
PLoS One. 2023 Mar 16;18(3):e0282824. doi: 10.1371/journal.pone.0282824. eCollection 2023.
2
DCCL: Dual-channel hybrid neural network combined with self-attention for text classification.DCCL:双通道混合神经网络与自注意力相结合的文本分类方法。
Math Biosci Eng. 2023 Jan;20(2):1981-1992. doi: 10.3934/mbe.2023091. Epub 2022 Nov 9.
3
Named Entity Recognition of Medical Text Based on the Deep Neural Network.基于深度神经网络的医学文本命名实体识别
J Healthc Eng. 2022 Mar 7;2022:3990563. doi: 10.1155/2022/3990563. eCollection 2022.
4
Chinese text classification method based on sentence information enhancement and feature fusion.基于句子信息增强与特征融合的中文文本分类方法
Heliyon. 2024 Aug 24;10(17):e36861. doi: 10.1016/j.heliyon.2024.e36861. eCollection 2024 Sep 15.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Integrating machine learning with linguistic features: A universal method for extraction and normalization of temporal expressions in Chinese texts.将机器学习与语言特征相结合:一种用于中文文本中时间表达式提取与规范化的通用方法。
Comput Methods Programs Biomed. 2023 May;233:107474. doi: 10.1016/j.cmpb.2023.107474. Epub 2023 Mar 11.
7
Author identification of literary works based on text analysis and deep learning.基于文本分析和深度学习的文学作品作者身份识别。
Heliyon. 2024 Jan 29;10(3):e25464. doi: 10.1016/j.heliyon.2024.e25464. eCollection 2024 Feb 15.
8
[A customized method for information extraction from unstructured text data in the electronic medical records].[一种从电子病历非结构化文本数据中提取信息的定制方法]
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.
9
Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT.基于混合神经网络和医学 MC-BERT 的中文电子病历命名实体识别。
BMC Med Inform Decis Mak. 2022 Dec 1;22(1):315. doi: 10.1186/s12911-022-02059-2.
10
Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification.使用挤压胶囊进行短文本分类的多级语义提取研究
Entropy (Basel). 2022 Apr 23;24(5):590. doi: 10.3390/e24050590.

引用本文的文献

1
Assemble the shallow or integrate a deep? Toward a lightweight solution for glyph-aware Chinese text classification.组合浅层还是集成深层?迈向有向汉字分类的轻量级解决方案。
PLoS One. 2023 Jul 28;18(7):e0289204. doi: 10.1371/journal.pone.0289204. eCollection 2023.

本文引用的文献

1
Using Recurrent Neural Networks for Predicting Type-2 Diabetes from Genomic and Tabular Data.利用循环神经网络从基因组和表格数据预测2型糖尿病
Diagnostics (Basel). 2022 Dec 6;12(12):3067. doi: 10.3390/diagnostics12123067.
2
A Complete Process of Text Classification System Using State-of-the-Art NLP Models.使用最先进的自然语言处理模型的文本分类系统的完整流程。
Comput Intell Neurosci. 2022 Jun 9;2022:1883698. doi: 10.1155/2022/1883698. eCollection 2022.
3
Fine-Tuned DenseNet-169 for Breast Cancer Metastasis Prediction Using FastAI and 1-Cycle Policy.
使用 FastAI 和 1-Cycle 策略对密集型 DenseNet-169 进行微调,以进行乳腺癌转移预测。
Sensors (Basel). 2022 Apr 13;22(8):2988. doi: 10.3390/s22082988.
4
Classification of Skin Disease Using Deep Learning Neural Networks with MobileNet V2 and LSTM.基于 MobileNet V2 和 LSTM 的深度学习神经网络在皮肤病分类中的应用。
Sensors (Basel). 2021 Apr 18;21(8):2852. doi: 10.3390/s21082852.
5
An Improved Double Channel Long Short-Term Memory Model for Medical Text Classification.用于医学文本分类的改进双通道长短时记忆模型。
J Healthc Eng. 2021 Feb 23;2021:6664893. doi: 10.1155/2021/6664893. eCollection 2021.
6
Leveraging word embeddings and medical entity extraction for biomedical dataset retrieval using unstructured texts.利用词嵌入和医学实体提取,通过非结构化文本检索生物医学数据集。
Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax091.
7
A Personalized Healthcare Monitoring System for Diabetic Patients by Utilizing BLE-Based Sensors and Real-Time Data Processing.基于 BLE 传感器和实时数据处理的糖尿病患者个性化医疗保健监测系统。
Sensors (Basel). 2018 Jul 6;18(7):2183. doi: 10.3390/s18072183.
8
A Natural Language Processing System That Links Medical Terms in Electronic Health Record Notes to Lay Definitions: System Development Using Physician Reviews.一种将电子健康记录笔记中的医学术语与通俗定义相链接的自然语言处理系统:利用医生评审进行系统开发。
J Med Internet Res. 2018 Jan 22;20(1):e26. doi: 10.2196/jmir.8669.
9
Phenotypic Analysis of Clinical Narratives Using Human Phenotype Ontology.使用人类表型本体对临床叙述进行表型分析。
Stud Health Technol Inform. 2017;245:581-585.
10
Clinical information extraction applications: A literature review.临床信息提取应用:文献综述。
J Biomed Inform. 2018 Jan;77:34-49. doi: 10.1016/j.jbi.2017.11.011. Epub 2017 Nov 21.