多任务联合学习模型在中医分词和证候分类中的应用

Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine.

机构信息

School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China.

Institute of Biomedical Engineering, School of Life Science, Shanghai University, Shanghai 200444, China.

出版信息

Int J Environ Res Public Health. 2022 May 5;19(9):5601. doi: 10.3390/ijerph19095601.

DOI:10.3390/ijerph19095601

PMID:35564995

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9103751/

Abstract

Evidence-based treatment is the basis of traditional Chinese medicine (TCM), and the accurate differentiation of syndromes is important for treatment in this context. The automatic differentiation of syndromes of unstructured medical records requires two important steps: Chinese word segmentation and text classification. Due to the ambiguity of the Chinese language and the peculiarities of syndrome differentiation, these tasks pose a daunting challenge. We use text classification to model syndrome differentiation for TCM, and use multi-task learning (MTL) and deep learning to accomplish the two challenging tasks of Chinese word segmentation and syndrome differentiation. Two classic deep neural networks—bidirectional long short-term memory (Bi-LSTM) and text-based convolutional neural networks (TextCNN)—are fused into MTL to simultaneously carry out these two tasks. We used our proposed method to conduct a large number of comparative experiments. The experimental comparisons showed that it was superior to other methods on both tasks. Our model yielded values of accuracy, specificity, and sensitivity of 0.93, 0.94, and 0.90, and 0.80, 0.82, and 0.78 on the Chinese word segmentation task and the syndrome differentiation task, respectively. Moreover, statistical analyses showed that the accuracies of the non-joint and joint models were both within the 95% confidence interval, with pvalue < 0.05. The experimental comparison showed that our method is superior to prevalent methods on both tasks. The work here can help modernize TCM through intelligent differentiation.

摘要

循证治疗是中医（TCM）的基础，准确区分证候对于这种治疗方法非常重要。非结构化医疗记录的证候自动区分需要两个重要步骤：中文分词和文本分类。由于中文的模糊性和证候区分的特殊性，这些任务极具挑战性。我们使用文本分类对中医的证候区分进行建模，并使用多任务学习（MTL）和深度学习来完成中文分词和证候区分这两个具有挑战性的任务。我们将两个经典的深度神经网络——双向长短时记忆网络（Bi-LSTM）和基于文本的卷积神经网络（TextCNN）——融合到 MTL 中，同时进行这两个任务。我们使用提出的方法进行了大量的对比实验。实验比较表明，该方法在这两个任务上都优于其他方法。我们的模型在中文分词任务和证候区分任务上的准确率、特异性和敏感度分别为 0.93、0.94 和 0.90，以及 0.80、0.82 和 0.78。此外，统计分析表明，非联合和联合模型的准确率均在 95%置信区间内，p 值<0.05。实验比较表明，我们的方法在这两个任务上都优于流行的方法。这项工作可以通过智能区分帮助中医现代化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a47/9103751/ccf6d6c6fc9b/ijerph-19-05601-g001.jpg

相似文献

Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine.多任务联合学习模型在中医分词和证候分类中的应用

Int J Environ Res Public Health. 2022 May 5;19(9):5601. doi: 10.3390/ijerph19095601.

Multi-Task Joint Learning Model for Segmenting and Classifying Tongue Images Using a Deep Neural Network.基于深度神经网络的用于舌图像分割与分类的多任务联合学习模型

IEEE J Biomed Health Inform. 2020 Sep;24(9):2481-2489. doi: 10.1109/JBHI.2020.2986376. Epub 2020 Apr 17.

End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation.用于肺癌诊断中模仿中医辨证的端到端模型：模型开发与验证

JMIR Med Inform. 2020 Jun 16;8(6):e17821. doi: 10.2196/17821.

End-to-End syndrome differentiation of Yin deficiency and Yang deficiency in traditional Chinese medicine.中医阴、阳虚证的端到端辨证。

Comput Methods Programs Biomed. 2019 Jun;174:9-15. doi: 10.1016/j.cmpb.2018.10.011. Epub 2018 Oct 16.

Traditional Chinese medicine diagnostic prediction model for holistic syndrome differentiation based on deep learning.基于深度学习的中医整体辨证诊断预测模型

Integr Med Res. 2024 Mar;13(1):101019. doi: 10.1016/j.imr.2023.101019. Epub 2023 Dec 19.

Artificial Intelligence-Based Traditional Chinese Medicine Assistive Diagnostic System: Validation Study.基于人工智能的中医辅助诊断系统：验证研究。

JMIR Med Inform. 2020 Jun 15;8(6):e17608. doi: 10.2196/17608.

A Data-Driven Model for Automated Chinese Word Segmentation and POS Tagging.基于数据驱动的中文分词与词性标注自动化模型

Comput Intell Neurosci. 2022 Sep 16;2022:7622392. doi: 10.1155/2022/7622392. eCollection 2022.

A hybrid Chinese word segmentation model for quality management-related texts based on transfer learning.基于迁移学习的质量管理相关文本混合分词模型。

PLoS One. 2022 Oct 7;17(10):e0270154. doi: 10.1371/journal.pone.0270154. eCollection 2022.

CapsTM: capsule network for Chinese medical text matching.CapsTM：用于中文医疗文本匹配的胶囊网络。

BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):94. doi: 10.1186/s12911-021-01442-9.

Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.使用多任务卷积神经网络从自由文本病理报告中自动提取癌症登记报告信息。

J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153.

引用本文的文献

Digital intelligence technology: new quality productivity for precision traditional Chinese medicine.数字智能技术：精准中医的新型质量生产力。

Front Pharmacol. 2025 Apr 8;16:1526187. doi: 10.3389/fphar.2025.1526187. eCollection 2025.

Dual-channel knowledge attention for traditional Chinese medicine syndrome differentiation.用于中医辨证的双通道知识注意力机制

Sci Rep. 2025 Apr 18;15(1):13487. doi: 10.1038/s41598-025-96404-w.

Traditional Chinese medicine diagnostic prediction model for holistic syndrome differentiation based on deep learning.基于深度学习的中医整体辨证诊断预测模型

Integr Med Res. 2024 Mar;13(1):101019. doi: 10.1016/j.imr.2023.101019. Epub 2023 Dec 19.

A Visualization Method of Knowledge Graphs for the Computation and Comprehension of Ultrasound Reports.一种用于超声报告计算与理解的知识图谱可视化方法。

Biomimetics (Basel). 2023 Nov 21;8(8):560. doi: 10.3390/biomimetics8080560.

Efficacy of Xinbao pill on chronic heart failure: Study protocol of a multicenter, randomized, double-blind, placebo-controlled trial.心宝丸治疗慢性心力衰竭的疗效：一项多中心、随机、双盲、安慰剂对照试验的研究方案

Front Pharmacol. 2022 Oct 25;13:1058799. doi: 10.3389/fphar.2022.1058799. eCollection 2022.

Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN.基于 ERNIE-Gram+GCN 的中文旅游评论情感分类。

Int J Environ Res Public Health. 2022 Oct 19;19(20):13520. doi: 10.3390/ijerph192013520.

本文引用的文献

Research of insomnia on traditional Chinese medicine diagnosis and treatment based on machine learning.基于机器学习的失眠症中医诊疗研究

Chin Med. 2021 Jan 6;16(1):2. doi: 10.1186/s13020-020-00409-8.

Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition.基于图划分的无监督多粒度中文分词与术语发现。

J Biomed Inform. 2020 Oct;110:103542. doi: 10.1016/j.jbi.2020.103542. Epub 2020 Aug 24.

JMIR Med Inform. 2020 Jun 16;8(6):e17821. doi: 10.2196/17821.

Artificial Intelligence-Based Traditional Chinese Medicine Assistive Diagnostic System: Validation Study.基于人工智能的中医辅助诊断系统：验证研究。

JMIR Med Inform. 2020 Jun 15;8(6):e17608. doi: 10.2196/17608.

Multi-Task Joint Learning Model for Segmenting and Classifying Tongue Images Using a Deep Neural Network.基于深度神经网络的用于舌图像分割与分类的多任务联合学习模型

IEEE J Biomed Health Inform. 2020 Sep;24(9):2481-2489. doi: 10.1109/JBHI.2020.2986376. Epub 2020 Apr 17.

Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks.基于卷积神经网络的中文电子病历智能诊断。

BMC Bioinformatics. 2019 Feb 1;20(1):62. doi: 10.1186/s12859-019-2617-8.

End-to-End syndrome differentiation of Yin deficiency and Yang deficiency in traditional Chinese medicine.中医阴、阳虚证的端到端辨证。

Comput Methods Programs Biomed. 2019 Jun;174:9-15. doi: 10.1016/j.cmpb.2018.10.011. Epub 2018 Oct 16.

Why Chinese medicine is heading for clinics around the world.为什么中医正走向世界各地的诊所。

Nature. 2018 Sep;561(7724):448-450. doi: 10.1038/d41586-018-06782-7.

A neural network multi-task learning approach to biomedical named entity recognition.一种用于生物医学命名实体识别的神经网络多任务学习方法。

BMC Bioinformatics. 2017 Aug 15;18(1):368. doi: 10.1186/s12859-017-1776-8.

A network-based approach to investigate the pattern of syndrome in depression.一种基于网络的方法来研究抑郁症的证候模式。

Evid Based Complement Alternat Med. 2015;2015:768249. doi: 10.1155/2015/768249. Epub 2015 Mar 2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

多任务联合学习模型在中医分词和证候分类中的应用

Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献