中医阴、阳虚证的端到端辨证。

End-to-End syndrome differentiation of Yin deficiency and Yang deficiency in traditional Chinese medicine.

机构信息

Institute of Linguistics, Chinese Academy of Social Sciences, Beijing 100732, China; China Multilingual and Multimodal Corpora and Big Data Research Centre, Beijing 100089, China.

Institute of Information on Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China.

出版信息

Comput Methods Programs Biomed. 2019 Jun;174:9-15. doi: 10.1016/j.cmpb.2018.10.011. Epub 2018 Oct 16.

Abstract

BACKGROUND AND OBJECTIVE

Yin and Yang, two concepts adapted from classical Chinese philosophy, play a diagnostic role in Traditional Chinese Medicine (TCM). The Yin and Yang in harmonious balance indicate health, whereas imbalances to either side indicate unhealthiness, which may result in diseases. Yin-yang disharmony is considered to be the cause of pathological changes. Syndrome differentiation of yin-yang is crucial to clinical diagnosis. It lays a foundation for subsequent medical judgments, including therapeutic methods, and formula, among many others. However, because of the complexities of the mechanisms and manifestations of disease, it is difficult to exactly point out which one, yin or yang, is disharmonious. There has been inadequate research conducted on syndrome differentiation of yin and yang from a computational perspective. In this study, we present a computational method, viz. an end-to-end syndrome differentiation of yin deficiency and yang deficiency.

METHODS

Unlike most previous studies on syndrome differentiation, which use structured datasets, this study takes unstructured texts in medical records as its inputs. It models syndrome differentiation as a task of text classification. This study experiments on two state-of-the-art end-to-end algorithms for text classification, i.e. a classic convolutional neural network (CNN) and fastText. These two systems take the n-grams of several types of tokens as their inputs, including characters, terms, and words.

RESULTS

When evaluated on a data set with 7326 modern medical records in TCM, it is observed that CNN and fastText generally give rise to comparable performances. The best accuracy rate of 92.55% comes from the system taking inputs as raw as n-grams of characters. It implies that one can build at least a moderate system for the differentiation of yin deficiency and yang deficiency even if he has no glossary or tokenizer at hand.

CONCLUSIONS

This study has demonstrated the feasibility of using end-to-end text classification algorithms to differentiate yin deficiency and yang deficiency on unstructured medical records.

摘要

背景与目的

阴阳,两个源自中国古典哲学的概念,在中医(TCM)中具有诊断作用。阴阳和谐平衡表示健康,而任何一方的失衡都表示不健康,可能导致疾病。阴阳失调被认为是病理变化的原因。阴阳辨证对于临床诊断至关重要。它为后续的医学判断奠定了基础,包括治疗方法、方剂等。然而,由于疾病机制和表现的复杂性,很难准确指出是阴还是阳失调。从计算角度对阴阳辨证进行研究还不够充分。在这项研究中,我们提出了一种计算方法,即阴、阳两虚的端到端辨证。

方法

与大多数以前使用结构化数据集的阴阳辨证研究不同,本研究以医疗记录中的非结构化文本作为输入。它将辨证作为文本分类的任务进行建模。本研究在两种最先进的端到端文本分类算法上进行实验,即经典卷积神经网络(CNN)和 fastText。这两个系统将几种类型的标记的 n-gram 作为输入,包括字符、术语和单词。

结果

在一个包含 7326 份现代中医医疗记录的数据集上进行评估时,观察到 CNN 和 fastText 通常产生相当的性能。最高准确率为 92.55%,来自于将字符 n-gram 作为输入的系统。这意味着,即使手头没有词汇表或标记器,也至少可以构建一个中等规模的阴阳两虚辨证系统。

结论

本研究证明了使用端到端文本分类算法对非结构化医疗记录进行阴阳两虚辨证的可行性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索