结合外部医学知识图谱嵌入以提高辨证模型的性能。
Combining the External Medical Knowledge Graph Embedding to Improve the Performance of Syndrome Differentiation Model.
作者信息
Ye Qing, Yang Rui, Cheng Chun-Lei, Peng Lin, Lan Yong
机构信息
School of Computer, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China.
出版信息
Evid Based Complement Alternat Med. 2023 Feb 1;2023:2088698. doi: 10.1155/2023/2088698. eCollection 2023.
The electronic medical records (EMRs) of traditional Chinese medicine (TCM) include a wealth of TCM knowledge and syndrome diagnosis information, which is crucial for improving the quality of TCM auxiliary decision-making. In practical diagnosis, one disease corresponds to one syndrome, posing considerable hurdles for the informatization of TCM. The purpose of this work was to create an end-to-end TCM diagnostic model, and the knowledge graph (KG) created in this article is used to improve the model's information and realize auxiliary decision-making for TCM disorders. We approached auxiliary decision-making for syndrome differentiation in this article as a multilabel classification task and presented a knowledge-based decision support model for syndrome differentiation (KDSD). Specifically, we created a KG based on TCM features (TCMKG), supplementing the textual representation of medical data with embedded information. Finally, we proposed fusing medical text with KG entity representation (F-MT-KER) to get prediction results using a linear output layer. After obtaining the vector representation of the medical record text using the BERT model, the vector representation of various KG embedded models can provide additional hidden information to a certain extent. Experimental results show that our method improves by 1% (P@1) on the syndrome differentiation auxiliary decision task compared to the baseline model BERT. The usage of EMRs can aid TCM development more efficiently. With the help of entity level representation, character level representation, and model fusion, the multilabel classification method based on the pretraining model and KG can better simulate the TCM syndrome differentiation of the complex cases.
中医电子病历(EMRs)包含丰富的中医知识和证候诊断信息,这对于提高中医辅助决策质量至关重要。在实际诊断中,一种疾病对应一种证候,这给中医信息化带来了相当大的障碍。这项工作的目的是创建一个端到端的中医诊断模型,本文创建的知识图谱(KG)用于改进模型的信息并实现对中医病症的辅助决策。我们将本文中的证候鉴别辅助决策作为多标签分类任务,并提出了一种基于知识的证候鉴别决策支持模型(KDSD)。具体来说,我们基于中医特征创建了一个知识图谱(TCMKG),用嵌入信息补充医学数据的文本表示。最后,我们提出将医学文本与KG实体表示融合(F-MT-KER),使用线性输出层获得预测结果。使用BERT模型获得病历文本的向量表示后,各种KG嵌入模型的向量表示可以在一定程度上提供额外的隐藏信息。实验结果表明,与基线模型BERT相比,我们的方法在证候鉴别辅助决策任务上提高了1%(P@1)。电子病历的使用可以更有效地辅助中医发展。借助实体级表示、字符级表示和模型融合,基于预训练模型和KG的多标签分类方法可以更好地模拟复杂病例的中医证候鉴别。