Weng Heng, Chen Jielong, Ou Aihua, Lao Yingrong
State Key Laboratory of Dampness Syndrome of Chinese Medicine, Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China.
School of Information Science, Guangdong University of Finance & Economics, Guangzhou, China.
JMIR Med Inform. 2022 Sep 2;10(9):e38414. doi: 10.2196/38414.
Knowledge discovery from treatment data records from Chinese physicians is a dramatic challenge in the application of artificial intelligence (AI) models to the research of traditional Chinese medicine (TCM).
This paper aims to construct a TCM knowledge graph (KG) from Chinese physicians and apply it to the decision-making related to diagnosis and treatment in TCM.
A new framework leveraging a representation learning method for TCM KG construction and application was designed. A transformer-based Contextualized Knowledge Graph Embedding (CoKE) model was applied to KG representation learning and knowledge distillation. Automatic identification and expansion of multihop relations were integrated with the CoKE model as a pipeline. Based on the framework, a TCM KG containing 59,882 entities (eg, diseases, symptoms, examinations, drugs), 17 relations, and 604,700 triples was constructed. The framework was validated through a link predication task.
Experiments showed that the framework outperforms a set of baseline models in the link prediction task using the standard metrics mean reciprocal rank (MRR) and Hits@N. The knowledge graph embedding (KGE) multitagged TCM discriminative diagnosis metrics also indicated the improvement of our framework compared with the baseline models.
Experiments showed that the clinical KG representation learning and application framework is effective for knowledge discovery and decision-making assistance in diagnosis and treatment. Our framework shows superiority of application prospects in tasks such as KG-fused multimodal information diagnosis, KGE-based text classification, and knowledge inference-based medical question answering.
从中国医生的治疗数据记录中进行知识发现,是人工智能(AI)模型应用于中医(TCM)研究的一项巨大挑战。
本文旨在构建一个源自中国医生的中医知识图谱(KG),并将其应用于中医诊疗决策。
设计了一个利用表示学习方法进行中医知识图谱构建与应用的新框架。基于Transformer的上下文知识图谱嵌入(CoKE)模型应用于知识图谱表示学习和知识蒸馏。多跳关系的自动识别与扩展作为一个流程与CoKE模型集成。基于该框架,构建了一个包含59,882个实体(如疾病、症状、检查、药物)、17种关系和604,700个三元组的中医知识图谱。通过链接预测任务对该框架进行了验证。
实验表明,在使用标准指标平均倒数排名(MRR)和Hits@N的链接预测任务中,该框架优于一组基线模型。知识图谱嵌入(KGE)多标签中医判别诊断指标也表明,与基线模型相比,我们的框架有所改进。
实验表明,临床知识图谱表示学习与应用框架在诊疗知识发现和决策辅助方面是有效的。我们的框架在知识图谱融合多模态信息诊断、基于知识图谱嵌入的文本分类以及基于知识推理的医学问答等任务中显示出应用前景的优越性。