Jin Xiaojie, Wang Yanru, Wang Jiarui, Gao Qian, Huang Yuhan, Shao Lingyu, Zhao Jiali, Li Jintian, Li Ling, Zhang Zhiming, Li Shuyan, Liu Yongqi
Key Laboratory of Dunhuang Medicine, Ministry of Education, Gansu University of Chinese Medicine, Dingxi East Road, 35th, Lanzhou, 730000, China, 86 13919019578.
College of Pharmacy, Gansu University of Chinese Medicine, Lanzhou, China.
JMIR Med Inform. 2025 Jul 16;13:e64725. doi: 10.2196/64725.
Syndrome differentiation in traditional Chinese medicine (TCM) is an ancient principle that guides disease diagnosis and treatment. Among these, the cold and hot syndromes play a crucial role in identifying the nature of the disease and guiding the treatment of viral pneumonia. However, differentiating between cold and hot syndromes is often considered esoteric. Machine learning offers a promising avenue for clinicians to identify these syndromes more accurately, thereby supporting more informed clinical decision-making in the treatment.
This study aims to construct a diagnostic model for differentiating cold and hot syndromes in viral pneumonia by integrating TCM and modern medical features using machine learning methods.
The application of 8 machine learning algorithms (gradient boosting machine [GBM], logistic regression, random forest, extreme gradient boosting [XGB], light gradient boosting machine [LGB], ridge regression, least absolute shrinkage and selection operator, and support vector machine) generated and validated (both internally and externally) a model for differentiating cold and hot syndromes in viral pneumonia, based on clinical data from 1484 patient samples collected at 2 medical centers between 2021 and 2022.
The GBM model, which combines TCM and modern medicine features, outperformed models using only TCM features or only modern medicine features in distinguishing cold and hot syndromes in patients with viral pneumonia. The optimal discrimination model comprised 13 optimal features (temperature, red cell distribution width-SD, creatinine, total bilirubin, globulin, C-reactive protein, unconjugated bilirubin, white blood cell, neutrophil percentage, aspartate transaminase/alanine transaminase, total cholesterol, thrombocytocrit, and age) and the GBM algorithm, achieving an area under the curve (AUC) of 0.7788. Under internal and external testing, the AUCs were 0.7645 and 0.8428, respectively. Moreover, significant differences were observed between the cold and hot syndrome groups in temperature (P=.02), red cell distribution width-SD (P<.001), neutrophil percentage (P=.01), total cholesterol (P=.003), thrombocytocrit (P<.001), and age (P<.001).
This pioneering study integrates the theory of TCM cold and hot syndromes with modern laboratory-based tests through machine learning. The developed model offers a novel approach for differentiating cold and hot syndromes in viral pneumonia, enabling practitioners to identify the syndrome quickly and efficiently, thereby supporting more informed clinical decision-making. Additionally, this research provides new insights into the modernization and scientific interpretation of TCM syndrome differentiation.
中医辨证是指导疾病诊断和治疗的古老原则。其中,寒证和热证在识别疾病本质和指导病毒性肺炎治疗方面起着关键作用。然而,寒证和热证的鉴别往往被认为晦涩难懂。机器学习为临床医生更准确地识别这些证型提供了一条有前景的途径,从而在治疗中支持更明智的临床决策。
本研究旨在通过机器学习方法整合中医和现代医学特征,构建一种用于鉴别病毒性肺炎寒证和热证的诊断模型。
应用8种机器学习算法(梯度提升机[GBM]、逻辑回归、随机森林、极端梯度提升[XGB]、轻量级梯度提升机[LGB]、岭回归、最小绝对收缩和选择算子以及支持向量机),基于2021年至2022年在2个医疗中心收集的1484例患者样本的临床数据,生成并验证(内部和外部)一种用于鉴别病毒性肺炎寒证和热证的模型。
结合中医和现代医学特征的GBM模型在鉴别病毒性肺炎患者的寒证和热证方面优于仅使用中医特征或仅使用现代医学特征的模型。最佳鉴别模型包括13个最佳特征(体温、红细胞分布宽度标准差、肌酐、总胆红素、球蛋白、C反应蛋白、非结合胆红素、白细胞、中性粒细胞百分比、天冬氨酸转氨酶/丙氨酸转氨酶、总胆固醇、血小板压积和年龄)和GBM算法,曲线下面积(AUC)为0.7788。在内部和外部测试中,AUC分别为0.7645和0.8428。此外,寒证组和热证组在体温(P = 0.02)、红细胞分布宽度标准差(P < 0.001)、中性粒细胞百分比(P = 0.01)、总胆固醇(P = 0.003)、血小板压积(P < 0.001)和年龄(P < 0.001)方面存在显著差异。
这项开创性研究通过机器学习将中医寒证和热证理论与现代实验室检测相结合。所开发的模型为鉴别病毒性肺炎寒证和热证提供了一种新方法,使从业者能够快速有效地识别证型,从而支持更明智的临床决策。此外,本研究为中医辨证的现代化和科学解读提供了新的见解。