Research Base of Traditional Chinese Medicine Syndrome, Fujian University of Traditional Chinese Medicine, Fuzhou 350122, China.
College of Integrated Chinese and Western Medicine, Fujian University of Traditional Chinese Medicine, Fuzhou 350122, China.
Biomed Pharmacother. 2021 May;137:111367. doi: 10.1016/j.biopha.2021.111367. Epub 2021 Feb 13.
BACKGROUND: Metabolic syndrome (MS) is a major global health concern comprising a cluster of co-occurring conditions that increase the risk of heart disease, stroke and type 2 diabetes. MS is usually diagnosed using a combination of physiochemical indexes (such as BMI, abdominal circumference and blood pressure) but largely ignores clinical symptoms when investigating prevention and treatment of the disease. Exploring predictors of MS using multiple diagnostic indicators may improve early diagnosis and treatment of MS. Traditional Chinese medicine (TCM) attaches importance to the etiology of disease symptoms and indications using four diagnostic methods, which have long been used to treat metabolic disease. Therefore, in this study, we aimed to develop predictive indicators for MS using both physiochemical indexes and TCM methods. METHODS: Clinical information (including both physiochemical and TCM indexes) was obtained from a cohort of 586 individuals across 4 hospitals in China, comprising 136 healthy controls and 450 MS cases. Using this cohort, we compared three classic machine learning methods: decision tree (DT), support vector machine (SVM) and random forest (RF) towards MS diagnosis using physiochemical and TCM indexes, with the best model selected by comparing the accuracy, specificity and sensitivity of the three models. In parallel, the best proportional partition of the training data to the test data was confirmed by observing the changes in evaluation indexes using each model. Next, three subsets containing different categories of variables (including both TCM and physicochemical indexes combined - termed the "fused indexes", only physicochemical indexes, and TCM indexes only) were compared and analyzed using the best performing model and optimum training to test data proportion. Next, the best subset was selected through comprehensive comparative analysis, and then the important prediction variables were selected according to their weight. RESULTS: When comparing the three models, we found that the RF model had the highest average accuracy (average 0.942, 95%CI [0.925, 0.958]) and sensitivity (average 0.993, 95%CI [0.990, 0.996]). Besides, when the training set accounted for 80% of the cohort data, the specificity got the best value and the accuracy and sensitivity were also very high in RF model. In view of the performance of the three different subsets, the prediction accuracy and sensitivity of models analyzing the fused indexes and only physicochemical indexes remained at a high level. Further, the mean value of specificity of the model using fused indexes was 0.916, which was significantly higher than the model with only physicochemical indexes (average 0.822) and the model with only TCM indexes (average 0.403). Based on the RF model and data allocation ratio (8:2), we further extracted the top 20 most significant variables from the fused indexes, which included 14 physicochemical indexes and 6 TCM indexes including wiry pulse, chest tightness, spontaneous perspiration, greasy tongue coating etc. CONCLUSION: Compared with SVM and DT models, the RF model showed the best performance, especially when the ratio of the training set to test set is 8:2. Compared with single predictive indexes, the model constructed by combining physiochemical indexes with TCM indexes (i.e. the fused indexes) exhibited better predictive ability. In addition to common physicochemical indexes, some TCM indexes, such as wiry pulse, chest tightness, spontaneous perspiration, greasy tongue coating, can also improve diagnosis of MS.
背景:代谢综合征(MS)是一个主要的全球健康问题,它由一组同时发生的病症组成,这些病症会增加心脏病、中风和 2 型糖尿病的风险。MS 通常使用生理化学指标(如 BMI、腰围和血压)的组合来诊断,但在研究疾病的预防和治疗时,很大程度上忽略了临床症状。使用多种诊断指标来探索 MS 的预测因子可能会改善 MS 的早期诊断和治疗。中医(TCM)重视使用四种诊断方法对疾病症状和指征的病因,这些方法长期以来一直用于治疗代谢疾病。因此,在这项研究中,我们旨在使用生理化学指标和 TCM 方法来开发 MS 的预测指标。
方法:从中国 4 家医院的 586 名个体中获得临床信息(包括生理化学和 TCM 指标),其中包括 136 名健康对照者和 450 名 MS 病例。使用该队列,我们比较了三种经典的机器学习方法:决策树(DT)、支持向量机(SVM)和随机森林(RF),使用生理化学和 TCM 指标对 MS 进行诊断,通过比较三个模型的准确性、特异性和敏感性来选择最佳模型。同时,通过观察每个模型使用训练数据的测试数据的最佳比例来确认最佳比例划分。接下来,使用性能最佳的模型和最佳的训练到测试数据比例,对包含不同类别的变量的三个子集(包括 TCM 和生理化学指标的组合——称为“融合指标”、仅生理化学指标和仅 TCM 指标)进行比较和分析。接下来,通过综合比较分析选择最佳子集,然后根据权重选择重要的预测变量。
结果:在比较这三种模型时,我们发现 RF 模型的平均准确性(平均 0.942,95%CI [0.925,0.958])和敏感性(平均 0.993,95%CI [0.990,0.996])最高。此外,当训练集占队列数据的 80%时,RF 模型的特异性得到了最佳值,准确性和敏感性也非常高。考虑到三种不同子集的性能,分析融合指标和仅生理化学指标的模型的预测准确性和敏感性仍然保持在较高水平。此外,使用融合指标的模型的特异性平均值为 0.916,明显高于仅使用生理化学指标的模型(平均 0.822)和仅使用 TCM 指标的模型(平均 0.403)。基于 RF 模型和数据分配比例(8:2),我们进一步从融合指标中提取了前 20 个最重要的变量,其中包括 14 个生理化学指标和 6 个 TCM 指标,包括细脉、胸闷、自汗、油腻舌涂层等。
结论:与 SVM 和 DT 模型相比,RF 模型表现出最佳性能,尤其是当训练集与测试集的比例为 8:2 时。与单一预测指标相比,将生理化学指标与 TCM 指标(即融合指标)相结合构建的模型具有更好的预测能力。除了常见的生理化学指标外,一些 TCM 指标,如细脉、胸闷、自汗、油腻舌涂层等,也可以提高 MS 的诊断能力。
Biomed Pharmacother. 2021-5
Evid Based Complement Alternat Med. 2020-11-26
Int J Environ Res Public Health. 2020-3-12
Zhongguo Gu Shang. 2018-6-25
BMC Med Inform Decis Mak. 2025-2-18