Li Guo-Zheng, He Zehui, Shao Feng-Feng, Ou Ai-Hua, Lin Xiao-Zhong
BMC Med Genomics. 2015;8 Suppl 3(Suppl 3):S4. doi: 10.1186/1755-8794-8-S3-S4. Epub 2015 Sep 23.
Hypertension is one of the major risk factors for cardiovascular diseases. Research on the patient classification of hypertension has become an important topic because Traditional Chinese Medicine lies primarily in "treatment based on syndromes differentiation of the patients".
Clinical data of hypertension was collected with 12 syndromes and 129 symptoms including inspection, tongue, inquiry, and palpation symptoms. Syndromes differentiation was modeled as a patient classification problem in the field of data mining, and a new multi-label learning model BrSmoteSvm was built dealing with the class-imbalanced of the dataset.
The experiments showed that the BrSmoteSvm had a better results comparing to other multi-label classifiers in the evaluation criteria of Average precision, Coverage, One-error, Ranking loss.
BrSmoteSvm can model the hypertension's syndromes differentiation better considering the imbalanced problem.
高血压是心血管疾病的主要危险因素之一。由于中医主要基于“辨证论治”,高血压患者分类研究已成为一个重要课题。
收集高血压临床数据,包括望、舌、问、切等12种证候和129种症状。在数据挖掘领域,将辨证论治建模为患者分类问题,并构建了一种新的多标签学习模型BrSmoteSvm来处理数据集的类不平衡问题。
实验表明,在平均精度、覆盖度、单错误率、排序损失等评估标准方面,BrSmoteSvm比其他多标签分类器具有更好的结果。
考虑到不平衡问题,BrSmoteSvm能够更好地对高血压辨证论治进行建模。