Zhang Xin, Yan Lin-Feng, Hu Yu-Chuan, Li Gang, Yang Yang, Han Yu, Sun Ying-Zhi, Liu Zhi-Cheng, Tian Qiang, Han Zi-Yang, Liu Le-De, Hu Bin-Quan, Qiu Zi-Yu, Wang Wen, Cui Guang-Bin
Department of Radiology, Tangdu Hospital, The Fourth Military Medical University, Xi'an 710038, Shaanxi, P.R. China.
Department of Neurosurgery, Tangdu Hospital, The Fourth Military Medical University, Xi'an 710038, Shaanxi, P.R. China.
Oncotarget. 2017 Jul 18;8(29):47816-47830. doi: 10.18632/oncotarget.18001.
Current machine learning techniques provide the opportunity to develop noninvasive and automated glioma grading tools, by utilizing quantitative parameters derived from multi-modal magnetic resonance imaging (MRI) data. However, the efficacies of different machine learning methods in glioma grading have not been investigated.A comprehensive comparison of varied machine learning methods in differentiating low-grade gliomas (LGGs) and high-grade gliomas (HGGs) as well as WHO grade II, III and IV gliomas based on multi-parametric MRI images was proposed in the current study. The parametric histogram and image texture attributes of 120 glioma patients were extracted from the perfusion, diffusion and permeability parametric maps of preoperative MRI. Then, 25 commonly used machine learning classifiers combined with 8 independent attribute selection methods were applied and evaluated using leave-one-out cross validation (LOOCV) strategy. Besides, the influences of parameter selection on the classifying performances were investigated. We found that support vector machine (SVM) exhibited superior performance to other classifiers. By combining all tumor attributes with synthetic minority over-sampling technique (SMOTE), the highest classifying accuracy of 0.945 or 0.961 for LGG and HGG or grade II, III and IV gliomas was achieved. Application of Recursive Feature Elimination (RFE) attribute selection strategy further improved the classifying accuracies. Besides, the performances of LibSVM, SMO, IBk classifiers were influenced by some key parameters such as kernel type, c, gama, K, etc. SVM is a promising tool in developing automated preoperative glioma grading system, especially when being combined with RFE strategy. Model parameters should be considered in glioma grading model optimization.
当前的机器学习技术提供了开发非侵入性自动胶质瘤分级工具的机会,即利用从多模态磁共振成像(MRI)数据中获取的定量参数。然而,不同机器学习方法在胶质瘤分级中的效果尚未得到研究。本研究提出基于多参数MRI图像,对多种机器学习方法在区分低级别胶质瘤(LGG)和高级别胶质瘤(HGG)以及世界卫生组织(WHO)II、III和IV级胶质瘤方面进行全面比较。从120例胶质瘤患者术前MRI的灌注、扩散和通透性参数图中提取参数直方图和图像纹理属性。然后,应用25种常用的机器学习分类器并结合8种独立的属性选择方法,采用留一法交叉验证(LOOCV)策略进行评估。此外,还研究了参数选择对分类性能的影响。我们发现支持向量机(SVM)表现出优于其他分类器的性能。通过将所有肿瘤属性与合成少数类过采样技术(SMOTE)相结合,LGG和HGG或II、III和IV级胶质瘤的最高分类准确率分别达到0.945或0.961。应用递归特征消除(RFE)属性选择策略进一步提高了分类准确率。此外,LibSVM、SMO、IBk分类器的性能受内核类型、c、伽马(gama)、K等一些关键参数的影响。SVM是开发自动化术前胶质瘤分级系统的一种有前景的工具,尤其是与RFE策略相结合时。在胶质瘤分级模型优化中应考虑模型参数。