Huong Audrey K C, Tay Kim Gaik, Ngu Xavier T I
Faculty of Electrical and Electronic Engineering, Universiti Tun Hussein Onn Malaysia, Batu Pahat, Malaysia.
Healthc Inform Res. 2021 Oct;27(4):298-306. doi: 10.4258/hir.2021.27.4.298. Epub 2021 Oct 31.
Different complex strategies of fusing handcrafted descriptors and features from convolutional neural network (CNN) models have been studied, mainly for two-class Papanicolaou (Pap) smear image classification. This paper explores a simplified system using combined binary coding for a five-class version of this problem.
This system extracted features from transfer learning of AlexNet, VGG19, and ResNet50 networks before reducing this problem into multiple binary sub-problems using error-correcting coding. The learners were trained using the support vector machine (SVM) method. The outputs of these classifiers were combined and compared to the true class codes for the final prediction.
Despite the superior performance of VGG19-SVM, with mean ± standard deviation accuracy and sensitivity of 80.68% ± 2.00% and 80.86% ± 0.45%, respectively, this model required a long training time. There were also false-negative cases using both the VGGNet-SVM and ResNet-SVM models. AlexNet-SVM was more efficient in terms of running speed and prediction consistency. Our findings also showed good diagnostic ability, with an area under the curve of approximately 0.95. Further investigation also showed good agreement between our research outcomes and that of the state-of-the-art methods, with specificity ranging from 93% to 100%.
We believe that the AlexNet-SVM model can be conveniently applied for clinical use. Further research could include the implementation of an optimization algorithm for hyperparameter tuning, as well as an appropriate selection of experimental design to improve the efficiency of Pap smear image classification.
已经研究了将手工描述符与卷积神经网络(CNN)模型的特征进行融合的不同复杂策略,主要用于两类巴氏涂片图像分类。本文探索了一种使用组合二进制编码解决该问题的五分类版本的简化系统。
该系统在使用纠错编码将此问题简化为多个二进制子问题之前,从AlexNet、VGG19和ResNet50网络的迁移学习中提取特征。使用支持向量机(SVM)方法训练学习器。将这些分类器的输出进行组合,并与真实类别代码进行比较以进行最终预测。
尽管VGG19 - SVM性能优越,平均准确率±标准差为80.68%±2.00%,灵敏度为80.86%±0.45%,但该模型训练时间长。VGGNet - SVM和ResNet - SVM模型也都存在假阴性情况。AlexNet - SVM在运行速度和预测一致性方面更高效。我们的研究结果还显示出良好的诊断能力,曲线下面积约为0.95。进一步研究还表明,我们的研究结果与最先进方法的结果具有良好的一致性,特异性范围为93%至100%。
我们认为AlexNet - SVM模型可方便地应用于临床。进一步的研究可以包括实施用于超参数调整的优化算法,以及适当选择实验设计以提高巴氏涂片图像分类的效率。