Department of Electronics and Communication Engineering, NIT, Durgapur, 713209, India.
Department of Electronics and Communication Engineering, C.V. Raman Global University, Bhubaneswar, 752054, India.
Sci Rep. 2023 Nov 10;13(1):19598. doi: 10.1038/s41598-023-46865-8.
Thyroid cancer is a life-threatening condition that arises from the cells of the thyroid gland located in the neck's frontal region just below the adam's apple. While it is not as prevalent as other types of cancer, it ranks prominently among the commonly observed cancers affecting the endocrine system. Machine learning has emerged as a valuable medical diagnostics tool specifically for detecting thyroid abnormalities. Feature selection is of vital importance in the field of machine learning as it serves to decrease the data dimensionality and concentrate on the most pertinent features. This process improves model performance, reduces training time, and enhances interpretability. This study examined binary variants of FOX-optimization algorithms for feature selection. The study employed eight transfer functions (S and V shape) to convert the FOX-optimization algorithms into their binary versions. The vision transformer-based pre-trained models (DeiT and Swin Transformer) are used for feature extraction. The extracted features are transformed using locally linear embedding, and binary FOX-optimization algorithms are applied for feature selection in conjunction with the Naïve Bayes classifier. The study utilized two datasets (ultrasound and histopathological) related to thyroid cancer images. The benchmarking is performed using the half-quadratic theory-based ensemble ranking technique. Two TOPSIS-based methods (H-TOPSIS and A-TOPSIS) are employed for initial model ranking, followed by an ensemble technique for final ranking. The problem is treated as multi-objective optimization task with accuracy, F2-score, AUC-ROC and feature space size as optimization goals. The binary FOX-optimization algorithm based on the [Formula: see text] transfer function achieved superior performance compared to other variants using both datasets as well as feature extraction techniques. The proposed framework comprised a Swin transformer to extract features, a Fox optimization algorithm with a V1 transfer function for feature selection, and a Naïve Bayes classifier and obtained the best performance for both datasets. The best model achieved an accuracy of 94.75%, an AUC-ROC value of 0.9848, an F2-Score of 0.9365, an inference time of 0.0353 seconds, and selected 5 features for the ultrasound dataset. For the histopathological dataset, the diagnosis model achieved an overall accuracy of 89.71%, an AUC-ROC score of 0.9329, an F2-Score of 0.8760, an inference time of 0.05141 seconds, and selected 12 features. The proposed model achieved results comparable to existing research with small features space.
甲状腺癌是一种危及生命的疾病,它起源于位于颈部前区、喉结下方的甲状腺细胞。虽然它不像其他类型的癌症那么普遍,但它是内分泌系统中常见的癌症之一。机器学习已成为一种有价值的医学诊断工具,专门用于检测甲状腺异常。特征选择在机器学习领域至关重要,因为它可以降低数据维度,并专注于最相关的特征。这个过程提高了模型性能,减少了训练时间,并增强了可解释性。本研究探讨了用于特征选择的二进制变体 FOX 优化算法。研究采用了八种转移函数(S 和 V 形)将 FOX 优化算法转换为二进制版本。基于 Vision Transformer 的预训练模型(DeiT 和 Swin Transformer)用于特征提取。提取的特征使用局部线性嵌入进行转换,并与朴素贝叶斯分类器结合使用二进制 FOX 优化算法进行特征选择。研究使用了两个与甲状腺癌图像相关的数据集(超声和组织病理学)。使用基于半二次理论的集成排名技术进行基准测试。使用两种基于 TOPSIS 的方法(H-TOPSIS 和 A-TOPSIS)进行初始模型排名,然后使用集成技术进行最终排名。该问题被视为具有准确性、F2 分数、AUC-ROC 和特征空间大小作为优化目标的多目标优化任务。基于 [公式:见文本] 转移函数的二进制 FOX 优化算法在使用两个数据集和特征提取技术时,与其他变体相比,表现更为出色。所提出的框架由 Swin 变压器组成,用于提取特征,由 Fox 优化算法和 V1 转移函数组成,用于特征选择,由朴素贝叶斯分类器组成,并为两个数据集获得了最佳性能。最佳模型的准确率为 94.75%,AUC-ROC 值为 0.9848,F2-Score 为 0.9365,推理时间为 0.0353 秒,为超声数据集选择了 5 个特征。对于组织病理学数据集,诊断模型的总准确率为 89.71%,AUC-ROC 得分为 0.9329,F2-Score 为 0.8760,推理时间为 0.05141 秒,选择了 12 个特征。与现有研究相比,该模型的特征空间较小,但结果相当。