Sarkar Suvobrata, Mali Kalyani
Department of Computer Science and Engineering, Dr. B.C. Roy Engineering College, Durgapur, West Bengal, India.
Department of Computer Science and Engineering, University of Kalyani, Kalyani, West Bengal, India.
Digit Health. 2023 Oct 16;9:20552076231207203. doi: 10.1177/20552076231207203. eCollection 2023 Jan-Dec.
Breast cancer is a highly predominant destructive disease among women characterised with varied tumour biology, molecular subgroups and diverse clinicopathological specifications. The potentiality of machine learning to transform complex medical data into meaningful knowledge has led to its application in breast cancer detection and prognostic evaluation.
The emergence of data-driven diagnostic model for assisting clinicians in diagnostic decision making has gained an increasing curiosity in breast cancer identification and analysis. This motivated us to develop a breast cancer data-driven model for subtype classification more accurately.
In this article, we proposed a firefly-support vector machine (SVM) breast cancer predictive model that uses clinicopathological and demographic data gathered from various tertiary care cancer hospitals or oncological centres to distinguish between patients with triple-negative breast cancer (TNBC) and non-triple-negative breast cancer (non-TNBC).
The results of the firefly-support vector machine (firefly-SVM) predictive model were distinguished from the traditional grid search-support vector machine (Grid-SVM) model, particle swarm optimisation-support vector machine (PSO-SVM) and genetic algorithm-support vector machine (GA-SVM) hybrid models through hyperparameter tuning. The findings show that the recommended firefly-SVM classification model outperformed other existing models in terms of prediction accuracy (93.4%, 86.6%, 69.6%) for automated SVM parameter selection. The effectiveness of the prediction model was also evaluated using well-known metrics, such as the F1-score, mean square error, area under the ROC curve, logarithmic loss and precision-recall curve.
Firefly-SVM predictive model may be treated as an alternate tool for breast cancer subgroup classification that would benefit the clinicians for managing the patient with proper treatment and diagnostic outcome.
乳腺癌是女性中一种极具侵袭性的破坏性疾病,具有多种肿瘤生物学特征、分子亚组和多样的临床病理特征。机器学习将复杂医学数据转化为有意义知识的潜力,促使其在乳腺癌检测和预后评估中得到应用。
数据驱动的诊断模型在辅助临床医生进行诊断决策方面的出现,引发了人们对乳腺癌识别和分析的日益浓厚兴趣。这促使我们开发一种更准确的乳腺癌数据驱动亚型分类模型。
在本文中,我们提出了一种萤火虫支持向量机(SVM)乳腺癌预测模型,该模型使用从各种三级护理癌症医院或肿瘤中心收集的临床病理和人口统计学数据,来区分三阴性乳腺癌(TNBC)患者和非三阴性乳腺癌(非TNBC)患者。
通过超参数调整,萤火虫支持向量机(萤火虫-SVM)预测模型的结果与传统网格搜索支持向量机(Grid-SVM)模型、粒子群优化支持向量机(PSO-SVM)和遗传算法支持向量机(GA-SVM)混合模型有所不同。研究结果表明,推荐的萤火虫-SVM分类模型在自动选择SVM参数的预测准确性(93.4%、86.6%、69.6%)方面优于其他现有模型。还使用了F1分数、均方误差、ROC曲线下面积、对数损失和精确召回曲线等知名指标来评估预测模型的有效性。
萤火虫-SVM预测模型可被视为乳腺癌亚组分类的替代工具,这将有助于临床医生对患者进行适当的治疗和诊断,从而改善治疗结果。