Raju Akella S Narasimha, Venkatesh K, Gatla Ranjith Kumar, Konakalla Eswara Prasad, Eid Marwa M, Titova Nataliia, Ghoneim Sherif S M, Ghaly Ramy N R
Department of Computer Science and Engineering (Data Science), Institute of Aeronautical Engineering, Dundigul, Hyderabad, Telangana, 500043, India.
Department of Networking and Communications, School of Computing, SRM Institute of Science and Technology, Kattankulathur, Chennai, Tamilnadu, 603203, India.
Sci Rep. 2025 Jan 25;15(1):3180. doi: 10.1038/s41598-025-86590-y.
The current work introduces the hybrid ensemble framework for the detection and segmentation of colorectal cancer. This framework will incorporate both supervised classification and unsupervised clustering methods to present more understandable and accurate diagnostic results. The method entails several steps with CNN models: ADa-22 and AD-22, transformer networks, and an SVM classifier, all inbuilt. The CVC ClinicDB dataset supports this process, containing 1650 colonoscopy images classified as polyps or non-polyps. The best performance in the ensembles was done by the AD-22 + Transformer + SVM model, with an AUC of 0.99, a training accuracy of 99.50%, and a testing accuracy of 99.00%. This group also saw a high accuracy of 97.50% for Polyps and 99.30% for Non-Polyps, together with a recall of 97.80% for Polyps and 98.90% for Non-Polyps, hence performing very well in identifying both cancerous and healthy regions. The framework proposed here uses K-means clustering in combination with the visualisation of bounding boxes, thereby improving segmentation and yielding a silhouette score of 0.73 with the best cluster configuration. It discusses how to combine feature interpretation challenges into medical imaging for accurate localization and precise segmentation of malignant regions. A good balance between performance and generalization shall be done by hyperparameter optimization-heavy learning rates; dropout rates and overfitting shall be suppressed effectively. The hybrid schema of this work treats the deficiencies of the previous approaches, such as incorporating CNN-based effective feature extraction, Transformer networks for developing attention mechanisms, and finally the fine decision boundary of the support vector machine. Further, we refine this process via unsupervised clustering for the purpose of enhancing the visualisation of such a procedure. Such a holistic framework, hence, further boosts classification and segmentation results by generating understandable outcomes for more rigorous benchmarking of detecting colorectal cancer and with higher reality towards clinical application feasibility.
当前的工作介绍了用于结直肠癌检测和分割的混合集成框架。该框架将结合监督分类和无监督聚类方法,以呈现更易于理解和准确的诊断结果。该方法包括使用卷积神经网络(CNN)模型的几个步骤:内置的ADa - 22和AD - 22、Transformer网络以及支持向量机(SVM)分类器。CVC ClinicDB数据集支持这一过程,该数据集包含1650张分类为息肉或非息肉的结肠镜检查图像。集成模型中的最佳性能由AD - 22 + Transformer + SVM模型实现,其曲线下面积(AUC)为0.99,训练准确率为99.50%,测试准确率为99.00%。该组对于息肉的准确率也高达97.50%,对于非息肉的准确率为99.30%,同时息肉的召回率为97.80%,非息肉的召回率为98.90%,因此在识别癌性和健康区域方面表现非常出色。这里提出的框架将K均值聚类与边界框可视化相结合,从而改善分割效果,并在最佳聚类配置下产生了0.73的轮廓系数。它讨论了如何将特征解释挑战融入医学成像中,以实现恶性区域的精确定位和精确分割。通过超参数优化——重学习率,可以在性能和泛化之间取得良好的平衡;应有效抑制辍学率和过拟合。这项工作的混合模式弥补了先前方法的不足,例如纳入基于CNN的有效特征提取、用于开发注意力机制的Transformer网络,以及最终支持向量机的精细决策边界。此外,我们通过无监督聚类来优化这个过程,以增强该过程的可视化。因此,这样一个整体框架通过生成易于理解的结果,进一步提高了分类和分割结果,以便对结直肠癌检测进行更严格的基准测试,并在临床应用可行性方面更接近实际情况。
Data Brief. 2025-4-28
Cochrane Database Syst Rev. 2025-6-19
Cochrane Database Syst Rev. 2025-6-9
Open Res Eur. 2025-5-6
Sci Rep. 2024-7-3
JAMA Netw Open. 2024-2-5
Best Pract Res Clin Gastroenterol. 2023-10
IEEE Trans Med Imaging. 2023-12
Diagnostics (Basel). 2023-9-13