Suppr超能文献

一种用于慢性人类疾病预测的特征选择的增强型高效方法:一项乳腺癌研究。

An enhanced and efficient approach for feature selection for chronic human disease prediction: A breast cancer study.

作者信息

Khanna Munish, Singh Law Kumar, Shrivastava Kapil, Singh Rekha

机构信息

School of Computing Science and Engineering, Galgotias University, Greater Noida, Gautam Buddh Nagar, India.

Department of Computer Engineering and Applications, GLA University, Mathura, India.

出版信息

Heliyon. 2024 Feb 28;10(5):e26799. doi: 10.1016/j.heliyon.2024.e26799. eCollection 2024 Mar 15.

Abstract

Computer-aided diagnosis (CAD) systems play a vital role in modern research by effectively minimizing both time and costs. These systems support healthcare professionals like radiologists in their decision-making process by efficiently detecting abnormalities as well as offering accurate and dependable information. These systems heavily depend on the efficient selection of features to accurately categorize high-dimensional biological data. These features can subsequently assist in the diagnosis of related medical conditions. The task of identifying patterns in biomedical data can be quite challenging due to the presence of numerous irrelevant or redundant features. Therefore, it is crucial to propose and then utilize a feature selection (FS) process in order to eliminate these features. The primary goal of FS approaches is to improve the accuracy of classification by eliminating features that are irrelevant or less informative. The FS phase plays a critical role in attaining optimal results in machine learning (ML)-driven CAD systems. The effectiveness of ML models can be significantly enhanced by incorporating efficient features during the training phase. This empirical study presents a methodology for the classification of biomedical data using the FS technique. The proposed approach incorporates three soft computing-based optimization algorithms, namely Teaching Learning-Based Optimization (TLBO), Elephant Herding Optimization (EHO), and a proposed hybrid algorithm of these two. These algorithms were previously employed; however, their effectiveness in addressing FS issues in predicting human diseases has not been investigated. The following evaluation focuses on the categorization of benign and malignant tumours using the publicly available Wisconsin Diagnostic Breast Cancer (WDBC) benchmark dataset. The five-fold cross-validation technique is employed to mitigate the risk of over-fitting. The evaluation of the proposed approach's proficiency is determined based on several metrics, including sensitivity, specificity, precision, accuracy, area under the receiver-operating characteristic curve (AUC), and F1-score. The best value of accuracy computed through the suggested approach is 97.96%. The proposed clinical decision support system demonstrates a highly favourable classification performance outcome, making it a valuable tool for medical practitioners to utilize as a secondary opinion and reducing the overburden of expert medical practitioners.

摘要

计算机辅助诊断(CAD)系统通过有效减少时间和成本,在现代研究中发挥着至关重要的作用。这些系统通过高效检测异常以及提供准确可靠的信息,支持放射科医生等医疗专业人员进行决策。这些系统严重依赖于特征的有效选择,以便对高维生物数据进行准确分类。这些特征随后可协助诊断相关疾病。由于存在大量无关或冗余特征,在生物医学数据中识别模式的任务可能颇具挑战性。因此,提出并利用特征选择(FS)过程以消除这些特征至关重要。FS方法的主要目标是通过消除无关或信息较少的特征来提高分类的准确性。FS阶段在机器学习(ML)驱动的CAD系统中获得最优结果方面起着关键作用。在训练阶段纳入有效特征可显著提高ML模型的有效性。本实证研究提出了一种使用FS技术对生物医学数据进行分类的方法。所提出的方法纳入了三种基于软计算的优化算法,即基于教学学习的优化(TLBO)、大象群聚优化(EHO)以及这两者的一种混合算法。这些算法此前已被采用;然而,它们在解决预测人类疾病的FS问题方面的有效性尚未得到研究。以下评估聚焦于使用公开可用的威斯康星诊断乳腺癌(WDBC)基准数据集对良性和恶性肿瘤进行分类。采用五折交叉验证技术来降低过拟合风险。基于包括灵敏度、特异性、精确度、准确度、受试者工作特征曲线下面积(AUC)和F1分数在内的多个指标,确定所提出方法的熟练度评估。通过所建议方法计算出的最佳准确度值为97.96%。所提出的临床决策支持系统展现出极为良好的分类性能结果,使其成为医学从业者用作第二意见的宝贵工具,并减轻了专家医学从业者的负担。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a17/10920178/f397b3a3fe69/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验