Suppr超能文献

探讨应用高性能遗传数据特征选择和分类算法进行结肠癌诊断。

Examining applying high performance genetic data feature selection and classification algorithms for colon cancer diagnosis.

机构信息

University of Huddersfield, Queensgate, Huddersfield, United Kingdom .

出版信息

Comput Methods Programs Biomed. 2017 Jul;146:11-24. doi: 10.1016/j.cmpb.2017.05.001. Epub 2017 May 4.

Abstract

BACKGROUND AND OBJECTIVES

This paper examines the accuracy and efficiency (time complexity) of high performance genetic data feature selection and classification algorithms for colon cancer diagnosis. The need for this research derives from the urgent and increasing need for accurate and efficient algorithms. Colon cancer is a leading cause of death worldwide, hence it is vitally important for the cancer tissues to be expertly identified and classified in a rapid and timely manner, to assure both a fast detection of the disease and to expedite the drug discovery process.

METHODS

In this research, a three-phase approach was proposed and implemented: Phases One and Two examined the feature selection algorithms and classification algorithms employed separately, and Phase Three examined the performance of the combination of these.

RESULTS

It was found from Phase One that the Particle Swarm Optimization (PSO) algorithm performed best with the colon dataset as a feature selection (29 genes selected) and from Phase Two that the Support Vector Machine (SVM) algorithm outperformed other classifications, with an accuracy of almost 86%. It was also found from Phase Three that the combined use of PSO and SVM surpassed other algorithms in accuracy and performance, and was faster in terms of time analysis (94%).

CONCLUSIONS

It is concluded that applying feature selection algorithms prior to classification algorithms results in better accuracy than when the latter are applied alone. This conclusion is important and significant to industry and society.

摘要

背景与目的

本文研究了高性能遗传数据特征选择和分类算法在结肠癌诊断中的准确性和效率(时间复杂度)。之所以需要进行这项研究,是因为迫切需要准确和高效的算法。结肠癌是全球主要的死亡原因之一,因此,快速、准确地识别和分类癌症组织至关重要,这不仅能快速发现疾病,还能加快药物研发进程。

方法

本研究提出并实施了三阶段方法:第一阶段和第二阶段分别检查了所采用的特征选择算法和分类算法,第三阶段检查了这些算法的组合性能。

结果

第一阶段发现粒子群优化(PSO)算法在结肠癌数据集上的特征选择(选择了 29 个基因)中表现最佳,第二阶段发现支持向量机(SVM)算法在分类方面优于其他算法,准确率接近 86%。第三阶段还发现,PSO 和 SVM 的联合使用在准确性和性能方面优于其他算法,在时间分析方面更快(94%)。

结论

与仅应用分类算法相比,在分类算法之前应用特征选择算法可提高准确性。这一结论对行业和社会具有重要意义。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验