Department of Brain and Cognitive Engineering, Korea University, Anam-dong, Seongbuk-gu, Seoul 02841, Republic of Korea.
Department of Electronics and Communication Engineering, Bannari Amman Institute of Technology, Sathyamangalam, India.
Biomed Res Int. 2020 Oct 13;2020:8427574. doi: 10.1155/2020/8427574. eCollection 2020.
One of the deadliest diseases which affects the large intestine is colon cancer. Older adults are typically affected by colon cancer though it can happen at any age. It generally starts as small benign growth of cells that forms on the inside of the colon, and later, it develops into cancer. Due to the propagation of somatic alterations that affects the gene expression, colon cancer is caused. A standardized format for assessing the expression levels of thousands of genes is provided by the DNA microarray technology. The tumors of various anatomical regions can be distinguished by the patterns of gene expression in microarray technology. As the microarray data is too huge to process due to the curse of dimensionality problem, an amalgamated approach of utilizing bilevel feature selection techniques is proposed in this paper. In the first level, the genes or the features are dimensionally reduced with the help of Multivariate Minimum Redundancy-Maximum Relevance (MRMR) technique. Then, in the second level, six optimization techniques are utilized in this work for selecting the best genes or features before proceeding to classification process. The optimization techniques considered in this work are Invasive Weed Optimization (IWO), Teaching Learning-Based Optimization (TLBO), League Championship Optimization (LCO), Beetle Antennae Search Optimization (BASO), Crow Search Optimization (CSO), and Fruit Fly Optimization (FFO). Finally, it is classified with five suitable classifiers, and the best results show when IWO is utilized with MRMR, and then classified with Quadratic Discriminant Analysis (QDA), a classification accuracy of 99.16% is obtained.
一种影响大肠的最致命疾病之一是结肠癌。尽管结肠癌通常发生在老年人身上,但它也可能发生在任何年龄。它通常始于大肠内部的小良性细胞生长,然后发展成癌症。由于影响基因表达的体细胞改变的传播,导致了结肠癌。DNA 微阵列技术为评估数千个基因的表达水平提供了标准化格式。微阵列技术可以通过基因表达模式来区分不同解剖区域的肿瘤。由于微阵列数据由于维度问题的诅咒而过于庞大,无法处理,因此本文提出了一种利用双层特征选择技术的综合方法。在第一级,利用多元最小冗余-最大相关性 (MRMR) 技术对基因或特征进行降维。然后,在第二级,在进行分类过程之前,利用六种优化技术来选择最佳基因或特征。在这项工作中考虑的优化技术是入侵杂草优化 (IWO)、基于教学的优化 (TLBO)、联赛冠军优化 (LCO)、甲壳虫触角搜索优化 (BASO)、乌鸦搜索优化 (CSO) 和果蝇优化 (FFO)。最后,使用五个合适的分类器进行分类,当 IWO 与 MRMR 一起使用,然后使用二次判别分析 (QDA) 进行分类时,会得到最佳结果,获得 99.16%的分类准确性。