Software Engineering, Qeshm Institute of Higher Education, Qeshm, Iran.
Escuela Tecnica Superior de Ingenieros de Telecomunicacion Politecnica de Madrid, Madrid, Spain.
J Cancer Res Clin Oncol. 2023 Nov;149(16):15171-15184. doi: 10.1007/s00432-023-05308-7. Epub 2023 Aug 27.
Microarray information is crucial for the identification and categorisation of malignant tissues. The very limited sample size in the microarray has always been a challenge for classification design in cancer research. As a result, by pre-processing gene selection approaches and genes lacking their information, the microarray data are deleted prior to categorisation. In essence, an appropriate gene selection technique can significantly increase the accuracy of illness (cancer) classification.
For the classification of high-dimensional microarray data, a novel approach based on the hybrid model of multi-objective particle swarm optimisation (MOPSO) is proposed in this research. First, a binary vector representing each particle's position is presented at random. A gene is represented by each bit. Bit 0 denotes the absence of selection of the characteristic (gene) corresponding to it, while bit 1 denotes the selection of the gene. Therefore, the position of each particle represents a set of genes, and the linear Bayesian discriminant analysis classification algorithm calculates each particle's degree of fitness to assess the quality of the gene set that particle has chosen. The suggested methodology is applied to four different cancer database sets, and the results are contrasted with those of other approaches currently in use.
The proposed algorithm has been applied on four sets of cancer database and its results have been compared with other existing methods. The results of the implementation show that the improvement of classification accuracy in the proposed algorithm compared to other methods for four sets of databases is 25.84% on average. So that it has improved by 18.63% in the blood cancer database, 24.25% in the lung cancer database, 27.73% in the breast cancer database, and 32.80% in the prostate cancer database. Therefore, the proposed algorithm is able to identify a small set of genes containing information in a way choose to increase the classification accuracy.
Our proposed solution is used for data classification, which also improves classification accuracy. This is possible because the MOPSO model removes redundancy and reduces the number of redundant and redundant genes by considering how genes are correlated with each other.
微阵列信息对于恶性组织的鉴定和分类至关重要。微阵列中非常有限的样本量一直是癌症研究中分类设计的挑战。因此,通过预处理基因选择方法和缺乏信息的基因,在分类之前删除微阵列数据。从本质上讲,适当的基因选择技术可以显著提高疾病(癌症)分类的准确性。
对于高维微阵列数据的分类,本研究提出了一种基于多目标粒子群优化(MOPSO)混合模型的新方法。首先,随机生成表示每个粒子位置的二进制向量。每个基因由一个比特表示。比特 0 表示对应特征(基因)的选择缺失,而比特 1 表示基因的选择。因此,每个粒子的位置代表一组基因,线性贝叶斯判别分析分类算法计算每个粒子的适应度程度,以评估粒子选择的基因集的质量。该方法应用于四个不同的癌症数据库集,并与目前使用的其他方法的结果进行了对比。
该算法应用于四个癌症数据库集,其结果与其他现有方法进行了比较。实施结果表明,与其他方法相比,该算法在四个数据库集上的分类精度提高了 25.84%。在血液癌数据库中提高了 18.63%,在肺癌数据库中提高了 24.25%,在乳腺癌数据库中提高了 27.73%,在前列腺癌数据库中提高了 32.80%。因此,该算法能够以选择增加分类准确性的方式识别包含信息的一小部分基因。
我们提出的解决方案用于数据分类,也可以提高分类准确性。这是因为 MOPSO 模型通过考虑基因之间的相关性来去除冗余并减少冗余和冗余基因的数量。