Department of Information Science and Engineering, AMC Engineering College, Bengaluru, Karnataka 560083, India.
Department of Computer Science and Engineering, SRM Institute of Science and Technology, Chennai, India.
J Healthc Eng. 2022 Feb 22;2022:5821938. doi: 10.1155/2022/5821938. eCollection 2022.
In the microarray gene expression data, there are a large number of genes that are expressed at varying levels of expression. Given that there are only a few critically significant genes, it is challenging to analyze and categorize datasets that span the whole gene space. In order to aid in the diagnosis of cancer disease and, as a consequence, the suggestion of individualized treatment, the discovery of biomarker genes is essential. Starting with a large pool of candidates, the parallelized minimal redundancy and maximum relevance ensemble (mRMRe) is used to choose the top m informative genes from a huge pool of candidates. A Genetic Algorithm (GA) is used to heuristically compute the ideal set of genes by applying the Mahalanobis Distance (MD) as a distance metric. Once the genes have been identified, they are input into the GA. It is used as a classifier to four microarray datasets using the approved approach (mRMRe-GA), with the Support Vector Machine (SVM) serving as the classification basis. Leave-One-Out-Cross-Validation (LOOCV) is a cross-validation technique for assessing the performance of a classifier. It is now being investigated if the proposed mRMRe-GA strategy can be compared to other approaches. It has been shown that the proposed mRMRe-GA approach enhances classification accuracy while employing less genetic material than previous methods. Microarray, Gene Expression Data, GA, Feature Selection, SVM, and Cancer Classification are some of the terms used in this paper.
在微阵列基因表达数据中,有大量基因以不同水平表达。鉴于只有少数关键显著基因,分析和分类跨越整个基因空间的数据集具有挑战性。为了帮助诊断癌症疾病,并因此建议个体化治疗,发现生物标志物基因至关重要。从大量候选者开始,并行最小冗余和最大相关性集成 (mRMRe) 用于从大量候选者中选择前 m 个信息丰富的基因。遗传算法 (GA) 被用来通过应用马氏距离 (MD) 作为距离度量来启发式地计算理想的基因集。一旦确定了基因,就将它们输入 GA 中。它被用作使用批准的方法 (mRMRe-GA) 的四个微阵列数据集的分类器,支持向量机 (SVM) 作为分类基础。留一交叉验证 (LOOCV) 是一种用于评估分类器性能的交叉验证技术。现在正在研究所提出的 mRMRe-GA 策略是否可以与其他方法进行比较。已经表明,所提出的 mRMRe-GA 方法在使用比以前方法更少的遗传物质的同时提高了分类准确性。微阵列、基因表达数据、GA、特征选择、SVM 和癌症分类是本文中使用的一些术语。