• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不同DNA微阵列数据分类算法的比较研究

Comparative Study of Classification Algorithms for Various DNA Microarray Data.

作者信息

Kim Jingeun, Yoon Yourim, Park Hye-Jin, Kim Yong-Hyuk

机构信息

Department of IT Convergence Engineering, Gachon University, Seongnam-daero 1342, Seongnam-si 13120, Korea.

Department of Computer Engineering, College of Information Technology, Gachon University, Seongnam-daero 1342, Sujeong-gu, Seongnam-si 13120, Korea.

出版信息

Genes (Basel). 2022 Mar 11;13(3):494. doi: 10.3390/genes13030494.

DOI:10.3390/genes13030494
PMID:35328048
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8951024/
Abstract

Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of classification algorithms over different data traits. The datasets were classified into test and control groups based on five utilized machine learning methods, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and -Nearest Neighbors (KNN), and the resulting accuracies were compared. -fold cross-validation was used in evaluating the performance and the result was analyzed by comparing the performances of the five machine learning methods. Through the experiments, it was observed that the two tree-based methods, DT and RF, showed similar trends in results and the remaining three methods, MLP, SVM, and DT, showed similar trends. DT and RF generally showed worse performance than other methods except for one dataset. This suggests that, for the effective classification of microarray data, selecting a classification algorithm that is suitable for data traits is crucial to ensure optimum performance.

摘要

微阵列是电气工程和技术在生物学中的应用,它允许同时测量众多基因的表达,并且可用于分析特定疾病。本研究对各种微阵列进行分类分析,以比较不同分类算法在不同数据特征上的性能。基于五种机器学习方法,包括多层感知器(MLP)、支持向量机(SVM)、决策树(DT)、随机森林(RF)和K近邻(KNN),将数据集分为测试组和对照组,并比较所得的准确率。在评估性能时使用了十折交叉验证,并通过比较这五种机器学习方法的性能来分析结果。通过实验观察到,两种基于树的方法,DT和RF,结果显示出相似的趋势,其余三种方法,MLP、SVM和KNN,也显示出相似的趋势。除了一个数据集外,DT和RF的性能通常比其他方法差。这表明,对于微阵列数据的有效分类,选择适合数据特征的分类算法对于确保最佳性能至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/59270d918cde/genes-13-00494-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/a76135371d99/genes-13-00494-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/e6e6ae00c503/genes-13-00494-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/85996e045cfa/genes-13-00494-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/d6e1dd4d3d45/genes-13-00494-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/172ac5203127/genes-13-00494-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/d0432aa63d6d/genes-13-00494-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/c8aa11c6dd50/genes-13-00494-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/f9fc28cb132e/genes-13-00494-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/18c9f41640fb/genes-13-00494-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/ada72c70348c/genes-13-00494-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/cce1ee3df0f0/genes-13-00494-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/e3a2324607e5/genes-13-00494-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/05ee7cf9af09/genes-13-00494-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/59270d918cde/genes-13-00494-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/a76135371d99/genes-13-00494-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/e6e6ae00c503/genes-13-00494-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/85996e045cfa/genes-13-00494-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/d6e1dd4d3d45/genes-13-00494-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/172ac5203127/genes-13-00494-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/d0432aa63d6d/genes-13-00494-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/c8aa11c6dd50/genes-13-00494-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/f9fc28cb132e/genes-13-00494-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/18c9f41640fb/genes-13-00494-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/ada72c70348c/genes-13-00494-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/cce1ee3df0f0/genes-13-00494-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/e3a2324607e5/genes-13-00494-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/05ee7cf9af09/genes-13-00494-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/8951024/59270d918cde/genes-13-00494-g014.jpg

相似文献

1
Comparative Study of Classification Algorithms for Various DNA Microarray Data.不同DNA微阵列数据分类算法的比较研究
Genes (Basel). 2022 Mar 11;13(3):494. doi: 10.3390/genes13030494.
2
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
3
Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach.巴西 COVID-19 检测优先级分类模型:机器学习方法。
J Med Internet Res. 2021 Apr 8;23(4):e27293. doi: 10.2196/27293.
4
A Comparison of Machine Learning Algorithms and Feature Sets for Automatic Vocal Emotion Recognition in Speech.机器学习算法和特征集在语音自动情感识别中的比较
Sensors (Basel). 2022 Oct 6;22(19):7561. doi: 10.3390/s22197561.
5
Comparison of Classification Success Rates of Different Machine Learning Algorithms in the Diagnosis of Breast Cancer.不同机器学习算法在乳腺癌诊断中的分类成功率比较。
Asian Pac J Cancer Prev. 2022 Oct 1;23(10):3287-3297. doi: 10.31557/APJCP.2022.23.10.3287.
6
Human lung cancer classification and comprehensive analysis using different machine learning techniques.使用不同机器学习技术的人类肺癌分类与综合分析
Microsc Res Tech. 2025 Jan;88(1):234-250. doi: 10.1002/jemt.24682. Epub 2024 Sep 18.
7
Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模?对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.
8
Application of supervised machine learning algorithms in the classification of sagittal gait patterns of cerebral palsy children with spastic diplegia.监督机器学习算法在痉挛性双瘫脑瘫儿童矢状面步态模式分类中的应用。
Comput Biol Med. 2019 Mar;106:33-39. doi: 10.1016/j.compbiomed.2019.01.009. Epub 2019 Jan 16.
9
Artificial Intelligence Algorithm-Based Economic Denial of Sustainability Attack Detection Systems: Cloud Computing Environments.基于人工智能算法的经济可持续性否认攻击检测系统:云计算环境。
Sensors (Basel). 2022 Jun 21;22(13):4685. doi: 10.3390/s22134685.
10
Efficient Model for Coronary Artery Disease Diagnosis: A Comparative Study of Several Machine Learning Algorithms.用于冠心病诊断的高效模型:几种机器学习算法的比较研究。
J Healthc Eng. 2022 Oct 18;2022:5359540. doi: 10.1155/2022/5359540. eCollection 2022.

引用本文的文献

1
Critical gene network and signaling pathway analysis of the extracellular signal-regulated kinase (ERK) pathway in ischemic stroke.缺血性卒中细胞外信号调节激酶(ERK)通路的关键基因网络和信号通路分析
Front Mol Neurosci. 2025 Jun 25;18:1604670. doi: 10.3389/fnmol.2025.1604670. eCollection 2025.

本文引用的文献

1
Deep-Learning-Based Cancer Profiles Classification Using Gene Expression Data Profile.基于深度学习的基因表达数据特征的癌症谱分类。
J Healthc Eng. 2022 Jan 7;2022:4715998. doi: 10.1155/2022/4715998. eCollection 2022.
2
Hybrid gene selection approach using XGBoost and multi-objective genetic algorithm for cancer classification.基于 XGBoost 和多目标遗传算法的混合基因选择方法在癌症分类中的应用。
Med Biol Eng Comput. 2022 Mar;60(3):663-681. doi: 10.1007/s11517-021-02476-x. Epub 2022 Jan 13.
3
Ensemble feature selection for stable biomarker identification and cancer classification from microarray expression data.
基于微阵列表达数据的稳定生物标志物识别和癌症分类的集成特征选择。
Comput Biol Med. 2022 Mar;142:105208. doi: 10.1016/j.compbiomed.2021.105208. Epub 2022 Jan 5.
4
Gene selection for microarray data classification via multi-objective graph theoretic-based method.基于多目标图论方法的微阵列数据分类基因选择
Artif Intell Med. 2022 Jan;123:102228. doi: 10.1016/j.artmed.2021.102228. Epub 2021 Dec 3.
5
A comparative study of machine learning and deep learning algorithms to classify cancer types based on microarray gene expression data.基于微阵列基因表达数据对癌症类型进行分类的机器学习和深度学习算法的比较研究。
PeerJ Comput Sci. 2020 Apr 13;6:e270. doi: 10.7717/peerj-cs.270. eCollection 2020.
6
Upper airway gene expression reveals suppressed immune responses to SARS-CoV-2 compared with other respiratory viruses.上呼吸道基因表达显示,与其他呼吸道病毒相比,SARS-CoV-2 抑制了免疫反应。
Nat Commun. 2020 Nov 17;11(1):5854. doi: 10.1038/s41467-020-19587-y.
7
Transcriptomic changes in the nasal epithelium associated with diesel engine exhaust exposure.与柴油机尾气暴露相关的鼻上皮细胞转录组变化。
Environ Int. 2020 Apr;137:105506. doi: 10.1016/j.envint.2020.105506. Epub 2020 Feb 7.
8
scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data.scPred:一种准确的有监督方法,用于对单细胞 RNA-seq 数据进行细胞类型分类。
Genome Biol. 2019 Dec 12;20(1):264. doi: 10.1186/s13059-019-1862-5.
9
Unsupervised feature selection algorithm for multiclass cancer classification of gene expression RNA-Seq data.无监督特征选择算法在基因表达 RNA-Seq 数据的多类癌症分类中的应用。
Genomics. 2020 Mar;112(2):1916-1925. doi: 10.1016/j.ygeno.2019.11.004. Epub 2019 Nov 20.
10
Biological classification with RNA-seq data: Can alternatively spliced transcript expression enhance machine learning classifiers?基于 RNA-seq 数据的生物学分类:剪接转录本表达能否增强机器学习分类器?
RNA. 2018 Sep;24(9):1119-1132. doi: 10.1261/rna.062802.117. Epub 2018 Jun 25.