GEMS：一种用于从微阵列基因表达数据中进行癌症自动诊断和生物标志物发现的系统。

GEMS: a system for automated cancer diagnosis and biomarker discovery from microarray gene expression data.

作者信息

Statnikov Alexander, Tsamardinos Ioannis, Dosbayev Yerbolat, Aliferis Constantin F

机构信息

Discovery Systems Laboratory, Department of Biomedical Informatics, Vanderbilt University, 2209 Garland Avenue, Nashville, TN 37232, USA.

出版信息

Int J Med Inform. 2005 Aug;74(7-8):491-503. doi: 10.1016/j.ijmedinf.2005.05.002.

DOI:10.1016/j.ijmedinf.2005.05.002

PMID:15967710

Abstract

The success of treatment of patients with cancer depends on establishing an accurate diagnosis. To this end, we have built a system called GEMS (gene expression model selector) for the automated development and evaluation of high-quality cancer diagnostic models and biomarker discovery from microarray gene expression data. In order to determine and equip the system with the best performing diagnostic methodologies in this domain, we first conducted a comprehensive evaluation of classification algorithms using 11 cancer microarray datasets. In this paper we present a preliminary evaluation of the system with five new datasets. The performance of the models produced automatically by GEMS is comparable or better than the results obtained by human analysts. Additionally, we performed a cross-dataset evaluation of the system. This involved using a dataset to build a diagnostic model and to estimate its future performance, then applying this model and evaluating its performance on a different dataset. We found that models produced by GEMS indeed perform well in independent samples and, furthermore, the cross-validation performance estimates output by the system approximate well the error obtained by the independent validation. GEMS is freely available for download for non-commercial use from http://www.gems-system.org.

摘要

癌症患者的治疗成功取决于准确的诊断。为此，我们构建了一个名为GEMS（基因表达模型选择器）的系统，用于从微阵列基因表达数据中自动开发和评估高质量的癌症诊断模型以及发现生物标志物。为了确定并为该系统配备该领域中性能最佳的诊断方法，我们首先使用11个癌症微阵列数据集对分类算法进行了全面评估。在本文中，我们用五个新数据集对该系统进行了初步评估。GEMS自动生成的模型性能与人类分析人员获得的结果相当或更好。此外，我们对该系统进行了跨数据集评估。这包括使用一个数据集构建诊断模型并估计其未来性能，然后应用该模型并在另一个不同的数据集上评估其性能。我们发现，GEMS生成的模型在独立样本中确实表现良好，而且，该系统输出的交叉验证性能估计值与独立验证所获得的误差非常接近。GEMS可从http://www.gems-system.org免费下载供非商业使用。

相似文献

GEMS: a system for automated cancer diagnosis and biomarker discovery from microarray gene expression data.GEMS：一种用于从微阵列基因表达数据中进行癌症自动诊断和生物标志物发现的系统。

Int J Med Inform. 2005 Aug;74(7-8):491-503. doi: 10.1016/j.ijmedinf.2005.05.002.

Methods for multi-category cancer diagnosis from gene expression data: a comprehensive evaluation to inform decision support system development.基于基因表达数据的多类别癌症诊断方法：为决策支持系统开发提供信息的综合评估

Stud Health Technol Inform. 2004;107(Pt 2):813-7.

Reliable gene signatures for microarray classification: assessment of stability and performance.用于微阵列分类的可靠基因特征：稳定性和性能评估

Bioinformatics. 2006 Oct 1;22(19):2356-63. doi: 10.1093/bioinformatics/btl400. Epub 2006 Jul 31.

Gene selection in cancer classification using sparse logistic regression with Bayesian regularization.使用带贝叶斯正则化的稀疏逻辑回归进行癌症分类中的基因选择。

Bioinformatics. 2006 Oct 1;22(19):2348-55. doi: 10.1093/bioinformatics/btl386. Epub 2006 Jul 14.

A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis.用于微阵列基因表达癌症诊断的多类别分类方法的综合评估。

Bioinformatics. 2005 Mar 1;21(5):631-43. doi: 10.1093/bioinformatics/bti033. Epub 2004 Sep 16.

Dependence network modeling for biomarker identification.用于生物标志物识别的依赖网络建模

Bioinformatics. 2007 Jan 15;23(2):198-206. doi: 10.1093/bioinformatics/btl553. Epub 2006 Oct 31.

Robust classification modeling on microarray data using misclassification penalized posterior.使用误分类惩罚后验对微阵列数据进行稳健分类建模。

Bioinformatics. 2005 Jun;21 Suppl 1:i423-30. doi: 10.1093/bioinformatics/bti1020.

Independent component analysis-based penalized discriminant method for tumor classification using gene expression data.基于独立成分分析的惩罚判别方法用于利用基因表达数据进行肿瘤分类

Bioinformatics. 2006 Aug 1;22(15):1855-62. doi: 10.1093/bioinformatics/btl190. Epub 2006 May 18.

Gene selection using support vector machines with non-convex penalty.使用具有非凸惩罚项的支持向量机进行基因选择。

Bioinformatics. 2006 Jan 1;22(1):88-95. doi: 10.1093/bioinformatics/bti736. Epub 2005 Oct 25.

Class discovery from gene expression data based on perturbation and cluster ensemble.基于扰动和聚类集成从基因表达数据中发现类别

IEEE Trans Nanobioscience. 2009 Jun;8(2):147-60. doi: 10.1109/TNB.2009.2023321. Epub 2009 Jun 2.

引用本文的文献

Machine Learning and Graph Signal Processing Applied to Healthcare: A Review.应用于医疗保健的机器学习与图信号处理：综述

Bioengineering (Basel). 2024 Jul 2;11(7):671. doi: 10.3390/bioengineering11070671.

A Novel Hybrid Runge Kutta Optimizer with Support Vector Machine on Gene Expression Data for Cancer Classification.一种基于基因表达数据的新型混合龙格-库塔优化器与支持向量机用于癌症分类

Diagnostics (Basel). 2023 May 3;13(9):1621. doi: 10.3390/diagnostics13091621.

Low-precision feature selection on microarray data: an information theoretic approach.基于信息论的微阵列数据低精度特征选择。

Med Biol Eng Comput. 2022 May;60(5):1333-1345. doi: 10.1007/s11517-022-02508-0. Epub 2022 Mar 22.

Identification of Key Genes and Pathways in Osteosarcoma by Bioinformatics Analysis.生物信息学分析鉴定骨肉瘤中的关键基因和通路。

Comput Math Methods Med. 2022 Jan 15;2022:7549894. doi: 10.1155/2022/7549894. eCollection 2022.

One-Step Robust Low-Rank Subspace Segmentation for Tumor Sample Clustering.一步稳健的低秩子空间分割用于肿瘤样本聚类。

Comput Intell Neurosci. 2021 Dec 8;2021:9990297. doi: 10.1155/2021/9990297. eCollection 2021.

Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier.基于新型可解释集成基因选择算法和分类器的脑癌预测

Diagnostics (Basel). 2021 Oct 19;11(10):1936. doi: 10.3390/diagnostics11101936.

Feature Selection and Feature Stability Measurement Method for High-Dimensional Small Sample Data Based on Big Data Technology.基于大数据技术的高维小样本数据特征选择与特征稳定性测量方法。

Comput Intell Neurosci. 2021 Sep 23;2021:3597051. doi: 10.1155/2021/3597051. eCollection 2021.

Metabolomic profiling of microbial disease etiology in community-acquired pneumonia.社区获得性肺炎中微生物病因的代谢组学分析。

PLoS One. 2021 Jun 4;16(6):e0252378. doi: 10.1371/journal.pone.0252378. eCollection 2021.

A comparative study of machine learning and deep learning algorithms to classify cancer types based on microarray gene expression data.基于微阵列基因表达数据对癌症类型进行分类的机器学习和深度学习算法的比较研究。

PeerJ Comput Sci. 2020 Apr 13;6:e270. doi: 10.7717/peerj-cs.270. eCollection 2020.

The Role of Surface Chemistry in the Efficacy of Protein and DNA Microarrays for Label-Free Detection: An Overview.表面化学在蛋白质和DNA微阵列无标记检测效能中的作用：综述

Polymers (Basel). 2021 Mar 26;13(7):1026. doi: 10.3390/polym13071026.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

GEMS：一种用于从微阵列基因表达数据中进行癌症自动诊断和生物标志物发现的系统。

GEMS: a system for automated cancer diagnosis and biomarker discovery from microarray gene expression data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献