通过 ℓ1 最小化对具有基因表达数据的肿瘤分类中的欠定系统进行稀疏表示。

Sparse representation via ℓ1-minimization for underdetermined systems in classification of tumors with gene expression data.

作者信息

Sánchez R, Argáez M, Guillén P

机构信息

University of Texas aEl Paso, TX 79968, USA.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:3362-6. doi: 10.1109/IEMBS.2011.6090911.

DOI:10.1109/IEMBS.2011.6090911

PMID:22255060

Abstract

The development of cancer diagnosis models and cancer discovery from DNA microarray data are of great interest in bioinformatics and medicine. In pattern recognition and machine learning, a classification problem refers to finding an algorithm for assigning a given input data into one of several categories. Many natural signals are sparse or compressible in the sense that they have short representations when expressed in a suitable basis. Motivated by the recent successful algorithm developments for sparse signal recovery, we apply the selective nature of sparse representation to perform the above mentioned classification. In order to find such sparse representation we implement an ℓ(1)-minimization algorithm. This methodology overcomes the lack of robustness with respect to outliers. In contrast to other classification algorithms, no model selection dependency is involved. The minimization algorithm is a convex relaxation-like that has been proven to efficiently recover sparse signals. To study its performance, the proposed method is applied to six tumor gene expression datasets and numerically compared with various support vector machine methods (SVM). The numerical results show that the ℓ(1)-minimization algorithm proposed performs at least comparably and often better than SVMs.

摘要

从DNA微阵列数据中开发癌症诊断模型以及发现癌症，在生物信息学和医学领域备受关注。在模式识别和机器学习中，分类问题是指找到一种算法，将给定的输入数据分配到几个类别之一。许多自然信号在某种意义上是稀疏的或可压缩的，即在合适的基下表示时具有简短的形式。受近期稀疏信号恢复算法成功发展的启发，我们应用稀疏表示的选择性来执行上述分类。为了找到这种稀疏表示，我们实现了一种ℓ(1)最小化算法。该方法克服了对异常值缺乏鲁棒性的问题。与其他分类算法不同，它不涉及模型选择的依赖性。最小化算法类似于一种凸松弛，已被证明能有效地恢复稀疏信号。为了研究其性能，将所提出的方法应用于六个肿瘤基因表达数据集，并与各种支持向量机方法（SVM）进行数值比较。数值结果表明，所提出的ℓ(1)最小化算法的性能至少与支持向量机相当，且通常优于支持向量机。

相似文献

Sparse representation via ℓ1-minimization for underdetermined systems in classification of tumors with gene expression data.

Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:3362-6. doi: 10.1109/IEMBS.2011.6090911.

Accurate prediction of major histocompatibility complex class II epitopes by sparse representation via ℓ 1-minimization.

BioData Min. 2014 Nov 4;7:23. doi: 10.1186/1756-0381-7-23. eCollection 2014.

Sparse representation for classification of tumors using gene expression data.

J Biomed Biotechnol. 2009;2009:403689. doi: 10.1155/2009/403689. Epub 2009 Mar 15.

SGL-SVM: A novel method for tumor classification via support vector machine with sparse group Lasso.

J Theor Biol. 2020 Feb 7;486:110098. doi: 10.1016/j.jtbi.2019.110098. Epub 2019 Nov 28.

Metasample-based sparse representation for tumor classification.

IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1273-82. doi: 10.1109/TCBB.2011.20.

A Novel Gene Selection Method Based on Sparse Representation and Max-Relevance and Min-Redundancy.

Comb Chem High Throughput Screen. 2017;20(2):158-163. doi: 10.2174/1386207320666170126114051.

MLSeq: Machine learning interface for RNA-sequencing data.

Comput Methods Programs Biomed. 2019 Jul;175:223-231. doi: 10.1016/j.cmpb.2019.04.007. Epub 2019 Apr 29.

Incorporating EBO-HSIC with SVM for Gene Selection Associated with Cervical Cancer Classification.

J Med Syst. 2018 Oct 6;42(11):225. doi: 10.1007/s10916-018-1092-5.

Training sparse least squares support vector machines by the QR decomposition.

Neural Netw. 2018 Oct;106:175-184. doi: 10.1016/j.neunet.2018.07.008. Epub 2018 Jul 19.

Application of Sparse Representation in Bioinformatics.

Front Genet. 2021 Dec 15;12:810875. doi: 10.3389/fgene.2021.810875. eCollection 2021.

引用本文的文献

A novel sparse coding algorithm for classification of tumors based on gene expression data.

Med Biol Eng Comput. 2016 Jun;54(6):869-76. doi: 10.1007/s11517-015-1382-8. Epub 2015 Sep 4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过 ℓ1 最小化对具有基因表达数据的肿瘤分类中的欠定系统进行稀疏表示。

Sparse representation via ℓ1-minimization for underdetermined systems in classification of tumors with gene expression data.

作者信息

Sánchez R, Argáez M, Guillén P

机构信息

University of Texas aEl Paso, TX 79968, USA.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:3362-6. doi: 10.1109/IEMBS.2011.6090911.

DOI:10.1109/IEMBS.2011.6090911

PMID:22255060

Abstract

摘要

通过 ℓ1 最小化对具有基因表达数据的肿瘤分类中的欠定系统进行稀疏表示。

Sparse representation via ℓ1-minimization for underdetermined systems in classification of tumors with gene expression data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

通过 ℓ1 最小化对具有基因表达数据的肿瘤分类中的欠定系统进行稀疏表示。

Sparse representation via ℓ1-minimization for underdetermined systems in classification of tumors with gene expression data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献