基于图形聚类方法的多亚类癌症DNA微阵列分析

Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.

作者信息

Kawamura Takashi, Mutoh Hironori, Tomita Yasuyuki, Kato Ryuji, Honda Hiroyuki

机构信息

Department of Biotechnology, School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Japan.

出版信息

J Biosci Bioeng. 2008 Nov;106(5):442-8. doi: 10.1263/jbb.106.442.

DOI:10.1263/jbb.106.442

PMID:19111639

Abstract

It is well known that various genes related to cell cycle, cell-cell adhesion, and transcriptional regulation cause the onset of cancer. Moreover, environmental factors including age, sex, and lifestyle can also contribute to the onset of cancer. Therefore, it is difficult to ascertain which factors influence the onset. Thus, patients suffering from same disease can be divided into several distinct groups. In the present study, we applied graph-based clustering to several DNA microarray datasets before the classification analysis. Several clusters formed by the graph-based clustering were used for the construction of multi-class classification model with the k-nearest neighbor and for finding genes, which are specific to a certain cluster, by One vs. Others classification. Using this approach, the classification model was constructed for four microarray datasets, leukemia, breast cancer, prostate cancer, and colon cancer, and the accuracies of classification with k-nearest neighbor were all more than 80%. And in the breast cancer dataset, we succeeded in finding genes that are specific in a cluster consisting of 38 control group samples. These results indicate the importance of sample clustering before classification model construction.

摘要

众所周知，与细胞周期、细胞间黏附以及转录调控相关的各种基因会引发癌症。此外，包括年龄、性别和生活方式在内的环境因素也可能促使癌症的发生。因此，很难确定哪些因素会影响癌症的发生。如此一来，患有相同疾病的患者可被分为几个不同的组。在本研究中，我们在分类分析之前将基于图的聚类方法应用于几个DNA微阵列数据集。基于图的聚类所形成的几个簇被用于构建k近邻多类分类模型，并通过一对多分类来寻找特定于某个簇的基因。使用这种方法，针对白血病、乳腺癌、前列腺癌和结肠癌这四个微阵列数据集构建了分类模型，并且k近邻分类的准确率均超过80%。在乳腺癌数据集中，我们成功找到了在由38个对照组样本组成的一个簇中具有特异性的基因。这些结果表明了在构建分类模型之前进行样本聚类的重要性。

相似文献

Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.

J Biosci Bioeng. 2008 Nov;106(5):442-8. doi: 10.1263/jbb.106.442.

Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.

Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.

Tumor classification ranking from microarray data.

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.

Graph-based consensus clustering for class discovery from gene expression data.

Bioinformatics. 2007 Nov 1;23(21):2888-96. doi: 10.1093/bioinformatics/btm463. Epub 2007 Sep 14.

Gene selection for classification of cancers using probabilistic model building genetic algorithm.

Biosystems. 2005 Dec;82(3):208-25. doi: 10.1016/j.biosystems.2005.07.003. Epub 2005 Aug 22.

Class discovery from gene expression data based on perturbation and cluster ensemble.

IEEE Trans Nanobioscience. 2009 Jun;8(2):147-60. doi: 10.1109/TNB.2009.2023321. Epub 2009 Jun 2.

Clustering microarray gene expression data using weighted Chinese restaurant process.

Bioinformatics. 2006 Aug 15;22(16):1988-97. doi: 10.1093/bioinformatics/btl284. Epub 2006 Jun 9.

Clustering of change patterns using Fourier coefficients.

Bioinformatics. 2008 Jan 15;24(2):184-91. doi: 10.1093/bioinformatics/btm568. Epub 2007 Nov 19.

Multiclass cancer classification by support vector machines with class-wise optimized genes and probability estimates.

J Theor Biol. 2009 Aug 7;259(3):533-40. doi: 10.1016/j.jtbi.2009.04.013. Epub 2009 May 3.

A stable iterative method for refining discriminative gene clusters.

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S18. doi: 10.1186/1471-2164-9-S2-S18.

引用本文的文献

Exploring Prognostic Immune Microenvironment-Related Genes in Head and Neck Squamous Cell Carcinoma from the TCGA Database.

J Cancer. 2024 Jan 1;15(3):632-644. doi: 10.7150/jca.89581. eCollection 2024.

Impact of chromosome 17q deletion in the primary lesion of colorectal cancer on liver metastasis.

Oncol Lett. 2016 Dec;12(6):4773-4778. doi: 10.3892/ol.2016.5271. Epub 2016 Oct 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于图形聚类方法的多亚类癌症DNA微阵列分析

Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.

作者信息

Kawamura Takashi, Mutoh Hironori, Tomita Yasuyuki, Kato Ryuji, Honda Hiroyuki

机构信息

Department of Biotechnology, School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Japan.

出版信息

J Biosci Bioeng. 2008 Nov;106(5):442-8. doi: 10.1263/jbb.106.442.

DOI:10.1263/jbb.106.442

PMID:19111639

Abstract

摘要

基于图形聚类方法的多亚类癌症DNA微阵列分析

Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于图形聚类方法的多亚类癌症DNA微阵列分析

Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.

作者信息

机构信息

出版信息

相似文献

引用本文的文献