A simple and robust algorithm for microarray data clustering based on gene population-variance ratio metric.

Author information

Chatterjee Soumyadeep, Bhattacharjee Kasturi, Konar Amit

Affiliation

Artificial Intelligence Laboratory, Jadavpur University, Kolkata, India.

Publication information

Biotechnol J. 2009 Sep;4(9):1357-61. doi: 10.1002/biot.200800219.

DOI:10.1002/biot.200800219

PMID:19579218

Abstract

With the advent of the microarray technology, the field of life science has been greatly revolutionized, since this technique allows the simultaneous monitoring of the expression levels of thousands of genes in a particular organism. However, the statistical analysis of expression data has its own challenges, primarily because of the huge amount of data that is to be dealt with, and also because of the presence of noise, which is almost an inherent characteristic of microarray data. Clustering is one tool used to mine meaningful patterns from microarray data. In this paper, we present a novel method of clustering yeast microarray data, which is robust and yet simple to implement. It identifies the best clusters from a given dataset on the basis of the population of the clusters as well as the variance of the feature values of the members from the cluster-center. It has been found to yield satisfactory results even in the presence of noisy data.
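The abstract says clusters are judged by their population together with the variance of their members' feature values about the cluster center, but it does not give the formula. The sketch below is therefore only an illustration under stated assumptions: it scores a cluster as population divided by the mean squared distance of its members from the centroid, so that large, tight clusters rank highest. The function names, the `eps` guard, and the toy data are hypothetical and are not taken from the paper.

```python
from statistics import fmean


def centroid(members):
    """Component-wise mean of the member expression profiles."""
    dims = len(members[0])
    return [fmean(m[d] for m in members) for d in range(dims)]


def variance_from_center(members, center):
    """Mean squared Euclidean distance of the members from the center."""
    return fmean(
        sum((x - c) ** 2 for x, c in zip(m, center)) for m in members
    )


def population_variance_ratio(members, eps=1e-9):
    """Assumed score: cluster population divided by its variance.

    eps (an assumption) avoids division by zero when all members coincide.
    """
    center = centroid(members)
    return len(members) / (variance_from_center(members, center) + eps)


def rank_clusters(clusters):
    """Return cluster names ordered from highest to lowest score."""
    return sorted(
        clusters,
        key=lambda name: population_variance_ratio(clusters[name]),
        reverse=True,
    )


# Toy data: two clusters of equal population, one tight and one spread out.
clusters = {
    "tight": [[1.0, 1.0], [1.1, 0.9], [0.9, 1.1], [1.0, 1.0]],
    "loose": [[0.0, 0.0], [4.0, 4.0], [0.0, 4.0], [4.0, 0.0]],
}
ranking = rank_clusters(clusters)
print(ranking)
```

With equal populations, the tighter cluster gets the higher score. How the published method actually combines the two quantities, and how it forms candidate clusters in the first place, would need to be checked against the full paper.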

Similar articles

Microarray data clustering based on temporal variation: FCV with TSD preclustering.

Appl Bioinformatics. 2003;2(1):35-45.

Model-based clustering on the unit sphere with an illustration using gene expression profiles.

Biostatistics. 2008 Jan;9(1):66-80. doi: 10.1093/biostatistics/kxm012. Epub 2007 Apr 27.

Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.

Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.

Incorporating gene functions as priors in model-based clustering of microarray gene expression data.

Bioinformatics. 2006 Apr 1;22(7):795-801. doi: 10.1093/bioinformatics/btl011. Epub 2006 Jan 24.

Finding multiple coherent biclusters in microarray data using variable string length multiobjective genetic algorithm.

IEEE Trans Inf Technol Biomed. 2009 Nov;13(6):969-75. doi: 10.1109/TITB.2009.2017527. Epub 2009 Mar 16.

Techniques for clustering gene expression data.

Comput Biol Med. 2008 Mar;38(3):283-93. doi: 10.1016/j.compbiomed.2007.11.001. Epub 2007 Dec 3.

Novel technique for preprocessing high dimensional time-course data from DNA microarray: mathematical model-based clustering.

Bioinformatics. 2006 Apr 1;22(7):843-8. doi: 10.1093/bioinformatics/btl016. Epub 2006 Jan 24.

Clustering of change patterns using Fourier coefficients.

Bioinformatics. 2008 Jan 15;24(2):184-91. doi: 10.1093/bioinformatics/btm568. Epub 2007 Nov 19.

Investigation of self-organizing oscillator networks for use in clustering microarray data.

IEEE Trans Nanobioscience. 2008 Mar;7(1):65-79. doi: 10.1109/TNB.2008.2000151.

Cited by

Alteration of the Risk of Oral Pre-Cancer and Cancer in North India Population by CYP1A1 Polymorphism Genotypes and Haplotype.

Asian Pac J Cancer Prev. 2019 Feb 26;20(2):345-354. doi: 10.31557/APJCP.2019.20.2.345.
