Chia Burton Kuan Hui, Karuturi R Krishna Murthy
Computational & Systems Biology, Genome Institute of Singapore, A-STAR, 60 Biopolis ST, Singapore.
Algorithms Mol Biol. 2010 May 28;5:23. doi: 10.1186/1748-7188-5-23.
Biclustering is an important analysis procedure to understand the biological mechanisms from microarray gene expression data. Several algorithms have been proposed to identify biclusters, but very little effort was made to compare the performance of different algorithms on real datasets and combine the resultant biclusters into one unified ranking.
In this paper we propose differential co-expression framework and a differential co-expression scoring function to objectively quantify quality or goodness of a bicluster of genes based on the observation that genes in a bicluster are co-expressed in the conditions belonged to the bicluster and not co-expressed in the other conditions. Furthermore, we propose a scoring function to stratify biclusters into three types of co-expression. We used the proposed scoring functions to understand the performance and behavior of the four well established biclustering algorithms on six real datasets from different domains by combining their output into one unified ranking.
Differential co-expression framework is useful to provide quantitative and objective assessment of the goodness of biclusters of co-expressed genes and performance of biclustering algorithms in identifying co-expression biclusters. It also helps to combine the biclusters output by different algorithms into one unified ranking i.e. meta-biclustering.
双聚类分析是从微阵列基因表达数据中理解生物学机制的重要分析方法。已经提出了几种算法来识别双聚类,但在真实数据集上比较不同算法的性能并将所得双聚类组合成一个统一排名方面所做的工作很少。
在本文中,我们提出了差异共表达框架和差异共表达评分函数,基于双聚类中的基因在属于该双聚类的条件下共表达而在其他条件下不共表达这一观察结果,客观地量化基因双聚类的质量或优劣。此外,我们提出了一个评分函数,将双聚类分为三种共表达类型。我们使用所提出的评分函数,通过将四种成熟的双聚类算法在六个来自不同领域的真实数据集上的输出组合成一个统一排名,来了解这些算法的性能和行为。
差异共表达框架有助于对共表达基因双聚类的优劣以及双聚类算法在识别共表达双聚类方面的性能进行定量和客观评估。它还有助于将不同算法输出的双聚类组合成一个统一排名,即元双聚类。