Suppr超能文献

基于微阵列的基因集分析:当前方法的比较。

Microarray-based gene set analysis: a comparison of current methods.

作者信息

Song Sarah, Black Michael A

机构信息

Department of Biochemistry, University of Otago, Dunedin, New Zealand.

出版信息

BMC Bioinformatics. 2008 Nov 27;9:502. doi: 10.1186/1471-2105-9-502.

Abstract

BACKGROUND

The analysis of gene sets has become a popular topic in recent times, with researchers attempting to improve the interpretability and reproducibility of their microarray analyses through the inclusion of supplementary biological information. While a number of options for gene set analysis exist, no consensus has yet been reached regarding which methodology performs best, and under what conditions. The goal of this work was to examine the performance characteristics of a collection of existing gene set analysis methods, on both simulated and real microarray data sets. Of particular interest was the potential utility gained through the incorporation of inter-gene correlation into the analysis process.

RESULTS

Each of six gene set analysis methods was applied to both simulated and publicly available microarray data sets. Overall, the various methodologies were all found to be better at detecting gene sets that moved from non-active (i.e., genes not expressed) to active states (or vice versa), rather than those that simply changed their level of activity. Methods which incorporate correlation structures were found to provide increased ability to detect altered gene sets in some settings.

CONCLUSION

Based on the results obtained through the analysis of simulated data, it is clear that the performance of gene set analysis methods is strongly influenced by the features of the data set in question, and that methods which incorporate correlation structures into the analysis process tend to achieve better performance, relative to methods which rely on univariate test statistics.

摘要

背景

近年来,基因集分析已成为一个热门话题,研究人员试图通过纳入补充生物学信息来提高其微阵列分析的可解释性和可重复性。虽然存在多种基因集分析选项,但对于哪种方法表现最佳以及在何种条件下表现最佳尚未达成共识。这项工作的目标是在模拟和真实微阵列数据集上检验一系列现有基因集分析方法的性能特征。特别感兴趣的是通过将基因间相关性纳入分析过程所获得的潜在效用。

结果

六种基因集分析方法中的每一种都应用于模拟和公开可用的微阵列数据集。总体而言,发现各种方法在检测从非活跃(即未表达的基因)转变为活跃状态(反之亦然)的基因集方面比那些只是简单改变其活性水平的基因集表现更好。发现在某些情况下,纳入相关结构的方法能够增强检测改变的基因集的能力。

结论

基于通过模拟数据分析获得的结果,很明显基因集分析方法的性能受到所讨论数据集特征的强烈影响,并且相对于依赖单变量检验统计量的方法,将相关结构纳入分析过程的方法往往能取得更好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/c3dd606a304c/1471-2105-9-502-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验