Suppr超能文献

基因表达数据的图形化探索:三种多元方法的比较研究

Graphical exploration of gene expression data: a comparative study of three multivariate methods.

作者信息

Wouters Luc, Göhlmann Hinrich W, Bijnens Luc, Kass Stefan U, Molenberghs Geert, Lewi Paul J

机构信息

Center for Statistics, Limburgs Universitair Centrum, transnationale Universiteit Limburg, Universitaire Campus, gebouw D, B-3590 Diepenbeek, Belgium.

出版信息

Biometrics. 2003 Dec;59(4):1131-9. doi: 10.1111/j.0006-341x.2003.00130.x.

Abstract

This article describes three multivariate projection methods and compares them for their ability to identify clusters of biological samples and genes using real-life data on gene expression levels of leukemia patients. It is shown that principal component analysis (PCA) has the disadvantage that the resulting principal factors are not very informative, while correspondence factor analysis (CFA) has difficulties interpreting distances between objects. Spectral map analysis (SMA) is introduced as an alternative approach to the analysis of microarray data. Weighted SMA outperforms PCA, and is at least as powerful as CFA, in finding clusters in the samples, as well as identifying genes related to these clusters. SMA addresses the problem of data analysis in microarray experiments in a more appropriate manner than CFA, and allows more flexible weighting to the genes and samples. Proper weighting is important, since it enables less reliable data to be down-weighted and more reliable information to be emphasized.

摘要

本文介绍了三种多元投影方法,并利用白血病患者基因表达水平的实际数据,比较了它们识别生物样本和基因簇的能力。结果表明,主成分分析(PCA)的缺点是所得主因子信息量不大,而对应因子分析(CFA)在解释对象间距离方面存在困难。引入谱图分析(SMA)作为微阵列数据分析的替代方法。加权谱图分析在样本聚类以及识别与这些聚类相关的基因方面优于主成分分析,并且至少与对应因子分析一样强大。谱图分析比对应因子分析更适当地解决了微阵列实验中的数据分析问题,并允许对基因和样本进行更灵活的加权。适当的加权很重要,因为它能降低可靠性较低的数据的权重,并强调更可靠的信息。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验