Kong Sek Won
Department of Cardiology, Children's Hospital Boston, Harvard Medical School, Boston, MA, USA.
Methods Mol Biol. 2007;366:75-105. doi: 10.1007/978-1-59745-030-0_5.
By providing genome-scale information on gene expression, microarray technology has gained popularity in diverse areas including clinical medicine. However, the analysis and interpretation of microarray data are often complicated. This chapter describes various strategies for microarray data analysis. The analysis starts with the scanned image of a microarray. The image information is processed and summarized to numerical values that represent the abundance of transcripts. Technical variability and systematic biases can be minimized with the proper procedures of background correction and normalization. Considerable numbers of genes are not expressed or not detected by microarray technology. Those genes can be filtered out before further statistical comparison to reduce the dimensionality of the problem. The next step in analysis involves statistical comparison, cluster analysis, and visualization. Genes from the same cluster are considered to be coexpressed and/or coregulated. Also, we can group coexpressed genes into categories by their biological function and cellular location. By combining prior knowledge and statistical results, we can make an inference based on the gene expression profiles.
通过提供基因表达的全基因组规模信息,微阵列技术在包括临床医学在内的多个领域中受到了广泛关注。然而,微阵列数据的分析和解读往往很复杂。本章介绍了微阵列数据分析的各种策略。分析从微阵列的扫描图像开始。图像信息经过处理和汇总,转化为代表转录本丰度的数值。通过适当的背景校正和标准化程序,可以将技术变异性和系统偏差降至最低。相当数量的基因未被微阵列技术表达或检测到。在进行进一步的统计比较之前,可以将这些基因过滤掉,以降低问题的维度。分析的下一步涉及统计比较、聚类分析和可视化。来自同一聚类的基因被认为是共表达和/或共调控的。此外,我们可以根据共表达基因的生物学功能和细胞定位将它们分组。通过结合先验知识和统计结果,我们可以根据基因表达谱进行推断。