Steuer Ralf, Morgenthal Katja, Weckwerth Wolfram, Selbig Joachim
Institute of Biochemistry and Biology, University of Potsdam, Germany.
Methods Mol Biol. 2007;358:105-26. doi: 10.1007/978-1-59745-244-1_7.
Modern molecular biology crucially relies on computational tools to handle and interpret the large amounts of data that are generated by high-throughput measurements. To this end, much effort is dedicated to devise novel sophisticated methods that allow one to integrate, evaluate, and analyze biological data. However, prior to an application of specifically designed methods, simple and well-known statistical approaches often provide a more appropriate starting point for further analysis. This chapter seeks to describe several well-established approaches to data analysis, including various clustering techniques, discriminant function analysis, principal component analysis, multidimensional scaling, and classification trees. The chapter is accompanied by a webpage, describing the application of all algorithms in a ready-to-use format.
现代分子生物学在很大程度上依赖于计算工具来处理和解释高通量测量产生的大量数据。为此,人们投入了大量精力来设计新颖复杂的方法,以便能够整合、评估和分析生物数据。然而,在应用专门设计的方法之前,简单且广为人知的统计方法通常为进一步分析提供了更合适的起点。本章旨在描述几种成熟的数据分析方法,包括各种聚类技术、判别函数分析、主成分分析、多维缩放和分类树。本章还配有一个网页,以即用型格式描述所有算法的应用。