Zhao Qing, Shi Xingjie, Huang Jian, Liu Jin, Li Yang, Ma Shuangge
Department of Biostatistics, School of Public Health, Yale University.
Department of Biostatistics, School of Public Health, Yale University ; School of Statistics and Management, Shanghai University of Finance and Economics.
Wiley Interdiscip Rev Comput Stat. 2015 Jan-Feb;7(1):99-108. doi: 10.1002/wics.1322.
In the analysis of omics data, integrative analysis provides an effective way of pooling information across multiple datasets or multiple correlated responses, and can be more effective than single-dataset (response) analysis. Multiple families of integrative analysis methods have been proposed in the literature. The current review focuses on the penalization methods. Special attention is paid to sparse meta-analysis methods that pool summary statistics across datasets, and integrative analysis methods that pool raw data across datasets. We discuss their formulation and rationale. Beyond "standard" penalized selection, we also review contrasted penalization and Laplacian penalization which accommodate finer data structures. The computational aspects, including computational algorithms and tuning parameter selection, are examined. This review concludes with possible limitations and extensions.
在组学数据分析中,整合分析提供了一种跨多个数据集或多个相关响应汇总信息的有效方法,并且可能比单数据集(响应)分析更有效。文献中已经提出了多个整合分析方法家族。当前的综述聚焦于惩罚方法。特别关注跨数据集汇总汇总统计量的稀疏荟萃分析方法,以及跨数据集汇总原始数据的整合分析方法。我们讨论它们的公式和原理。除了“标准”惩罚选择之外,我们还综述了适应更精细数据结构的对比惩罚和拉普拉斯惩罚。研究了计算方面,包括计算算法和调优参数选择。本综述最后讨论了可能的局限性和扩展。