Hutz Janna E, Nelson Thomas, Wu Hua, McAllister Gregory, Moutsatsos Ioannis, Jaeger Savina A, Bandyopadhyay Somnath, Nigsch Florian, Cornett Ben, Jenkins Jeremy L, Selinger Douglas W
Novartis Institutes for Biomedical Research, Cambridge, MA 02139, USA.
J Biomol Screen. 2013 Apr;18(4):367-77. doi: 10.1177/1087057112469257. Epub 2012 Nov 29.
Screens using high-throughput, information-rich technologies such as microarrays, high-content screening (HCS), and next-generation sequencing (NGS) have become increasingly widespread. Compared with single-readout assays, these methods produce a more comprehensive picture of the effects of screened treatments. However, interpreting such multidimensional readouts is challenging. Univariate statistics such as t-tests and Z-factors cannot easily be applied to multidimensional profiles, leaving no obvious way to answer common screening questions such as "Is treatment X active in this assay?" and "Is treatment X different from (or equivalent to) treatment Y?" We have developed a simple, straightforward metric, the multidimensional perturbation value (mp-value), which can be used to answer these questions. Here, we demonstrate application of the mp-value to three data sets: a multiplexed gene expression screen of compounds and genomic reagents, a microarray-based gene expression screen of compounds, and an HCS compound screen. In all data sets, active treatments were successfully identified using the mp-value, and simulations and follow-up analyses supported the mp-value's statistical and biological validity. We believe the mp-value represents a promising way to simplify the analysis of multidimensional data while taking full advantage of its richness.
使用微阵列、高内涵筛选(HCS)和下一代测序(NGS)等高通量、信息丰富技术的筛选方法已越来越普遍。与单读数分析相比,这些方法能更全面地呈现被筛选处理的效果。然而,解读这种多维度读数具有挑战性。诸如t检验和Z因子等单变量统计方法不易应用于多维度数据,因而没有明显的方式来回答诸如“处理X在该分析中是否有效?”以及“处理X与处理Y不同(或等效)吗?”等常见筛选问题。我们开发了一种简单直接的指标——多维度扰动值(mp值),可用于回答这些问题。在此,我们展示了mp值在三个数据集上的应用:化合物和基因组试剂的多重基因表达筛选、基于微阵列的化合物基因表达筛选以及HCS化合物筛选。在所有数据集中,使用mp值成功识别出了活性处理,模拟和后续分析支持了mp值的统计和生物学有效性。我们认为mp值是一种很有前景的方法,可在充分利用多维度数据丰富性的同时简化其分析。