Center for Public Health and Environmental Assessment, Office of Research and Development, US EPA, Washington DC, United States of America.
Division of the National Toxicology Program, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, Durham, NC, United States of America.
PLoS One. 2020 May 15;15(5):e0232955. doi: 10.1371/journal.pone.0232955. eCollection 2020.
Whole-genome expression data generated by microarray studies have shown promise for quantitative human health risk assessment. While numerous approaches have been developed to determine benchmark doses (BMDs) from probeset-level dose responses, sensitivity of the results to methods used for normalization of the data has not yet been systematically investigated. Normalization of microarray data converts raw hybridization signals to expression estimates that are expected to be proportional to the amounts of transcripts in the profiled specimens. Different approaches to normalization have been shown to greatly influence the results of some downstream analyses, including biological interpretation. In this study we evaluate the influence of microarray normalization methods on the transcriptomic BMDs. We demonstrate using in vivo data that the use of alternative pipelines for normalization of Affymetrix microarray data can have a considerable impact on the number of detected differentially expressed genes and pathways (processes) determined to be treatment responsive, which may lead to alternative interpretations of the data. In addition, we found that normalization can have a considerable effect (as much as ~30-fold in this study) on estimation of the minimum biological potency (transcriptomic point of departure). We argue for consideration of alternative normalization methods and their data-informed selection to most effectively interpret microarray data for use in human health risk assessment.
基于微阵列研究生成的全基因组表达数据在定量人类健康风险评估方面显示出了一定的前景。虽然已经开发了许多方法来从探针集剂量反应中确定基准剂量 (BMD),但数据归一化方法的敏感性尚未得到系统研究。微阵列数据的归一化将原始杂交信号转换为表达估计值,这些估计值预计与所分析标本中转录物的量成正比。不同的归一化方法已被证明会极大地影响某些下游分析的结果,包括生物学解释。在这项研究中,我们评估了微阵列归一化方法对转录组 BMD 的影响。我们使用体内数据证明,替代管道用于 Affymetrix 微阵列数据的归一化会对检测到的差异表达基因和途径(过程)的数量产生重大影响,这些基因和途径被确定为对治疗有反应,这可能导致对数据的不同解释。此外,我们发现归一化可以对最小生物效力(转录组起点)的估计产生相当大的影响(在本研究中高达约 30 倍)。我们认为需要考虑替代的归一化方法,并根据数据选择最有效地解释用于人类健康风险评估的微阵列数据。