Suppr超能文献

Affymetrix 和 Illumina 基因表达微阵列实验中系统噪声关键来源的相对影响。

Relative impact of key sources of systematic noise in Affymetrix and Illumina gene-expression microarray experiments.

机构信息

Applied Bioinformatics of Cancer Group, Breakthrough Breast Cancer Research Unit, Institute of Genetics and Molecular Medicine, Crewe Road South, Edinburgh, Edinburgh, EH4 2XR, UK.

出版信息

BMC Genomics. 2011 Dec 1;12:589. doi: 10.1186/1471-2164-12-589.

Abstract

BACKGROUND

Systematic processing noise, which includes batch effects, is very common in microarray experiments but is often ignored despite its potential to confound or compromise experimental results. Compromised results are most likely when re-analysing or integrating datasets from public repositories due to the different conditions under which each dataset is generated. To better understand the relative noise-contributions of various factors in experimental-design, we assessed several Illumina and Affymetrix datasets for technical variation between replicate hybridisations of Universal Human Reference (UHRR) and individual or pooled breast-tumour RNA.

RESULTS

A varying degree of systematic noise was observed in each of the datasets, however in all cases the relative amount of variation between standard control RNA replicates was found to be greatest at earlier points in the sample-preparation workflow. For example, 40.6% of the total variation in reported expressions were attributed to replicate extractions, compared to 13.9% due to amplification/labelling and 10.8% between replicate hybridisations. Deliberate probe-wise batch-correction methods were effective in reducing the magnitude of this variation, although the level of improvement was dependent on the sources of noise included in the model. Systematic noise introduced at the chip, run, and experiment levels of a combined Illumina dataset were found to be highly dependent upon the experimental design. Both UHRR and pools of RNA, which were derived from the samples of interest, modelled technical variation well although the pools were significantly better correlated (4% average improvement) and better emulated the effects of systematic noise, over all probes, than the UHRRs. The effect of this noise was not uniform over all probes, with low GC-content probes found to be more vulnerable to batch variation than probes with a higher GC-content.

CONCLUSIONS

The magnitude of systematic processing noise in a microarray experiment is variable across probes and experiments, however it is generally the case that procedures earlier in the sample-preparation workflow are liable to introduce the most noise. Careful experimental design is important to protect against noise, detailed meta-data should always be provided, and diagnostic procedures should be routinely performed prior to downstream analyses for the detection of bias in microarray studies.

摘要

背景

系统处理噪声,包括批次效应,在微阵列实验中非常常见,但由于其可能混淆或损害实验结果,通常被忽略。由于每个数据集生成的条件不同,因此在重新分析或整合来自公共存储库的数据时,最有可能出现受损的结果。为了更好地了解实验设计中各种因素的相对噪声贡献,我们评估了几个 Illumina 和 Affymetrix 数据集,以了解通用人类参考(UHRR)和个体或混合乳腺癌 RNA 的重复杂交之间的技术变异。

结果

在每个数据集都观察到了不同程度的系统噪声,但是在所有情况下,在样品制备工作流程的早期阶段,标准对照 RNA 重复之间的变化量被发现是最大的。例如,在报告的表达中,40.6%的总变化归因于重复提取,而 13.9%归因于扩增/标记,10.8%归因于重复杂交。故意进行探针级别的批量校正方法可以有效降低这种变化的幅度,尽管改进的程度取决于模型中包含的噪声源。在综合 Illumina 数据集的芯片、运行和实验水平上引入的系统噪声高度依赖于实验设计。UHRR 和从感兴趣的样本中衍生出的 RNA 池都很好地模拟了技术变化,尽管 RNA 池的相关性更好(平均提高了 4%),并且比 UHRR 更能模拟系统噪声的影响,在所有探针上都更好地模拟了系统噪声的影响。这种噪声的影响并不是在所有探针上都是均匀的,低 GC 含量的探针比高 GC 含量的探针更容易受到批次变化的影响。

结论

微阵列实验中的系统处理噪声的幅度在探针和实验之间是可变的,但是通常情况下,样品制备工作流程早期的步骤更容易引入噪声。仔细的实验设计对于防止噪声很重要,应该始终提供详细的元数据,并且应该在下游分析之前例行执行诊断程序,以检测微阵列研究中的偏差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/360f/3269440/207a1feb3266/1471-2164-12-589-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验