Population Health Studies Division, Environmental Health Science and Research Bureau, Health Canada, Ottawa, K1A 0K9, Canada.
BMC Bioinformatics. 2010 Mar 12;11:127. doi: 10.1186/1471-2105-11-127.
Maximizing the utility of DNA microarray data requires optimization of data acquisition through selection of an appropriate scanner setting. To increase the amount of useable data, several approaches have been proposed that incorporate multiple scans at different sensitivities to reduce the quantification error and to minimize effects of saturation. However, no direct comparison of their efficacy has been made. In the present study we compared individual scans at low, medium and high sensitivity with three methods for combining data from multiple scans (either 2-scan or 3-scan cases) using an actual dataset comprising 40 technical replicates of a reference RNA standard.
Of the individual scans, the low scan exhibited the lowest background signal, the highest signal-to-noise ratio, and equivalent reproducibility to the medium and high scans. Most multiple scan approaches increased the range of probe intensities compared to the individual scans, but did not increase the dynamic range (the proportion of useable data). Approaches displayed striking differences in the background signal and signal-to-noise ratio. However, increased probe intensity range and improved signal-to-noise ratios did not necessarily correlate with improved reproducibility. Importantly, for one multiple scan method that combined 3 scans, reproducibility was significantly improved relative to individual scans and all other multiple scan approaches. The same method using 2 scans yielded significantly lower reproducibility, attributable to a lack-of-fit of the statistical model.
Our data indicate that implementation of a suitable multiple scan approach can improve reproducibility, but that model validation is critical to ensure accurate estimates of probe intensity.
通过选择适当的扫描仪设置来优化数据采集,从而最大限度地提高 DNA 微阵列数据的效用。为了增加可用数据的数量,已经提出了几种方法,这些方法结合了多个不同灵敏度的扫描,以减少定量误差并最小化饱和效应。然而,尚未对其效果进行直接比较。在本研究中,我们使用包含 40 个参考 RNA 标准技术重复的实际数据集,将低、中、高灵敏度的单个扫描与三种用于组合多个扫描数据的方法(2 扫描或 3 扫描情况)进行了比较。
在单个扫描中,低扫描显示出最低的背景信号、最高的信噪比,并且与中、高扫描的重现性相当。与单个扫描相比,大多数多扫描方法都增加了探针强度的范围,但并没有增加动态范围(可用数据的比例)。各种方法在背景信号和信噪比方面存在显著差异。然而,增加探针强度范围和提高信噪比并不一定与改善重现性相关。重要的是,对于一种将 3 个扫描组合的多扫描方法,与单个扫描和所有其他多扫描方法相比,重现性得到了显著改善。使用 2 个扫描的相同方法的重现性显著降低,这归因于统计模型的不拟合。
我们的数据表明,实施适当的多扫描方法可以提高重现性,但模型验证对于确保探针强度的准确估计至关重要。