Faculty of Veterinary Science, The University of Sydney, Sydney, NSW 2006, Australia.
BMC Bioinformatics. 2013 Jul 22;14:231. doi: 10.1186/1471-2105-14-231.
In addition to probe sequence characteristics, noise in hybridization array data is thought to be influenced by competitive hybridization between probes tiled at high densities. Empirical evaluation of competitive hybridization and an estimation of what other non-sequence related features might affect noisy data is currently lacking.
A high density array was designed to a 1.5 megabase region of the canine genome to explore the potential for probe competition to introduce noise. Multivariate assessment of the influence of probe, segment and design characteristics on hybridization intensity demonstrate that whilst increased density significantly depresses fluorescence intensities, this effect is largely consistent when an ultra high density offset is applied. Signal variation not attributable to sequence composition resulted from the reduction in competition when large inter-probe spacing was introduced due to long repetitive elements and when a lower density offset was applied. Tiling of probes immediately adjacent to various classes of repeat elements did not generate noise. Comparison of identical probe sets hybridized with DNA extracted from blood or saliva establishes salivary DNA as a source of noise.
This analysis demonstrates the occurrence of competitive hybridization between oligonucleotide probes in high density tiling arrays. It supports that probe competition does not generate random noise when it is maintained across a region. To prevent the introduction of noise from this source, the degree of competition should be regulated by minimizing variation in density across the target region. This finding can make an important contribution to optimizing coverage whilst minimizing sources of noise in the design of high density tiling arrays.
除了探针序列特征外,杂交阵列数据中的噪声被认为受到高密探针间竞争杂交的影响。目前缺乏对竞争杂交的经验评估,以及对可能影响噪声数据的其他非序列相关特征的估计。
设计了一个高密度阵列来探索探针竞争引入噪声的潜力,该阵列覆盖了犬基因组的 150 万个碱基对。多变量评估探针、片段和设计特征对杂交强度的影响表明,尽管密度增加显著降低了荧光强度,但当应用超高密度偏移时,这种影响在很大程度上是一致的。当由于长重复元件引入较大的探针间间隔,或应用较低的密度偏移时,由于竞争减少导致信号变化与序列组成无关。紧邻各种重复元件的探针的平铺并未产生噪声。比较用血液或唾液提取的 DNA 杂交的相同探针集,确定唾液 DNA 是噪声的来源。
这项分析表明,在高密度平铺阵列中寡核苷酸探针之间存在竞争杂交。它支持当竞争在整个区域内保持一致时,不会产生随机噪声。为了防止这种来源引入噪声,可以通过最小化目标区域内密度的变化来调节竞争的程度。这一发现可以为优化覆盖范围做出重要贡献,同时最大限度地减少高密度平铺阵列设计中噪声的来源。