Bruun Georg M, Wernersson Rasmus, Juncker Agnieszka S, Willenbrock Hanni, Nielsen Henrik Bjørn
Niels Bohr Institute, Blegdamsvej 17, Copenhagen, Denmark.
Nucleic Acids Res. 2007;35(7):e48. doi: 10.1093/nar/gkl1098. Epub 2007 Mar 2.
Signals from different oligonucleotide probes against the same target show great variation in intensities. However, detection of differences along a sequence e.g. to reveal intron/exon architecture, transcription boundary as well as simple absent/present calls depends on comparisons between different probes. It is therefore of great interest to correct for the variation between probes. Much of this variation is sequence dependent. We demonstrate that a thermodynamic model for hybridization of either DNA or RNA to a DNA microarray, which takes the sequence-dependent probe affinities into account significantly reduces the signal fluctuation between probes targeting the same gene transcript. For a test set of tightly tiled yeast genes, the model reduces the variance by up to a factor approximately 1/3. As a consequence of this reduction, the model is shown to yield a more accurate determination of transcription start sites for a subset of yeast genes. In another application, we identify present/absent calls for probes hybridized to the sequenced Escherichia coli strain O157:H7 EDL933. The model improves the correct calls from 85 to 95% relative to raw intensity measures. The model thus makes applications which depend on comparisons between probes aimed at different sections of the same target more reliable.
针对同一靶标的不同寡核苷酸探针发出的信号强度差异很大。然而,检测序列上的差异(例如揭示内含子/外显子结构、转录边界以及简单的存在/缺失判断)取决于不同探针之间的比较。因此,校正探针之间的差异非常重要。这种差异大多取决于序列。我们证明,一种考虑了序列依赖性探针亲和力的DNA或RNA与DNA微阵列杂交的热力学模型,能显著减少靶向同一基因转录本的探针之间的信号波动。对于一组紧密排列的酵母基因测试集,该模型可将方差降低至约1/3倍。由于这种降低,该模型被证明能更准确地确定一部分酵母基因的转录起始位点。在另一个应用中,我们确定了与测序的大肠杆菌O157:H7 EDL933菌株杂交的探针的存在/缺失情况。相对于原始强度测量,该模型将正确判断率从85%提高到了95%。因此,该模型使依赖于针对同一靶标不同部分的探针之间比较的应用更加可靠。