Zhang Li, Wu Chunlei, Carta Roberto, Zhao Haitao
Department of Biostatistics and Applied Mathematics, The University of Texas M. D. Anderson Cancer Center, 1515 Holcombe Boulevard, Unit 237, Houston, TX 77030, USA.
Nucleic Acids Res. 2007;35(3):e18. doi: 10.1093/nar/gkl1064. Epub 2006 Dec 14.
DNA/DNA duplex formation is the basic mechanism that is used in genome tiling arrays and SNP arrays manufactured by Affymetrix. However, detailed knowledge of the physical process is still lacking. In this study, we show a free energy analysis of DNA/DNA duplex formation these arrays based on the positional-dependent nearest-neighbor (PDNN) model, which was developed previously for describing DNA/RNA duplex formation on expression microarrays. Our results showed that the two ends of a probe contribute less to the stability of the duplexes and that there is a microarray surface effect on binding affinities. We also showed that free energy cost of a single mismatch depends on the bases adjacent to the mismatch site and obtained a comprehensive table of the cost of a single mismatch under all possible combination of adjacent bases. The mismatch costs were found to be correlated with those determined in aqueous solution. We further demonstrate that the DNA copy number estimated from the SNP array correlates negatively with the target length; this is presumably caused by inefficient PCR amplification for long fragments. These results provide important insights into the molecular mechanisms of microarray technology and have implications for microarray design and the interpretation of observed data.
DNA/DNA双链体形成是Affymetrix公司生产的基因组平铺阵列和SNP阵列所采用的基本机制。然而,对这一物理过程的详细了解仍然不足。在本研究中,我们基于位置依赖最近邻(PDNN)模型对这些阵列的DNA/DNA双链体形成进行了自由能分析,该模型是先前为描述表达微阵列上的DNA/RNA双链体形成而开发的。我们的结果表明,探针的两端对双链体稳定性的贡献较小,并且微阵列表面对结合亲和力有影响。我们还表明,单个错配的自由能成本取决于错配位点相邻的碱基,并获得了在相邻碱基的所有可能组合下单个错配成本的综合表格。发现错配成本与在水溶液中测定的成本相关。我们进一步证明,从SNP阵列估计的DNA拷贝数与靶标长度呈负相关;这可能是由于长片段的PCR扩增效率低下所致。这些结果为微阵列技术的分子机制提供了重要见解,并对微阵列设计和观测数据的解释具有启示意义。