Price Thomas S, Regan Regina, Mott Richard, Hedman Asa, Honey Ben, Daniels Rachael J, Smith Lee, Greenfield Andy, Tiganescu Ana, Buckle Veronica, Ventress Nicki, Ayyub Helena, Salhan Anita, Pedraza-Diaz Susana, Broxholme John, Ragoussis Jiannis, Higgs Douglas R, Flint Jonathan, Knight Samantha J L
The Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Churchill Hospital Headington, Oxford OX3 7BN, UK.
Nucleic Acids Res. 2005 Jun 16;33(11):3455-64. doi: 10.1093/nar/gki643. Print 2005.
Comparative genome hybridization (CGH) to DNA microarrays (array CGH) is a technique capable of detecting deletions and duplications in genomes at high resolution. However, array CGH studies of the human genome noting false negative and false positive results using large insert clones as probes have raised important concerns regarding the suitability of this approach for clinical diagnostic applications. Here, we adapt the Smith-Waterman dynamic-programming algorithm to provide a sensitive and robust analytic approach (SW-ARRAY) for detecting copy-number changes in array CGH data. In a blind series of hybridizations to arrays consisting of the entire tiling path for the terminal 2 Mb of human chromosome 16p, the method identified all monosomies between 267 and 1567 kb with a high degree of statistical significance and accurately located the boundaries of deletions in the range 267-1052 kb. The approach is unique in offering both a nonparametric segmentation procedure and a nonparametric test of significance. It is scalable and well-suited to high resolution whole genome array CGH studies that use array probes derived from large insert clones as well as PCR products and oligonucleotides.
DNA微阵列比较基因组杂交(array CGH)是一种能够在高分辨率下检测基因组中缺失和重复的技术。然而,使用大插入片段克隆作为探针的人类基因组array CGH研究发现了假阴性和假阳性结果,这引发了人们对该方法在临床诊断应用中的适用性的重要担忧。在此,我们采用史密斯-沃特曼动态规划算法,提供一种灵敏且稳健的分析方法(SW-ARRAY),用于检测array CGH数据中的拷贝数变化。在一系列对由人类16号染色体短臂末端2 Mb的完整平铺路径组成的阵列进行的盲法杂交实验中,该方法以高度的统计学显著性识别出了267至1567 kb之间的所有单体,并准确确定了267至1052 kb范围内缺失的边界。该方法的独特之处在于它既提供了一种非参数分割程序,又提供了一种非参数显著性检验。它具有可扩展性,非常适合使用源自大插入片段克隆以及PCR产物和寡核苷酸的阵列探针进行的高分辨率全基因组array CGH研究。