Olshen Adam B, Venkatraman E S, Lucito Robert, Wigler Michael
Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA.
Biostatistics. 2004 Oct;5(4):557-72. doi: 10.1093/biostatistics/kxh008.
DNA sequence copy number is the number of copies of DNA at a region of a genome. Cancer progression often involves alterations in DNA copy number. Newly developed microarray technologies enable simultaneous measurement of copy number at thousands of sites in a genome. We have developed a modification of binary segmentation, which we call circular binary segmentation, to translate noisy intensity measurements into regions of equal copy number. The method is evaluated by simulation and is demonstrated on cell line data with known copy number alterations and on a breast cancer cell line data set.
DNA序列拷贝数是基因组某一区域的DNA拷贝数量。癌症进展通常涉及DNA拷贝数的改变。新开发的微阵列技术能够同时测量基因组中数千个位点的拷贝数。我们对二元分割法进行了改进,称之为循环二元分割,以便将有噪声的强度测量值转化为拷贝数相等的区域。该方法通过模拟进行评估,并在已知拷贝数改变的细胞系数据以及一个乳腺癌细胞系数据集上得到了验证。