Department of Statistics, University of California, Berkeley, USA.
BMC Bioinformatics. 2010 May 12;11:245. doi: 10.1186/1471-2105-11-245.
High-throughput genotyping microarrays assess both total DNA copy number and allelic composition, which makes them a tool of choice for copy number studies in cancer, including total copy number and loss of heterozygosity (LOH) analyses. Even after state-of-the-art preprocessing, allelic signal estimates from genotyping arrays still suffer from systematic effects that make them difficult to use effectively in such downstream analyses.
We propose a method, TumorBoost, for normalizing the allelic estimates of one tumor sample based on estimates from a single matched normal sample. The method applies to paired tumor-normal estimates from any microarray-based technology, combined with any preprocessing method. We demonstrate that it increases the signal-to-noise ratio of allelic signals, making it significantly easier to detect allelic imbalances.
TumorBoost increases the power to detect somatic copy-number events (including copy-neutral LOH) in the tumor from allelic signals of Affymetrix or Illumina origin. We also conclude that high-precision allelic estimates can be obtained from a single pair of tumor-normal hybridizations, if TumorBoost is combined with single-array preprocessing methods such as (allele-specific) CRMA v2 for Affymetrix or BeadStudio's (proprietary) XY-normalization method for Illumina. A bounded-memory implementation is available in the open-source and cross-platform R package aroma.cn, which is part of the Aroma Project (http://www.aroma-project.org/).
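The abstract does not state the normalization formula, so the following is only a minimal sketch of one plausible form of a tumor-normal correction of allelic estimates, not the aroma.cn implementation. It assumes that each SNP has an allele B fraction in [0, 1] for the tumor and for the matched normal, that the normal's genotype can be called by simple thresholding (the cutoffs 1/3 and 2/3 are hypothetical), and that the systematic effect at a SNP is estimated as the deviation of the normal's value from its expected genotype value (0, 1/2, or 1). The function names `call_naive_genotype` and `tumorboost_sketch` are illustrative and do not come from the paper or the package.

```python
def call_naive_genotype(beta_n, low=1 / 3, high=2 / 3):
    """Return the expected allele B fraction for the normal's genotype.

    Assumption: genotypes are called by fixed thresholds on the normal's
    allele B fraction (AA -> 0.0, AB -> 0.5, BB -> 1.0). The cutoffs are
    hypothetical and chosen only for illustration.
    """
    if beta_n < low:
        return 0.0
    if beta_n > high:
        return 1.0
    return 0.5


def tumorboost_sketch(beta_t, beta_n):
    """Correct tumor allele B fractions using the matched normal, SNP by SNP.

    Assumption: the SNP-specific systematic effect is estimated as the
    difference between the normal's observed value and its expected
    genotype value, and is subtracted from the tumor's value.
    """
    if len(beta_t) != len(beta_n):
        raise ValueError("tumor and normal vectors must have equal length")
    normalized = []
    for bt, bn in zip(beta_t, beta_n):
        delta = bn - call_naive_genotype(bn)  # estimated SNP effect in the normal
        normalized.append(bt - delta)         # remove the same effect from the tumor
    return normalized


if __name__ == "__main__":
    # Made-up values for three SNPs (AA, AB, BB in the normal).
    tumor = [0.08, 0.70, 0.95]
    normal = [0.05, 0.55, 0.93]
    print(tumorboost_sketch(tumor, normal))
```

Because each SNP is corrected independently using only the two paired values, a computation of this shape can be streamed over the genome in chunks, which is consistent with the bounded-memory property mentioned above.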