Suppr超能文献

一种用于检测阵列 CGH 数据中 DNA 拷贝数变异的统计变化点模型方法。

A statistical change point model approach for the detection of DNA copy number variations in array CGH data.

机构信息

Department of Mathematics and Statistics, University of Missouri-Kansas City, 5100 Rockhill Road, Kansas City, MO 64110, USA.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):529-41. doi: 10.1109/TCBB.2008.129.

Abstract

Array comparative genomic hybridization (aCGH) provides a high-resolution and high-throughput technique for screening of copy number variations (CNVs) within the entire genome. This technique, compared to the conventional CGH, significantly improves the identification of chromosomal abnormalities. However, due to the random noise inherited in the imaging and hybridization process, identifying statistically significant DNA copy number changes in aCGH data is challenging. We propose a novel approach that uses the mean and variance change point model (MVCM) to detect CNVs or breakpoints in aCGH data sets. We derive an approximate p-value for the test statistic and also give the estimate of the locus of the DNA copy number change. We carry out simulation studies to evaluate the accuracy of the estimate and the p-value formulation. These simulation results show that the approach is effective in identifying copy number changes. The approach is also tested on fibroblast cancer cell line data, breast tumor cell line data, and breast cancer cell line aCGH data sets that are publicly available. Changes that have not been identified by the circular binary segmentation (CBS) method but are biologically verified are detected by our approach on these cell lines with higher sensitivity and specificity than CBS.

摘要

阵列比较基因组杂交(aCGH)为筛查整个基因组中的拷贝数变异(CNV)提供了一种高分辨率和高通量的技术。与传统的 CGH 相比,该技术显著提高了染色体异常的识别能力。然而,由于成像和杂交过程中固有的随机噪声,在 aCGH 数据中识别具有统计学意义的 DNA 拷贝数变化具有挑战性。我们提出了一种新的方法,该方法使用均值和方差变化点模型(MVCM)来检测 aCGH 数据集中的 CNV 或断点。我们推导出了检验统计量的近似 p 值,并给出了 DNA 拷贝数变化位置的估计。我们进行了模拟研究来评估估计和 p 值公式的准确性。这些模拟结果表明,该方法在识别拷贝数变化方面是有效的。该方法还在公开可用的成纤维癌细胞系数据、乳腺癌细胞系数据和乳腺癌细胞系 aCGH 数据集上进行了测试。在这些细胞系上,我们的方法能够检测到那些没有被圆形二进制分割(CBS)方法识别但具有生物学验证的变化,其敏感性和特异性均高于 CBS。

相似文献

5
Single-cell copy number variation detection.单细胞拷贝数变异检测。
Genome Biol. 2011 Aug 29;12(8):R80. doi: 10.1186/gb-2011-12-8-r80.

引用本文的文献

本文引用的文献

1
Bayesian Hidden Markov Modeling of Array CGH Data.阵列比较基因组杂交数据的贝叶斯隐马尔可夫模型
J Am Stat Assoc. 2008 Jun 1;103(482):485-497. doi: 10.1198/016214507000000923.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验