相似文献

1

A novel approach to DNA copy number data segmentation.

J Bioinform Comput Biol. 2011 Feb;9(1):131-48. doi: 10.1142/s0219720011005343.

2

Quantifying copy number variations using a hidden Markov model with inhomogeneous emission distributions.

Biostatistics. 2013 Jul;14(3):600-11. doi: 10.1093/biostatistics/kxt003. Epub 2013 Feb 20.

3

Detecting copy number variations from array CGH data based on a conditional random field model.

J Bioinform Comput Biol. 2010 Apr;8(2):295-314. doi: 10.1142/s021972001000480x.

4

CLImAT-HET: detecting subclonal copy number alterations and loss of heterozygosity in heterogeneous tumor samples from whole-genome sequencing data.

BMC Med Genomics. 2017 Mar 15;10(1):15. doi: 10.1186/s12920-017-0255-4.

5

A statistical approach for array CGH data analysis.

BMC Bioinformatics. 2005 Feb 11;6:27. doi: 10.1186/1471-2105-6-27.

6

CGHPRO -- a comprehensive data analysis tool for array CGH.

BMC Bioinformatics. 2005 Apr 5;6:85. doi: 10.1186/1471-2105-6-85.

7

A Hidden Markov Model to estimate population mixture and allelic copy-numbers in cancers using Affymetrix SNP arrays.

BMC Bioinformatics. 2007 Nov 9;8:434. doi: 10.1186/1471-2105-8-434.

8

A novel stationary wavelet denoising algorithm for array-based DNA Copy Number data.

Int J Bioinform Res Appl. 2007;3(2):206-22. doi: 10.1504/IJBRA.2007.013603.

9

biomvRhsmm: genomic segmentation with hidden semi-Markov model.

Biomed Res Int. 2014;2014:910390. doi: 10.1155/2014/910390. Epub 2014 Jun 3.

10

Circular binary segmentation for the analysis of array-based DNA copy number data.

Biostatistics. 2004 Oct;5(4):557-72. doi: 10.1093/biostatistics/kxh008.

引用本文的文献

1

A multiple genomic data fused SF2 prediction model, signature identification, and gene regulatory network inference for personalized radiotherapy.

Technol Cancer Res Treat. 2020 Jan-Dec;19:1533033820909112. doi: 10.1177/1533033820909112.

本文引用的文献

1

Array comparative genomic hybridization-based characterization of genetic alterations in pulmonary neuroendocrine tumors.

Proc Natl Acad Sci U S A. 2010 Jul 20;107(29):13040-5. doi: 10.1073/pnas.1008132107. Epub 2010 Jul 6.

2

Ultrasome: efficient aberration caller for copy number studies of ultra-high resolution.

Bioinformatics. 2009 Apr 15;25(8):1078-9. doi: 10.1093/bioinformatics/btp091. Epub 2009 Feb 19.

3

Sequencing the full-length of the phosphatase and tensin homolog (PTEN) gene in hepatocellular carcinoma (HCC) using the 454 GS20 and Illumina GA DNA sequencing platforms.

World J Surg. 2009 Apr;33(4):647-52. doi: 10.1007/s00268-008-9852-x.

4

Sequence context affects the rate of short insertions and deletions in flies and primates.

Genome Biol. 2008;9(2):R37. doi: 10.1186/gb-2008-9-2-r37. Epub 2008 Feb 21.

5

A novel stationary wavelet denoising algorithm for array-based DNA Copy Number data.

Int J Bioinform Res Appl. 2007;3(2):206-22. doi: 10.1504/IJBRA.2007.013603.

6

Integrating copy number polymorphisms into array CGH analysis using a robust HMM.

Bioinformatics. 2006 Jul 15;22(14):e431-9. doi: 10.1093/bioinformatics/btl238.

7

BioHMM: a heterogeneous hidden Markov model for segmenting array CGH data.

Bioinformatics. 2006 May 1;22(9):1144-6. doi: 10.1093/bioinformatics/btl089. Epub 2006 Mar 13.

8

A comparison study: applying segmentation to array CGH data for downstream analyses.

Bioinformatics. 2005 Nov 15;21(22):4084-91. doi: 10.1093/bioinformatics/bti677. Epub 2005 Sep 13.

9

Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data.

Bioinformatics. 2005 Oct 1;21(19):3763-70. doi: 10.1093/bioinformatics/bti611. Epub 2005 Aug 4.

10

Array comparative genomic hybridization and its applications in cancer.

Nat Genet. 2005 Jun;37 Suppl:S11-7. doi: 10.1038/ng1569.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

文档翻译

学术文献翻译模型，支持多种主流文档格式。

一种用于DNA拷贝数数据分割的新方法。

A novel approach to DNA copy number data segmentation.

作者信息

Wang Siling, Wang Yuhang, Xie Yang, Xiao Guanghua

机构信息

Department of Computer Science and Engineering, Southern Methodist University, Dallas, Texas 75205, USA.

出版信息

J Bioinform Comput Biol. 2011 Feb;9(1):131-48. doi: 10.1142/s0219720011005343.

DOI:10.1142/s0219720011005343

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3084615/

Abstract

DNA copy number (DCN) is the number of copies of DNA at a region of a genome. The alterations of DCN are highly associated with the development of different tumors. Recently, microarray technologies are being employed to detect DCN changes at many loci at the same time in tumor samples. The resulting DCN data are often very noisy, and the tumor sample is often contaminated by normal cells. The goal of computational analysis of array-based DCN data is to infer the underlying DCNs from raw DCN data. Previous methods for this task do not model the tumor/normal cell mixture ratio explicitly and they cannot output segments with DCN annotations. We developed a novel model-based method using the minimum description length (MDL) principle for DCN data segmentation. Our new method can output underlying DCN for each chromosomal segment, and at the same time, infer the underlying tumor proportion in the test samples. Empirical results show that our method achieves better accuracies on average as compared to three previous methods, namely Circular Binary Segmentation, Hidden Markov Model and Ultrasome.

摘要

DNA拷贝数（DCN）是基因组某一区域的DNA拷贝数量。DCN的改变与不同肿瘤的发生高度相关。最近，微阵列技术被用于同时检测肿瘤样本中多个位点的DCN变化。由此产生的DCN数据往往噪声很大，并且肿瘤样本常常被正常细胞污染。基于阵列的DCN数据的计算分析目标是从原始DCN数据中推断潜在的DCN。以前用于此任务的方法没有明确对肿瘤/正常细胞混合比例进行建模，并且它们无法输出带有DCN注释的片段。我们开发了一种基于最小描述长度（MDL）原则的新型基于模型的方法用于DCN数据分割。我们的新方法可以输出每个染色体片段的潜在DCN，同时推断测试样本中的潜在肿瘤比例。实证结果表明，与之前的三种方法（即循环二元分割、隐马尔可夫模型和Ultrasome）相比，我们的方法平均实现了更高的准确率。