基于序列模型选择的分割方法用于检测DNA拷贝数变异。

Sequential model selection-based segmentation to detect DNA copy number variation.

作者信息

Hu Jianhua, Zhang Liwen, Wang Huixia Judy

机构信息

Department of Biostatistics, UT M. D. Anderson Cancer Center, Houston, Texas 77030, U.S.A..

School of Economics, Shanghai University, Shanghai 200444, China.

出版信息

Biometrics. 2016 Sep;72(3):815-26. doi: 10.1111/biom.12478. Epub 2016 Mar 8.

DOI:10.1111/biom.12478

PMID:26954760

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5018994/

Abstract

Array-based CGH experiments are designed to detect genomic aberrations or regions of DNA copy-number variation that are associated with an outcome, typically a state of disease. Most of the existing statistical methods target on detecting DNA copy number variations in a single sample or array. We focus on the detection of group effect variation, through simultaneous study of multiple samples from multiple groups. Rather than using direct segmentation or smoothing techniques, as commonly seen in existing detection methods, we develop a sequential model selection procedure that is guided by a modified Bayesian information criterion. This approach improves detection accuracy by accumulatively utilizing information across contiguous clones, and has computational advantage over the existing popular detection methods. Our empirical investigation suggests that the performance of the proposed method is superior to that of the existing detection methods, in particular, in detecting small segments or separating neighboring segments with differential degrees of copy-number variation.

摘要

基于芯片的比较基因组杂交（CGH）实验旨在检测与某种结果（通常是疾病状态）相关的基因组畸变或DNA拷贝数变异区域。现有的大多数统计方法旨在检测单个样本或芯片中的DNA拷贝数变异。我们专注于通过同时研究来自多个组的多个样本，检测组效应变异。与现有检测方法中常见的直接分割或平滑技术不同，我们开发了一种由改进的贝叶斯信息准则指导的顺序模型选择程序。这种方法通过累积利用相邻克隆的信息提高了检测准确性，并且在计算上比现有的流行检测方法具有优势。我们的实证研究表明，所提出的方法的性能优于现有检测方法，特别是在检测小片段或分离具有不同拷贝数变异程度的相邻片段方面。

相似文献

Sequential model selection-based segmentation to detect DNA copy number variation.

Biometrics. 2016 Sep;72(3):815-26. doi: 10.1111/biom.12478. Epub 2016 Mar 8.

Detecting copy number variations from array CGH data based on a conditional random field model.

J Bioinform Comput Biol. 2010 Apr;8(2):295-314. doi: 10.1142/s021972001000480x.

Heavy-Tailed Noise Suppression and Derivative Wavelet Scalogram for Detecting DNA Copy Number Aberrations.

IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1625-1635. doi: 10.1109/TCBB.2017.2723884. Epub 2017 Jul 6.

Joint segmentation, calling, and normalization of multiple CGH profiles.

Biostatistics. 2011 Jul;12(3):413-28. doi: 10.1093/biostatistics/kxq076. Epub 2011 Jan 5.

Identification of differential aberrations in multiple-sample array CGH studies.

Biometrics. 2011 Jun;67(2):353-62. doi: 10.1111/j.1541-0420.2010.01457.x. Epub 2010 Jul 9.

Fast Bayesian Inference of Copy Number Variants using Hidden Markov Models with Wavelet Compression.

PLoS Comput Biol. 2016 May 13;12(5):e1004871. doi: 10.1371/journal.pcbi.1004871. eCollection 2016 May.

Application of Nexus copy number software for CNV detection and analysis.

Curr Protoc Hum Genet. 2010 Apr;Chapter 4:Unit 4.14.1-28. doi: 10.1002/0471142905.hg0414s65.

Genomic profiling of myeloma: the best approach, a comparison of cytogenetics, FISH and array-CGH of 112 myeloma cases.

J Clin Pathol. 2016 Jan;69(1):82-6. doi: 10.1136/jclinpath-2015-203054. Epub 2015 Sep 3.

Modified screening and ranking algorithm for copy number variation detection.

Bioinformatics. 2015 May 1;31(9):1341-8. doi: 10.1093/bioinformatics/btu850. Epub 2014 Dec 25.

VEGA: variational segmentation for copy number detection.

Bioinformatics. 2010 Dec 15;26(24):3020-7. doi: 10.1093/bioinformatics/btq586. Epub 2010 Oct 19.

引用本文的文献

Robust group fused lasso for multisample copy number variation detection under uncertainty.

IET Syst Biol. 2016 Dec;10(6):229-236. doi: 10.1049/iet-syb.2015.0081.

本文引用的文献

Identification of regulatory SNPs associated with genetic modifications in lung adenocarcinoma.

BMC Res Notes. 2015 Mar 24;8:92. doi: 10.1186/s13104-015-1053-8.

Personalized identification of altered pathways in cancer using accumulated normal tissue data.

Bioinformatics. 2014 Sep 1;30(17):i422-9. doi: 10.1093/bioinformatics/btu449.

Genome-wide screening of cytogenetic abnormalities in multiple myeloma patients using array-CGH technique: a Czech multicenter experience.

Biomed Res Int. 2014;2014:209670. doi: 10.1155/2014/209670. Epub 2014 Jun 2.

An evaluation of copy number variation detection tools from whole-exome sequencing data.

Hum Mutat. 2014 Jul;35(7):899-907. doi: 10.1002/humu.22537. Epub 2014 May 1.

THE SCREENING AND RANKING ALGORITHM TO DETECT DNA COPY NUMBER VARIATIONS.

Ann Appl Stat. 2012 Sep;6(3):1306-1326. doi: 10.1214/12-AOAS539SUPP.

Simultaneous Discovery of Rare and Common Segment Variants.

Biometrika. 2013;100(1):157-172. doi: 10.1093/biomet/ass059.

Comparative studies of copy number variation detection methods for next-generation sequencing technologies.

PLoS One. 2013;8(3):e59128. doi: 10.1371/journal.pone.0059128. Epub 2013 Mar 20.

Detecting simultaneous changepoints in multiple sequences.

Biometrika. 2010 Sep;97(3):631-645. doi: 10.1093/biomet/asq025. Epub 2010 Jun 16.

Bayesian Hidden Markov Modeling of Array CGH Data.

J Am Stat Assoc. 2008 Jun 1;103(482):485-497. doi: 10.1198/016214507000000923.

Integrated analyses of copy number variations and gene expression in lung adenocarcinoma.

PLoS One. 2011;6(9):e24829. doi: 10.1371/journal.pone.0024829. Epub 2011 Sep 14.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于序列模型选择的分割方法用于检测DNA拷贝数变异。

Sequential model selection-based segmentation to detect DNA copy number variation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献