通过同时进行偏差校正和读深度分段来提高拷贝数变异的检测。

Improving detection of copy-number variation by simultaneous bias correction and read-depth segmentation.

机构信息

Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599-7264, USA.

出版信息

Nucleic Acids Res. 2013 Feb 1;41(3):1519-32. doi: 10.1093/nar/gks1363. Epub 2012 Dec 28.

DOI:10.1093/nar/gks1363

PMID:23275535

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3561969/

Abstract

Structural variation is an important class of genetic variation in mammals. High-throughput sequencing (HTS) technologies promise to revolutionize copy-number variation (CNV) detection but present substantial analytic challenges. Converging evidence suggests that multiple types of CNV-informative data (e.g. read-depth, read-pair, split-read) need be considered, and that sophisticated methods are needed for more accurate CNV detection. We observed that various sources of experimental biases in HTS confound read-depth estimation, and note that bias correction has not been adequately addressed by existing methods. We present a novel read-depth-based method, GENSENG, which uses a hidden Markov model and negative binomial regression framework to identify regions of discrete copy-number changes while simultaneously accounting for the effects of multiple confounders. Based on extensive calibration using multiple HTS data sets, we conclude that our method outperforms existing read-depth-based CNV detection algorithms. The concept of simultaneous bias correction and CNV detection can serve as a basis for combining read-depth with other types of information such as read-pair or split-read in a single analysis. A user-friendly and computationally efficient implementation of our method is freely available.

摘要

结构变异是哺乳动物中一类重要的遗传变异。高通量测序（HTS）技术有望彻底改变拷贝数变异（CNV）的检测，但也带来了巨大的分析挑战。越来越多的证据表明，需要考虑多种类型的 CNV 信息数据（例如，读深度、读对、分读），并且需要更复杂的方法来进行更准确的 CNV 检测。我们观察到 HTS 中的各种实验偏差源会干扰读深度的估计，并注意到现有方法尚未充分解决偏差校正问题。我们提出了一种新的基于读深度的方法 GENSENG，它使用隐马尔可夫模型和负二项式回归框架来识别离散拷贝数变化的区域，同时考虑到多个混杂因素的影响。通过使用多个 HTS 数据集进行广泛的校准，我们得出结论，我们的方法优于现有的基于读深度的 CNV 检测算法。同时进行偏差校正和 CNV 检测的概念可以为在单个分析中结合读深度与其他类型的信息（如读对或分读）提供基础。我们的方法具有用户友好和计算高效的实现，可免费使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e4bc/3561969/7c832a1d87df/gks1363f1p.jpg

相似文献

Improving detection of copy-number variation by simultaneous bias correction and read-depth segmentation.

Nucleic Acids Res. 2013 Feb 1;41(3):1519-32. doi: 10.1093/nar/gks1363. Epub 2012 Dec 28.

A randomized approach to speed up the analysis of large-scale read-count data in the application of CNV detection.

BMC Bioinformatics. 2018 Mar 1;19(1):74. doi: 10.1186/s12859-018-2077-6.

Noise cancellation using total variation for copy number variation detection.

BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.

Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.

Nucleic Acids Res. 2015 Aug 18;43(14):e90. doi: 10.1093/nar/gkv319. Epub 2015 Apr 16.

Identification and utilization of copy number information for correcting Hi-C contact map of cancer cell lines.

BMC Bioinformatics. 2020 Nov 7;21(1):506. doi: 10.1186/s12859-020-03832-8.

Quantifying copy number variations using a hidden Markov model with inhomogeneous emission distributions.

Biostatistics. 2013 Jul;14(3):600-11. doi: 10.1093/biostatistics/kxt003. Epub 2013 Feb 20.

CNV-BAC: Copy number Variation Detection in Bacterial Circular Genome.

Bioinformatics. 2020 Jun 1;36(12):3890-3891. doi: 10.1093/bioinformatics/btaa208.

CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.

PLoS One. 2015 Aug 20;10(8):e0135895. doi: 10.1371/journal.pone.0135895. eCollection 2015.

SRBreak: A Read-Depth and Split-Read Framework to Identify Breakpoints of Different Events Inside Simple Copy-Number Variable Regions.

Front Genet. 2016 Sep 15;7:160. doi: 10.3389/fgene.2016.00160. eCollection 2016.

On the core segmentation algorithms of copy number variation detection tools.

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae022.

引用本文的文献

Comparative study of tools for copy number variation detection using next-generation sequencing data.

Sci Rep. 2025 Jul 1;15(1):22145. doi: 10.1038/s41598-025-06527-3.

Copy number variant detection using next-generation sequencing in EYS-associated retinitis pigmentosa.

PLoS One. 2024 Jun 24;19(6):e0305812. doi: 10.1371/journal.pone.0305812. eCollection 2024.

On the core segmentation algorithms of copy number variation detection tools.

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae022.

ARFID Genes and Environment (ARFID-GEN): study protocol.

BMC Psychiatry. 2023 Nov 21;23(1):863. doi: 10.1186/s12888-023-05266-x.

Novel deleterious splicing variant in HFM1 causes gametogenesis defect and recurrent implantation failure: concerning the risk of chromosomal abnormalities in embryos.

J Assist Reprod Genet. 2023 Jul;40(7):1689-1702. doi: 10.1007/s10815-023-02761-8. Epub 2023 Mar 3.

Comparison of day 5 blastocyst with day 6 blastocyst: Evidence from NGS-based PGT-A results.

J Assist Reprod Genet. 2022 Feb;39(2):369-377. doi: 10.1007/s10815-022-02397-0. Epub 2022 Jan 10.

Correspondence of aCGH and long-read genome assembly for detection of copy number differences: A proof-of-concept with cichlid genomes.

PLoS One. 2021 Oct 7;16(10):e0258193. doi: 10.1371/journal.pone.0258193. eCollection 2021.

Detection of chromosomal abnormalities in spontaneous miscarriage by low‑coverage next‑generation sequencing.

Mol Med Rep. 2020 Aug;22(2):1269-1276. doi: 10.3892/mmr.2020.11208. Epub 2020 Jun 3.

CNspector: a web-based tool for visualisation and clinical diagnosis of copy number variation from next generation sequencing.

Sci Rep. 2019 Apr 23;9(1):6426. doi: 10.1038/s41598-019-42858-8.

iCopyDAV: Integrated platform for copy number variations-Detection, annotation and visualization.

PLoS One. 2018 Apr 5;13(4):e0195334. doi: 10.1371/journal.pone.0195334. eCollection 2018.

本文引用的文献

An integrated map of genetic variation from 1,092 human genomes.

Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.

Copy number variation in the genomes of domestic animals.

Anim Genet. 2012 Oct;43(5):503-17. doi: 10.1111/j.1365-2052.2012.02317.x. Epub 2012 Mar 6.

CNVs: harbingers of a rare variant revolution in psychiatric genetics.

Cell. 2012 Mar 16;148(6):1223-41. doi: 10.1016/j.cell.2012.02.039.

Structural variation: the genome's hidden architecture.

Nat Methods. 2012 Jan 30;9(2):133-7. doi: 10.1038/nmeth.1858.

Repetitive DNA and next-generation sequencing: computational challenges and solutions.

Nat Rev Genet. 2011 Nov 29;13(1):36-46. doi: 10.1038/nrg3117.

Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion.

Proc Natl Acad Sci U S A. 2011 Nov 15;108(46):E1128-36. doi: 10.1073/pnas.1110574108. Epub 2011 Nov 7.

Sequence-based characterization of structural variation in the mouse genome.

Nature. 2011 Sep 14;477(7364):326-9. doi: 10.1038/nature10432.

Mouse genomic variation and its effect on phenotypes and gene regulation.

Nature. 2011 Sep 14;477(7364):289-94. doi: 10.1038/nature10413.

ZINBA integrates local covariates with DNA-seq data to identify broad and narrow regions of enrichment, even within amplified genomic regions.

Genome Biol. 2011 Jul 25;12(7):R67. doi: 10.1186/gb-2011-12-7-r67.

Subspecific origin and haplotype diversity in the laboratory mouse.

Nat Genet. 2011 May 29;43(7):648-55. doi: 10.1038/ng.847.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过同时进行偏差校正和读深度分段来提高拷贝数变异的检测。

Improving detection of copy-number variation by simultaneous bias correction and read-depth segmentation.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献