基于阵列的平台和调用算法的全面评估，用于检测拷贝数变异。

Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants.

机构信息

The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada.

出版信息

Nat Biotechnol. 2011 May 8;29(6):512-20. doi: 10.1038/nbt.1852.

DOI:10.1038/nbt.1852

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3270583/

Abstract

We have systematically compared copy number variant (CNV) detection on eleven microarrays to evaluate data quality and CNV calling, reproducibility, concordance across array platforms and laboratory sites, breakpoint accuracy and analysis tool variability. Different analytic tools applied to the same raw data typically yield CNV calls with <50% concordance. Moreover, reproducibility in replicate experiments is <70% for most platforms. Nevertheless, these findings should not preclude detection of large CNVs for clinical diagnostic purposes because large CNVs with poor reproducibility are found primarily in complex genomic regions and would typically be removed by standard clinical data curation. The striking differences between CNV calls from different platforms and analytic tools highlight the importance of careful assessment of experimental design in discovery and association studies and of strict data curation and filtering in diagnostics. The CNV resource presented here allows independent data evaluation and provides a means to benchmark new algorithms.

摘要

我们系统地比较了十一种微阵列上的拷贝数变异 (CNV) 检测，以评估数据质量和 CNV 调用、重现性、不同阵列平台和实验室之间的一致性、断点准确性以及分析工具的可变性。不同的分析工具应用于相同的原始数据通常会产生 <50%的 CNV 调用一致性。此外，大多数平台的重复实验的重现性<70%。然而，这些发现不应排除用于临床诊断目的的大 CNV 检测，因为具有低重现性的大 CNV 主要存在于复杂的基因组区域中，通常会被标准的临床数据管理所去除。来自不同平台和分析工具的 CNV 调用之间的显著差异突出表明，在发现和关联研究中，需要仔细评估实验设计，在诊断中需要严格的数据管理和过滤。这里提供的 CNV 资源允许对数据进行独立评估，并为基准测试新算法提供了一种手段。

相似文献

1

Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants.基于阵列的平台和调用算法的全面评估，用于检测拷贝数变异。

Nat Biotechnol. 2011 May 8;29(6):512-20. doi: 10.1038/nbt.1852.

2

Evaluation of copy number variation detection for a SNP array platform.SNP 芯片平台拷贝数变异检测评估。

BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50.

3

Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans.用于人类全基因组拷贝数变异（CNV）分析的高分辨率阵列平台的综合性能比较

BMC Genomics. 2017 Apr 24;18(1):321. doi: 10.1186/s12864-017-3658-x.

4

CNV Workshop: an integrated platform for high-throughput copy number variation discovery and clinical diagnostics.CNV 工作坊：一个用于高通量拷贝数变异发现和临床诊断的集成平台。

BMC Bioinformatics. 2010 Feb 4;11:74. doi: 10.1186/1471-2105-11-74.

5

Comparison of genome-wide array genomic hybridization platforms for the detection of copy number variants in idiopathic mental retardation.比较基因组范围的阵列基因组杂交平台在特发性智力障碍的拷贝数变异检测中的应用。

BMC Med Genomics. 2011 Mar 25;4:25. doi: 10.1186/1755-8794-4-25.

6

CNV-ROC: A cost effective, computer-aided analytical performance evaluator of chromosomal microarrays.CNV-ROC：一种具有成本效益的染色体微阵列计算机辅助分析性能评估工具。

J Biomed Inform. 2015 Apr;54:106-13. doi: 10.1016/j.jbi.2015.01.001. Epub 2015 Jan 13.

7

Accuracy of CNV Detection from GWAS Data.从 GWAS 数据中检测 CNV 的准确性。

PLoS One. 2011 Jan 13;6(1):e14511. doi: 10.1371/journal.pone.0014511.

8

Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.人类基因组拷贝数变异的全基因组图谱绘制：高分辨率阵列平台的比较分析。

PLoS One. 2011;6(11):e27859. doi: 10.1371/journal.pone.0027859. Epub 2011 Nov 30.

9

Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform.用于评估 Affymetrix 6.0 SNP 阵列平台的基因组拷贝数变异的软件比较。

BMC Bioinformatics. 2011 May 31;12:220. doi: 10.1186/1471-2105-12-220.

10

Systematic inference of copy-number genotypes from personal genome sequencing data reveals extensive olfactory receptor gene content diversity.系统推断个人基因组测序数据的拷贝数基因型揭示了广泛的嗅觉受体基因含量多样性。

PLoS Comput Biol. 2010 Nov 11;6(11):e1000988. doi: 10.1371/journal.pcbi.1000988.

引用本文的文献

1

Construction of a Genome-Wide Copy Number Variation Map and Association Analysis of Black Spot in Jujube.枣全基因组拷贝数变异图谱构建及黑斑病关联分析

Plants (Basel). 2025 Sep 5;14(17):2782. doi: 10.3390/plants14172782.

2

Chromosomal quality control in hPSCs: A practical guide to SNP array analysis with GenomeStudio.人多能干细胞中的染色体质量控制：使用GenomeStudio进行SNP阵列分析的实用指南。

Front Cell Dev Biol. 2025 Jul 1;13:1599923. doi: 10.3389/fcell.2025.1599923. eCollection 2025.

3

Mechanistic Evaluation of Thymoquinone Derivative-Induced Apoptosis in Human Glioblastoma Cells.对人胶质母细胞瘤细胞中百里醌衍生物诱导凋亡的机制评估

J Pharm Bioallied Sci. 2025 Jun;17(Suppl 2):S1313-S1315. doi: 10.4103/jpbs.jpbs_1967_24. Epub 2025 Jun 18.

4

MarkerMatch: A Proximity-Based Probe-Matching Algorithm for Joint Analysis of Copy-Number Variants from Different Genotyping Arrays.MarkerMatch：一种基于邻近性的探针匹配算法，用于联合分析来自不同基因分型阵列的拷贝数变异

bioRxiv. 2025 Jul 4:2025.06.30.662249. doi: 10.1101/2025.06.30.662249.

5

The genomic comparison between autochthonous and cosmopolitan cows reveals structural variants involved in environmental adaptation.本地牛和全球分布牛之间的基因组比较揭示了参与环境适应的结构变异。

Sci Rep. 2025 Jul 1;15(1):22280. doi: 10.1038/s41598-025-07165-5.

6

Genome-wide association meta-analysis and rare copy number variant analysis of treatment-resistant depression.难治性抑郁症的全基因组关联荟萃分析和罕见拷贝数变异分析

Mol Psychiatry. 2025 Jun 26. doi: 10.1038/s41380-025-03084-z.

7

Genome-Wide Analysis of Copy Number Variations in Three Populations of Nanyang Cattle Using Whole-Genome Resequencing.利用全基因组重测序技术对三个南阳牛群体拷贝数变异进行全基因组分析

Genes (Basel). 2025 May 12;16(5):568. doi: 10.3390/genes16050568.

8

The Eating Disorders Genetics Initiative 2 (EDGI2): study protocol.饮食失调遗传学倡议2（EDGI2）：研究方案。

BMC Psychiatry. 2025 May 26;25(1):532. doi: 10.1186/s12888-025-06777-5.

9

EMcnv: enhancing CNV detection performance through ensemble strategies with heterogeneous meta-graph neural networks.EMcnv：通过使用异构元图神经网络的集成策略提高拷贝数变异（CNV）检测性能。

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf135.

10

Rare copy number variant analysis in case-control studies using snp array data: a scalable and automated data analysis pipeline.基于 SNP 芯片数据的病例对照研究中的罕见拷贝数变异分析：一种可扩展和自动化的数据分析流程。

BMC Bioinformatics. 2024 Nov 15;25(1):357. doi: 10.1186/s12859-024-05979-0.

本文引用的文献

1

Mapping copy number variation by population-scale genome sequencing.通过群体规模的基因组测序来绘制拷贝数变异图谱。

Nature. 2011 Feb 3;470(7332):59-65. doi: 10.1038/nature09708.

2

A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。

Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.

3

Tackling the widespread and critical impact of batch effects in high-throughput data.解决高通量数据中广泛存在且极具影响力的批次效应问题。

Nat Rev Genet. 2010 Oct;11(10):733-9. doi: 10.1038/nrg2825. Epub 2010 Sep 14.

4

Functional impact of global rare copy number variation in autism spectrum disorders.自闭症谱系障碍中全球罕见拷贝数变异的功能影响。

Nature. 2010 Jul 15;466(7304):368-72. doi: 10.1038/nature09146. Epub 2010 Jun 9.

5

Towards a comprehensive structural variation map of an individual human genome.构建人类个体基因组的综合结构变异图谱。

Genome Biol. 2010;11(5):R52. doi: 10.1186/gb-2010-11-5-r52. Epub 2010 May 19.

6

Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies.共识声明：对于患有发育障碍或先天畸形的个体，染色体微阵列是一线临床诊断测试。

Am J Hum Genet. 2010 May 14;86(5):749-64. doi: 10.1016/j.ajhg.2010.04.006.

7

Mutation spectrum revealed by breakpoint sequencing of human germline CNVs.人类种系 CNV 断点测序揭示的突变谱。

Nat Genet. 2010 May;42(5):385-91. doi: 10.1038/ng.564. Epub 2010 Apr 4.

8

Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls.全基因组关联研究分析了 16000 例 8 种常见疾病和 3000 例共享对照的 CNVs。

Nature. 2010 Apr 1;464(7289):713-20. doi: 10.1038/nature08979.

9

Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays.七种基于单核苷酸多态性微阵列的拷贝数变异识别算法的比较分析。

Nucleic Acids Res. 2010 May;38(9):e105. doi: 10.1093/nar/gkq040. Epub 2010 Feb 8.

10

Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint library.使用BreakSeq和断点文库对结构变异进行核苷酸分辨率分析。

Nat Biotechnol. 2010 Jan;28(1):47-55. doi: 10.1038/nbt.1600. Epub 2009 Dec 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验