Optimizing copy number variation analysis using genome-wide short sequence oligonucleotide arrays.

Affiliation

Department of Pathology and Laboratory Medicine, Weill Cornell Medical College, NY 10065, USA.

Publication information

Nucleic Acids Res. 2010 Jun;38(10):3275-86. doi: 10.1093/nar/gkq073. Epub 2010 Feb 15.

DOI:10.1093/nar/gkq073

PMID:20156996

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2879534/

Abstract

The detection of copy number variants (CNV) by array-based platforms provides valuable insight into understanding human diversity. However, suboptimal study design and data processing negatively affect CNV assessment. We quantitatively evaluate their impact when short-sequence oligonucleotide arrays are applied (Affymetrix Genome-Wide Human SNP Array 6.0) by evaluating 42 HapMap samples for CNV detection. Several processing and segmentation strategies are implemented, and results are compared to CNV assessment obtained using an oligonucleotide array CGH platform designed to query CNVs at high resolution (Agilent). We quantitatively demonstrate that different reference models (e.g. single versus pooled sample reference) used to detect CNVs are a major source of inter-platform discrepancy (up to 30%) and that CNVs residing within segmental duplication regions (higher reference copy number) are significantly harder to detect (P < 0.0001). After adjusting Affymetrix data to mimic the Agilent experimental design (reference sample effect), we applied several common segmentation approaches and evaluated differential sensitivity and specificity for CNV detection, ranging 39-77% and 86-100% for non-segmental duplication regions, respectively, and 18-55% and 39-77% for segmental duplications. Our results are relevant to any array-based CNV study and provide guidelines to optimize performance based on study-specific objectives.
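
The reference sample effect described in the abstract can be illustrated with a small sketch. It is not the authors' pipeline: the intensities, sample names, and helper functions below are hypothetical, and the pooled reference is assumed to be a per-probe median across samples. The point is only that the same test sample can look unchanged against a pooled reference yet appear as a gain against a single reference sample that itself carries a loss.

```python
import math
from statistics import median


def log2_ratios(test, reference):
    """Per-probe log2(test / reference) for two equal-length intensity lists."""
    return [math.log2(t / r) for t, r in zip(test, reference)]


def pooled_reference(samples):
    """Per-probe median intensity across samples (assumed pooled reference model)."""
    return [median(probe_values) for probe_values in zip(*samples)]


# Toy intensities for three probes; all values are made up for illustration.
sample_a = [2.0, 2.0, 2.0]  # test sample, two copies at every probe
sample_b = [2.0, 1.0, 2.0]  # hypothetical one-copy loss at the second probe
sample_c = [2.0, 2.0, 2.0]

pooled = pooled_reference([sample_a, sample_b, sample_c])

# Against the pooled reference, sample_a shows no change at any probe.
vs_pooled = log2_ratios(sample_a, pooled)

# Against sample_b alone, the loss in the reference shows up as an
# apparent gain in sample_a at the second probe.
vs_single = log2_ratios(sample_a, sample_b)

print("vs pooled:", vs_pooled)
print("vs single:", vs_single)
```

In this toy case the apparent gain at the second probe comes entirely from the choice of reference, which is the kind of discrepancy the abstract attributes to differing reference models.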

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7bd/2879534/fc1de0cc3099/gkq073f1.jpg

Similar articles

Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans.

BMC Genomics. 2017 Apr 24;18(1):321. doi: 10.1186/s12864-017-3658-x.

Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.

PLoS One. 2011;6(11):e27859. doi: 10.1371/journal.pone.0027859. Epub 2011 Nov 30.

Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform.

BMC Bioinformatics. 2011 May 31;12:220. doi: 10.1186/1471-2105-12-220.

Evaluation of copy number variation detection for a SNP array platform.

BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50.

Exploiting sequence similarity to validate the sensitivity of SNP arrays in detecting fine-scaled copy number variations.

Bioinformatics. 2010 Apr 15;26(8):1007-14. doi: 10.1093/bioinformatics/btq088. Epub 2010 Feb 25.

Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays.

Genome Res. 2006 Dec;16(12):1575-84. doi: 10.1101/gr.5629106. Epub 2006 Nov 22.

Improved detection of global copy number variation using high density, non-polymorphic oligonucleotide probes.

BMC Genet. 2008 Mar 28;9:27. doi: 10.1186/1471-2156-9-27.

A remark on copy number variation detection methods.

PLoS One. 2018 Apr 27;13(4):e0196226. doi: 10.1371/journal.pone.0196226. eCollection 2018.

Use of Affymetrix Arrays in the Diagnosis of Gene Copy-Number Variation.

Curr Protoc Hum Genet. 2015 Apr 1;85:8.13.1-8.13.13. doi: 10.1002/0471142905.hg0813s85.

Cited by

Recurrent copy number variants associated with bronchopulmonary dysplasia.

Pediatr Res. 2016 Jun;79(6):940-5. doi: 10.1038/pr.2016.23. Epub 2016 Mar 14.

Variants at IRX4 as prostate cancer expression quantitative trait loci.

Eur J Hum Genet. 2014 Apr;22(4):558-63. doi: 10.1038/ejhg.2013.195. Epub 2013 Sep 11.

Current analysis platforms and methods for detecting copy number variation.

Physiol Genomics. 2013 Jan 7;45(1):1-16. doi: 10.1152/physiolgenomics.00082.2012. Epub 2012 Nov 6.

Identification of functionally active, low frequency copy number variants at 15q21.3 and 12q21.31 associated with prostate cancer risk.

Proc Natl Acad Sci U S A. 2012 Apr 24;109(17):6686-91. doi: 10.1073/pnas.1117405109. Epub 2012 Apr 10.

Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.

PLoS One. 2011;6(11):e27859. doi: 10.1371/journal.pone.0027859. Epub 2011 Nov 30.

Genomic analysis of circulating cell-free DNA infers breast cancer dormancy.

Genome Res. 2012 Feb;22(2):220-31. doi: 10.1101/gr.123497.111. Epub 2011 Oct 11.

SNP and gene networks construction and analysis from classification of copy number variations data.

BMC Bioinformatics. 2011;12 Suppl 5(Suppl 5):S4. doi: 10.1186/1471-2105-12-S5-S4. Epub 2011 Jul 27.

Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform.

BMC Bioinformatics. 2011 May 31;12:220. doi: 10.1186/1471-2105-12-220.

A computational framework discovers new copy number variants with functional importance.

PLoS One. 2011 Mar 29;6(3):e17539. doi: 10.1371/journal.pone.0017539.

Application of genetic/genomic approaches to allergic disorders.

J Allergy Clin Immunol. 2010 Sep;126(3):425-36; quiz 437-8. doi: 10.1016/j.jaci.2010.05.025. Epub 2010 Jul 16.
