• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

算法对拷贝数变异检测的影响。

The effect of algorithms on copy number variant detection.

机构信息

Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, Washington, United States of America.

出版信息

PLoS One. 2010 Dec 30;5(12):e14456. doi: 10.1371/journal.pone.0014456.

DOI:10.1371/journal.pone.0014456
PMID:21209939
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3012691/
Abstract

BACKGROUND

The detection of copy number variants (CNVs) and the results of CNV-disease association studies rely on how CNVs are defined, and because array-based technologies can only infer CNVs, CNV-calling algorithms can produce vastly different findings. Several authors have noted the large-scale variability between CNV-detection methods, as well as the substantial false positive and false negative rates associated with those methods. In this study, we use variations of four common algorithms for CNV detection (PennCNV, QuantiSNP, HMMSeg, and cnvPartition) and two definitions of overlap (any overlap and an overlap of at least 40% of the smaller CNV) to illustrate the effects of varying algorithms and definitions of overlap on CNV discovery.

METHODOLOGY AND PRINCIPAL FINDINGS

We used a 56 K Illumina genotyping array enriched for CNV regions to generate hybridization intensities and allele frequencies for 48 Caucasian schizophrenia cases and 48 age-, ethnicity-, and gender-matched control subjects. No algorithm found a difference in CNV burden between the two groups. However, the total number of CNVs called ranged from 102 to 3,765 across algorithms. The mean CNV size ranged from 46 kb to 787 kb, and the average number of CNVs per subject ranged from 1 to 39. The number of novel CNVs not previously reported in normal subjects ranged from 0 to 212.

CONCLUSIONS AND SIGNIFICANCE

Motivated by the availability of multiple publicly available genome-wide SNP arrays, investigators are conducting numerous analyses to identify putative additional CNVs in complex genetic disorders. However, the number of CNVs identified in array-based studies, and whether these CNVs are novel or valid, will depend on the algorithm(s) used. Thus, given the variety of methods used, there will be many false positives and false negatives. Both guidelines for the identification of CNVs inferred from high-density arrays and the establishment of a gold standard for validation of CNVs are needed.

摘要

背景

拷贝数变异(CNV)的检测和 CNV 与疾病关联研究的结果依赖于 CNV 的定义方式,由于基于阵列的技术只能推断 CNV,因此 CNV 调用算法可能会产生大相径庭的结果。几位作者已经注意到 CNV 检测方法之间存在大规模的可变性,以及这些方法相关的大量假阳性和假阴性率。在这项研究中,我们使用了四种常见的 CNV 检测算法(PennCNV、QuantiSNP、HMMSeg 和 cnvPartition)的变体以及两种重叠定义(任何重叠和至少 40%较小 CNV 的重叠)来说明不同算法和重叠定义对 CNV 发现的影响。

方法和主要发现

我们使用经过 CNV 区域富集的 56 K Illumina 基因分型阵列来生成 48 例白种人精神分裂症病例和 48 例年龄、种族和性别匹配的对照个体的杂交强度和等位基因频率。没有一种算法在两组之间发现 CNV 负担的差异。然而,各种算法之间调用的 CNV 总数从 102 到 3765 不等。CNV 的平均大小范围从 46 kb 到 787 kb,每个个体的平均 CNV 数量从 1 到 39 不等。新发现的以前在正常个体中未报道的 CNV 数量从 0 到 212 不等。

结论和意义

受多种可用的全基因组 SNP 阵列的启发,研究人员正在进行大量分析以确定复杂遗传疾病中的潜在额外 CNV。然而,基于阵列的研究中识别的 CNV 数量,以及这些 CNV 是否是新的或有效的,将取决于使用的算法。因此,鉴于使用的方法种类繁多,将会有许多假阳性和假阴性。需要为从高密度阵列推断的 CNV 识别制定指南和建立 CNV 验证的金标准。

相似文献

1
The effect of algorithms on copy number variant detection.算法对拷贝数变异检测的影响。
PLoS One. 2010 Dec 30;5(12):e14456. doi: 10.1371/journal.pone.0014456.
2
Rare CNVs in Suicide Attempt include Schizophrenia-Associated Loci and Neurodevelopmental Genes: A Pilot Genome-Wide and Family-Based Study.自杀未遂中的罕见拷贝数变异包括精神分裂症相关基因座和神经发育基因:一项全基因组和基于家系的初步研究。
PLoS One. 2016 Dec 28;11(12):e0168531. doi: 10.1371/journal.pone.0168531. eCollection 2016.
3
Evaluation of copy number variation detection for a SNP array platform.SNP 芯片平台拷贝数变异检测评估。
BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50.
4
Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle.使用高密度或中密度单核苷酸多态性基因分型面板检测到的拷贝数变异与从牛侧翼高密度单核苷酸多态性单倍型推断拷贝数变异的一致性。
BMC Genomics. 2020 Mar 4;21(1):205. doi: 10.1186/s12864-020-6627-8.
5
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
6
Accuracy of CNV Detection from GWAS Data.从 GWAS 数据中检测 CNV 的准确性。
PLoS One. 2011 Jan 13;6(1):e14511. doi: 10.1371/journal.pone.0014511.
7
Effect of Combining Multiple CNV Defining Algorithms on the Reliability of CNV Calls from SNP Genotyping Data.多种拷贝数变异(CNV)定义算法相结合对基于单核苷酸多态性(SNP)基因分型数据的CNV检测可靠性的影响
Genomics Inform. 2012 Sep;10(3):194-9. doi: 10.5808/GI.2012.10.3.194. Epub 2012 Sep 28.
8
Assessing the reproducibility of exome copy number variations predictions.评估外显子拷贝数变异预测的可重复性。
Genome Med. 2016 Aug 8;8(1):82. doi: 10.1186/s13073-016-0336-6.
9
Genome-wide identification of copy number variations in Holstein cattle from Baja California, Mexico, using high-density SNP genotyping arrays.利用高密度SNP基因分型阵列对墨西哥下加利福尼亚州荷斯坦奶牛的全基因组拷贝数变异进行鉴定。
Genet Mol Res. 2015 Oct 2;14(4):11848-59. doi: 10.4238/2015.October.2.18.
10
Identification and validation of copy number variants using SNP genotyping arrays from a large clinical cohort.利用大型临床队列中的 SNP 基因分型阵列鉴定和验证拷贝数变异。
BMC Genomics. 2012 Jun 15;13:241. doi: 10.1186/1471-2164-13-241.

引用本文的文献

1
Mapping copy number variable regions correlated with reproduction and production traits in Karan Fries cattle mammalian genomics.绘制与卡兰·弗里斯牛繁殖和生产性状相关的拷贝数可变区域 哺乳动物基因组学
Mamm Genome. 2025 Aug 15. doi: 10.1007/s00335-025-10152-w.
2
Chromosomal quality control in hPSCs: A practical guide to SNP array analysis with GenomeStudio.人多能干细胞中的染色体质量控制:使用GenomeStudio进行SNP阵列分析的实用指南。
Front Cell Dev Biol. 2025 Jul 1;13:1599923. doi: 10.3389/fcell.2025.1599923. eCollection 2025.
3
The genetic landscape of autism spectrum disorder in the Middle Eastern population.

本文引用的文献

1
Evolution in health and medicine Sackler colloquium: Genomic disorders: a window into human gene and genome evolution.健康与医学领域的演变:萨克勒研讨会——基因组疾病:洞察人类基因与基因组进化的窗口
Proc Natl Acad Sci U S A. 2010 Jan 26;107 Suppl 1(Suppl 1):1765-71. doi: 10.1073/pnas.0906222107. Epub 2010 Jan 13.
2
The role of copy number variation in susceptibility to amyotrophic lateral sclerosis: genome-wide association study and comparison with published loci.拷贝数变异在肌萎缩侧索硬化易感性中的作用:全基因组关联研究及与已发表基因座的比较。
PLoS One. 2009 Dec 4;4(12):e8175. doi: 10.1371/journal.pone.0008175.
3
Comparing CNV detection methods for SNP arrays.
中东人群自闭症谱系障碍的遗传图谱。
Front Genet. 2024 Mar 20;15:1363849. doi: 10.3389/fgene.2024.1363849. eCollection 2024.
4
Pharmacogenomic variation in the Malagasy population: implications for the antimalarial drug primaquine metabolism.马达加斯加人群的药物基因组学变异:对抗疟药物伯氨喹代谢的影响。
Pharmacogenomics. 2023 Jul;24(11):583-597. doi: 10.2217/pgs-2023-0091. Epub 2023 Aug 8.
5
CNest: A novel copy number association discovery method uncovers 862 new associations from 200,629 whole-exome sequence datasets in the UK Biobank.CNest:一种新型的拷贝数关联发现方法,从英国生物银行的200,629个全外显子序列数据集中发现了862个新的关联。
Cell Genom. 2022 Aug 10;2(8):100167. doi: 10.1016/j.xgen.2022.100167.
6
Similar Rates of Deleterious Copy Number Variants in Early-Onset Psychosis and Autism Spectrum Disorder.早发性精神病和自闭症谱系障碍中有害拷贝数变异的相似发生率。
Am J Psychiatry. 2022 Nov 1;179(11):853-861. doi: 10.1176/appi.ajp.21111175. Epub 2022 Aug 24.
7
Analysis of copy number variation in dogs implicates genomic structural variation in the development of anterior cruciate ligament rupture.分析犬的拷贝数变异提示基因组结构变异在前交叉韧带断裂的发生中起作用。
PLoS One. 2020 Dec 31;15(12):e0244075. doi: 10.1371/journal.pone.0244075. eCollection 2020.
8
Double hits in schizophrenia.精神分裂症中的双重打击。
Hum Mol Genet. 2018 Aug 1;27(15):2755-2761. doi: 10.1093/hmg/ddy175.
9
MinorityReport, software for generalized analysis of causal genetic variants.《少数派报告》,用于因果基因变异广义分析的软件。
Malar J. 2017 Feb 23;16(1):90. doi: 10.1186/s12936-017-1730-2.
10
Comparative Analysis of CNV Calling Algorithms: Literature Survey and a Case Study Using Bovine High-Density SNP Data.拷贝数变异(CNV)检测算法的比较分析:文献综述及基于牛高密度单核苷酸多态性(SNP)数据的案例研究
Microarrays (Basel). 2013 Jun 25;2(3):171-85. doi: 10.3390/microarrays2030171.
比较单核苷酸多态性(SNP)阵列的拷贝数变异(CNV)检测方法。
Brief Funct Genomic Proteomic. 2009 Sep;8(5):353-66. doi: 10.1093/bfgp/elp017. Epub 2009 Sep 8.
4
The HapMap and genome-wide association studies in diagnosis and therapy.国际人类基因组单体型图计划及全基因组关联研究在诊断与治疗中的应用
Annu Rev Med. 2009;60:443-56. doi: 10.1146/annurev.med.60.061907.093117.
5
Genome-wide association studies, field synopses, and the development of the knowledge base on genetic variation and human diseases.全基因组关联研究、领域概述以及关于遗传变异与人类疾病知识库的发展。
Am J Epidemiol. 2009 Aug 1;170(3):269-79. doi: 10.1093/aje/kwp119. Epub 2009 Jun 4.
6
Population analysis of large copy number variants and hotspots of human genetic disease.人类遗传疾病的大片段拷贝数变异和热点区域的群体分析。
Am J Hum Genet. 2009 Feb;84(2):148-61. doi: 10.1016/j.ajhg.2008.12.014. Epub 2009 Jan 22.
7
Extending genome-wide association studies to copy-number variation.将全基因组关联研究扩展至拷贝数变异
Hum Mol Genet. 2008 Oct 15;17(R2):R135-42. doi: 10.1093/hmg/ddn282.
8
Systematic assessment of copy number variant detection via genome-wide SNP genotyping.通过全基因组单核苷酸多态性基因分型对拷贝数变异检测进行系统评估。
Nat Genet. 2008 Oct;40(10):1199-203. doi: 10.1038/ng.236. Epub 2008 Sep 7.
9
Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.单核苷酸多态性(SNPs)、常见拷贝数多态性和罕见拷贝数变异(CNVs)的整合基因型分型与关联分析。
Nat Genet. 2008 Oct;40(10):1253-60. doi: 10.1038/ng.237. Epub 2008 Sep 7.
10
Large recurrent microdeletions associated with schizophrenia.与精神分裂症相关的大型复发性微缺失
Nature. 2008 Sep 11;455(7210):232-6. doi: 10.1038/nature07229.