• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯分层混合建模,用于从靶向 CNV 阵列中分配拷贝数。

Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array.

机构信息

Department of Statistics, University of Oxford, 1 South Parks Road, Oxford, United Kingdom.

出版信息

Genet Epidemiol. 2011 Sep;35(6):536-48. doi: 10.1002/gepi.20604. Epub 2011 Jul 18.

DOI:10.1002/gepi.20604
PMID:21769931
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3159791/
Abstract

Accurate assignment of copy number at known copy number variant (CNV) loci is important for both increasing understanding of the structural evolution of genomes as well as for carrying out association studies of copy number with disease. As with calling SNP genotypes, the task can be framed as a clustering problem but for a number of reasons assigning copy number is much more challenging. CNV assays have lower signal-to-noise ratios than SNP assays, often display heavy tailed and asymmetric intensity distributions, contain outlying observations and may exhibit systematic technical differences among different cohorts. In addition, the number of copy-number classes at a CNV in the population may be unknown a priori. Due to these complications, automatic and robust assignment of copy number from array data remains a challenging problem. We have developed a copy number assignment algorithm, CNVCALL, for a targeted CNV array, such as that used by the Wellcome Trust Case Control Consortium's recent CNV association study. We use a Bayesian hierarchical mixture model that robustly identifies both the number of different copy number classes at a specific locus as well as relative copy number for each individual in the sample. This approach is fully automated which is a critical requirement when analyzing large numbers of CNVs. We illustrate the methods performance using real data from the Wellcome Trust Case Control Consortium's CNV association study and using simulated data.

摘要

准确地确定已知拷贝数变异(CNV)位点的拷贝数对于增加对基因组结构演化的理解以及进行拷贝数与疾病的关联研究都很重要。与调用 SNP 基因型一样,该任务可以被构造成聚类问题,但由于多种原因,分配拷贝数更具挑战性。CNV 检测的信号与噪声比低于 SNP 检测,通常表现出重尾和非对称的强度分布,包含异常值观察值,并且可能在不同队列之间表现出系统的技术差异。此外,人群中特定 CNV 的拷贝数类别数量可能事先未知。由于这些复杂性,从阵列数据中自动且稳健地分配拷贝数仍然是一个具有挑战性的问题。我们已经开发了一种拷贝数分配算法,即 CNVCALL,用于靶向 CNV 阵列,例如由惠康信托基金病例对照协会的最近 CNV 关联研究使用的阵列。我们使用贝叶斯分层混合模型,该模型稳健地识别特定位置的不同拷贝数类别数量以及样本中每个个体的相对拷贝数。这种方法是完全自动化的,这是分析大量 CNV 时的关键要求。我们使用来自惠康信托基金病例对照协会的 CNV 关联研究的真实数据和模拟数据来说明方法的性能。

相似文献

1
Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array.贝叶斯分层混合建模,用于从靶向 CNV 阵列中分配拷贝数。
Genet Epidemiol. 2011 Sep;35(6):536-48. doi: 10.1002/gepi.20604. Epub 2011 Jul 18.
2
Genome-wide association study of copy number variation with lung function identifies a novel signal of association near BANP for forced vital capacity.拷贝数变异与肺功能的全基因组关联研究确定了一个靠近BANP的与用力肺活量相关的新关联信号。
BMC Genet. 2016 Aug 11;17(1):116. doi: 10.1186/s12863-016-0423-0.
3
Bayesian EM algorithm for scoring polymorphic deletions from SNP data and application to a common CNV on 8q24.用于从SNP数据中对多态性缺失进行评分的贝叶斯期望最大化算法及其在8q24上常见拷贝数变异中的应用
Genet Epidemiol. 2009 May;33(4):357-68. doi: 10.1002/gepi.20391.
4
HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.单倍型拷贝数:利用隐马尔可夫模型和局部单倍型聚类进行拷贝数单倍型推断
PLoS One. 2014 May 21;9(5):e96841. doi: 10.1371/journal.pone.0096841. eCollection 2014.
5
Genotype copy number variations using Gaussian mixture models: theory and algorithms.使用高斯混合模型的基因型拷贝数变异:理论与算法
Stat Appl Genet Mol Biol. 2012 Oct 12;11(5):5. doi: 10.1515/1544-6115.1725.
6
Noise-robust assessment of SNP array based CNV calls through local noise estimation of log R ratios.通过对数R比率的局部噪声估计对基于单核苷酸多态性(SNP)阵列的拷贝数变异(CNV)调用进行抗噪声评估。
Stat Appl Genet Mol Biol. 2018 Apr 28;17(2):sagmb-2017-0026. doi: 10.1515/sagmb-2017-0026.
7
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
8
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
9
The effect of algorithms on copy number variant detection.算法对拷贝数变异检测的影响。
PLoS One. 2010 Dec 30;5(12):e14456. doi: 10.1371/journal.pone.0014456.
10
Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle.使用高密度或中密度单核苷酸多态性基因分型面板检测到的拷贝数变异与从牛侧翼高密度单核苷酸多态性单倍型推断拷贝数变异的一致性。
BMC Genomics. 2020 Mar 4;21(1):205. doi: 10.1186/s12864-020-6627-8.

引用本文的文献

1
Bayesian copy number detection and association in large-scale studies.贝叶斯拷贝数检测及其在大规模研究中的关联分析。
BMC Cancer. 2020 Sep 7;20(1):856. doi: 10.1186/s12885-020-07304-3.
2
Genome-Wide Association of Copy Number Polymorphisms and Kidney Function.全基因组范围内拷贝数多态性与肾功能的关联研究
PLoS One. 2017 Jan 30;12(1):e0170815. doi: 10.1371/journal.pone.0170815. eCollection 2017.
3
Whole exome association of rare deletions in multiplex oral cleft families.多重口腔裂隙家族中罕见缺失的全外显子组关联研究
Genet Epidemiol. 2017 Jan;41(1):61-69. doi: 10.1002/gepi.22010. Epub 2016 Dec 1.
4
Genome-wide association study of copy number variation with lung function identifies a novel signal of association near BANP for forced vital capacity.拷贝数变异与肺功能的全基因组关联研究确定了一个靠近BANP的与用力肺活量相关的新关联信号。
BMC Genet. 2016 Aug 11;17(1):116. doi: 10.1186/s12863-016-0423-0.
5
Association analysis of copy numbers of FC-gamma receptor genes for rheumatoid arthritis and other immune-mediated phenotypes.类风湿关节炎及其他免疫介导表型的Fcγ受体基因拷贝数关联分析
Eur J Hum Genet. 2016 Feb;24(2):263-70. doi: 10.1038/ejhg.2015.95. Epub 2015 May 13.
6
Generalized species sampling priors with latent Beta reinforcements.具有潜在贝塔增强的广义物种抽样先验。
J Am Stat Assoc. 2014 Dec 1;109(508):1466-1480. doi: 10.1080/01621459.2014.950735.
7
Modified screening and ranking algorithm for copy number variation detection.用于拷贝数变异检测的改进筛选与排序算法
Bioinformatics. 2015 May 1;31(9):1341-8. doi: 10.1093/bioinformatics/btu850. Epub 2014 Dec 25.
8
Copy number polymorphisms near SLC2A9 are associated with serum uric acid concentrations.溶质载体家族2成员9(SLC2A9)附近的拷贝数多态性与血清尿酸浓度相关。
BMC Genet. 2014 Jul 9;15:81. doi: 10.1186/1471-2156-15-81.
9
A HIERARCHICAL BAYESIAN MODEL FOR INFERENCE OF COPY NUMBER VARIANTS AND THEIR ASSOCIATION TO GENE EXPRESSION.用于推断拷贝数变异及其与基因表达关联的分层贝叶斯模型。
Ann Appl Stat. 2014 Mar 1;8(1):148-175. doi: 10.1214/13-AOAS705.
10
ParseCNV integrative copy number variation association software with quality tracking.ParseCNV 整合拷贝数变异关联软件,具有质量跟踪功能。
Nucleic Acids Res. 2013 Mar 1;41(5):e64. doi: 10.1093/nar/gks1346. Epub 2013 Jan 4.

本文引用的文献

1
Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls.全基因组关联研究分析了 16000 例 8 种常见疾病和 3000 例共享对照的 CNVs。
Nature. 2010 Apr 1;464(7289):713-20. doi: 10.1038/nature08979.
2
Origins and functional impact of copy number variation in the human genome.人类基因组中拷贝数变异的起源和功能影响。
Nature. 2010 Apr 1;464(7289):704-12. doi: 10.1038/nature08516. Epub 2009 Oct 7.
3
Deletion polymorphism upstream of IRGM associated with altered IRGM expression and Crohn's disease.IRGM上游的缺失多态性与IRGM表达改变及克罗恩病相关。
Nat Genet. 2008 Sep;40(9):1107-12. doi: 10.1038/ng.215.
4
A robust statistical method for case-control association testing with copy number variation.一种用于拷贝数变异病例对照关联测试的稳健统计方法。
Nat Genet. 2008 Oct;40(10):1245-52. doi: 10.1038/ng.206. Epub 2008 Sep 7.
5
Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.单核苷酸多态性(SNPs)、常见拷贝数多态性和罕见拷贝数变异(CNVs)的整合基因型分型与关联分析。
Nat Genet. 2008 Oct;40(10):1253-60. doi: 10.1038/ng.237. Epub 2008 Sep 7.
6
Integrated detection and population-genetic analysis of SNPs and copy number variation.单核苷酸多态性(SNPs)与拷贝数变异的综合检测及群体遗传分析
Nat Genet. 2008 Oct;40(10):1166-74. doi: 10.1038/ng.238. Epub 2008 Sep 7.
7
Large recurrent microdeletions associated with schizophrenia.与精神分裂症相关的大型复发性微缺失
Nature. 2008 Sep 11;455(7210):232-6. doi: 10.1038/nature07229.
8
Rare chromosomal deletions and duplications increase risk of schizophrenia.罕见的染色体缺失和重复会增加患精神分裂症的风险。
Nature. 2008 Sep 11;455(7210):237-41. doi: 10.1038/nature07239. Epub 2008 Jul 30.
9
A second generation human haplotype map of over 3.1 million SNPs.一张包含超过310万个单核苷酸多态性的第二代人类单倍型图谱。
Nature. 2007 Oct 18;449(7164):851-61. doi: 10.1038/nature06258.
10
PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.PennCNV:一种为在全基因组单核苷酸多态性基因分型数据中进行高分辨率拷贝数变异检测而设计的集成隐马尔可夫模型。
Genome Res. 2007 Nov;17(11):1665-74. doi: 10.1101/gr.6861907. Epub 2007 Oct 5.