• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组算法检测与疾病相关的 CNV 关联。

Genome-wide algorithm for detecting CNV associations with diseases.

机构信息

Department of Epidemiology, The University of Texas MD Anderson Cancer Center, 1155 Pressler St,, Houston, Texas 77030, USA.

出版信息

BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.

DOI:10.1186/1471-2105-12-331
PMID:21827692
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3173460/
Abstract

BACKGROUND

SNP genotyping arrays have been developed to characterize single-nucleotide polymorphisms (SNPs) and DNA copy number variations (CNVs). Nonparametric and model-based statistical algorithms have been developed to detect CNVs from SNP data using the marker intensities. However, these algorithms lack specificity to detect small CNVs owing to the high false positive rate when calling CNVs based on the intensity values. Therefore, the resulting association tests lack power even if the CNVs affecting disease risk are common. An alternative procedure called PennCNV uses information from both the marker intensities as well as the genotypes and therefore has increased sensitivity.

RESULTS

By using the hidden Markov model (HMM) implemented in PennCNV to derive the probabilities of different copy number states which we subsequently used in a logistic regression model, we developed a new genome-wide algorithm to detect CNV associations with diseases. We compared this new method with association test applied to the most probable copy number state for each individual that is provided by PennCNV after it performs an initial HMM analysis followed by application of the Viterbi algorithm, which removes information about copy number probabilities. In one of our simulation studies, we showed that for large CNVs (number of SNPs ≥ 10), the association tests based on PennCNV calls gave more significant results, but the new algorithm retained high power. For small CNVs (number of SNPs <10), the logistic algorithm provided smaller average p-values (e.g., p = 7.54e - 17 when relative risk RR = 3.0) in all the scenarios and could capture signals that PennCNV did not (e.g., p = 0.020 when RR = 3.0). From a second set of simulations, we showed that the new algorithm is more powerful in detecting disease associations with small CNVs (number of SNPs ranging from 3 to 5) under different penetrance models (e.g., when RR = 3.0, for relatively weak signals, power = 0.8030 comparing to 0.2879 obtained from the association tests based on PennCNV calls). The new method was implemented in software GWCNV. It is freely available at http://gwcnv.sourceforge.net, distributed under a GPL license.

CONCLUSIONS

We conclude that the new algorithm is more sensitive and can be more powerful in detecting CNV associations with diseases than the existing HMM algorithm, especially when the CNV association signal is weak and a limited number of SNPs are located in the CNV.

摘要

背景

SNP 基因分型阵列已被开发用于描述单核苷酸多态性(SNP)和 DNA 拷贝数变异(CNV)。已经开发了非参数和基于模型的统计算法,以使用标记强度从 SNP 数据中检测 CNV。然而,由于基于强度值调用 CNV 时的假阳性率很高,这些算法缺乏检测小 CNV 的特异性。因此,即使影响疾病风险的 CNV 很常见,由此产生的关联测试也缺乏效力。一种称为 PennCNV 的替代方法利用了标记强度以及基因型的信息,因此具有更高的灵敏度。

结果

通过使用 PennCNV 中实现的隐马尔可夫模型(HMM)来推导不同拷贝数状态的概率,我们随后将这些概率用于逻辑回归模型中,从而开发了一种新的全基因组算法来检测与疾病相关的 CNV 关联。我们将这种新方法与关联测试进行了比较,关联测试适用于 PennCNV 执行初始 HMM 分析后,应用维特比算法为每个个体提供的最可能的拷贝数状态,该算法消除了拷贝数概率的信息。在我们的一项模拟研究中,我们表明对于大 CNV(SNP 数量≥10),基于 PennCNV 调用的关联测试给出了更显著的结果,但新算法保留了高功效。对于小 CNV(SNP 数量<10),逻辑算法在所有情况下提供了更小的平均 p 值(例如,RR=3.0 时 p=7.54e-17),并且可以捕获 PennCNV 未捕获的信号(例如,RR=3.0 时 p=0.020)。从第二组模拟中,我们表明,新算法在不同的外显率模型下(例如,RR=3.0,对于相对较弱的信号,功效=0.8030,而基于 PennCNV 调用的关联测试的功效为 0.2879),在检测与小 CNV(SNP 数量为 3 到 5)相关的疾病关联时,具有更高的功效。新方法已在软件 GWCNV 中实现。它可在 http://gwcnv.sourceforge.net 上免费获得,根据 GPL 许可证分发。

结论

我们的结论是,新算法比现有的 HMM 算法更敏感,并且在检测与疾病相关的 CNV 关联方面更有效,特别是当 CNV 关联信号较弱且 CNV 中仅包含有限数量的 SNP 时。

相似文献

1
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
2
Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform.用于评估 Affymetrix 6.0 SNP 阵列平台的基因组拷贝数变异的软件比较。
BMC Bioinformatics. 2011 May 31;12:220. doi: 10.1186/1471-2105-12-220.
3
Family-Based Benchmarking of Copy Number Variation Detection Software.基于家族的拷贝数变异检测软件基准测试
PLoS One. 2015 Jul 21;10(7):e0133465. doi: 10.1371/journal.pone.0133465. eCollection 2015.
4
Evaluation of copy number variation detection for a SNP array platform.SNP 芯片平台拷贝数变异检测评估。
BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50.
5
Fast detection of de novo copy number variants from SNP arrays for case-parent trios.基于 SNP 芯片的先证者-父母三体型检测新发拷贝数变异的快速方法。
BMC Bioinformatics. 2012 Dec 12;13:330. doi: 10.1186/1471-2105-13-330.
6
Identification of Copy Number Variants from SNP Arrays Using PennCNV.使用PennCNV从SNP阵列中鉴定拷贝数变异
Methods Mol Biol. 2018;1833:1-28. doi: 10.1007/978-1-4939-8666-8_1.
7
Cheek swabs, SNP chips, and CNVs: assessing the quality of copy number variant calls generated with subject-collected mail-in buccal brush DNA samples on a high-density genotyping microarray.颊拭子、SNP 芯片和 CNV:使用高密度基因分型微阵列评估通过邮寄采集的口腔刷 DNA 样本生成的拷贝数变异体调用的质量。
BMC Med Genet. 2012 Jun 26;13:51. doi: 10.1186/1471-2350-13-51.
8
Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle.使用高密度或中密度单核苷酸多态性基因分型面板检测到的拷贝数变异与从牛侧翼高密度单核苷酸多态性单倍型推断拷贝数变异的一致性。
BMC Genomics. 2020 Mar 4;21(1):205. doi: 10.1186/s12864-020-6627-8.
9
PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.PennCNV:一种为在全基因组单核苷酸多态性基因分型数据中进行高分辨率拷贝数变异检测而设计的集成隐马尔可夫模型。
Genome Res. 2007 Nov;17(11):1665-74. doi: 10.1101/gr.6861907. Epub 2007 Oct 5.
10
Genome-wide association study of copy number variation with lung function identifies a novel signal of association near BANP for forced vital capacity.拷贝数变异与肺功能的全基因组关联研究确定了一个靠近BANP的与用力肺活量相关的新关联信号。
BMC Genet. 2016 Aug 11;17(1):116. doi: 10.1186/s12863-016-0423-0.

引用本文的文献

1
CNVDeep: deep association of copy number variants with neurocognitive disorders.CNVDeep:拷贝数变异与神经认知障碍的深度关联。
BMC Bioinformatics. 2024 Aug 29;25(1):283. doi: 10.1186/s12859-024-05874-8.
2
Deviation from Mendelian transmission of autosomal SNPs can be used to estimate germline mutations in humans exposed to ionizing radiation.常染色体单核苷酸多态性的孟德尔传递偏差可用于估计人类暴露于电离辐射后的种系基因突变。
PLoS One. 2020 Oct 27;15(10):e0233941. doi: 10.1371/journal.pone.0233941. eCollection 2020.
3
Genomic population structure and prevalence of copy number variations in South African Nguni cattle.

本文引用的文献

1
Mapping copy number variation by population-scale genome sequencing.通过群体规模的基因组测序来绘制拷贝数变异图谱。
Nature. 2011 Feb 3;470(7332):59-65. doi: 10.1038/nature09708.
2
Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip.从一个 60kSNP 基因芯片推断出的猪基因组拷贝数变异。
BMC Genomics. 2010 Oct 22;11:593. doi: 10.1186/1471-2164-11-593.
3
Missing heritability and strategies for finding the underlying causes of complex disease.复杂疾病遗传率缺失及其潜在病因的研究策略。
南非恩古尼牛的基因组群体结构及拷贝数变异的流行情况
BMC Genomics. 2015 Nov 4;16:894. doi: 10.1186/s12864-015-2122-z.
4
Investigation of copy number variation in children with conotruncal heart defects.圆锥动脉干型心脏缺陷患儿的拷贝数变异研究。
Arq Bras Cardiol. 2015 Jan;104(1):24-31. doi: 10.5935/abc.20140169. Epub 2014 Nov 11.
5
The Growing Importance of CNVs: New Insights for Detection and Clinical Interpretation.CNVs 的重要性日益增加:检测和临床解读的新见解。
Front Genet. 2013 May 30;4:92. doi: 10.3389/fgene.2013.00092. eCollection 2013.
6
Continuing difficulties in interpreting CNV data: lessons from a genome-wide CNV association study of Australian HNPCC/lynch syndrome patients.解读 CNV 数据的持续困难:一项澳大利亚 HNPCC/林奇综合征患者全基因组 CNV 关联研究的经验教训。
BMC Med Genomics. 2013 Mar 26;6:10. doi: 10.1186/1755-8794-6-10.
Nat Rev Genet. 2010 Jun;11(6):446-50. doi: 10.1038/nrg2809.
4
Finding the missing heritability of complex diseases.寻找复杂疾病中缺失的遗传力。
Nature. 2009 Oct 8;461(7265):747-53. doi: 10.1038/nature08494.
5
Origins and functional impact of copy number variation in the human genome.人类基因组中拷贝数变异的起源和功能影响。
Nature. 2010 Apr 1;464(7289):704-12. doi: 10.1038/nature08516. Epub 2009 Oct 7.
6
Genome-wide analyses of exonic copy number variants in a family-based study point to novel autism susceptibility genes.一项基于家系研究的外显子拷贝数变异全基因组分析指向了新的自闭症易感基因。
PLoS Genet. 2009 Jun;5(6):e1000536. doi: 10.1371/journal.pgen.1000536. Epub 2009 Jun 26.
7
Copy number variation at 1q21.1 associated with neuroblastoma.1q21.1处的拷贝数变异与神经母细胞瘤相关。
Nature. 2009 Jun 18;459(7249):987-91. doi: 10.1038/nature08035.
8
CNV discovery using SNP genotyping arrays.使用单核苷酸多态性(SNP)基因分型阵列进行拷贝数变异(CNV)检测。
Cytogenet Genome Res. 2008;123(1-4):307-12. doi: 10.1159/000184722. Epub 2009 Mar 11.
9
Deletion of the late cornified envelope LCE3B and LCE3C genes as a susceptibility factor for psoriasis.晚期角质包膜LCE3B和LCE3C基因缺失作为银屑病的一个易感因素。
Nat Genet. 2009 Feb;41(2):211-5. doi: 10.1038/ng.313. Epub 2009 Jan 25.
10
Deletion polymorphism upstream of IRGM associated with altered IRGM expression and Crohn's disease.IRGM上游的缺失多态性与IRGM表达改变及克罗恩病相关。
Nat Genet. 2008 Sep;40(9):1107-12. doi: 10.1038/ng.215.