• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

wuHMM:一种利用长寡核苷酸微阵列数据检测DNA拷贝数变异的稳健算法。

wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data.

作者信息

Cahan Patrick, Godfrey Laura E, Eis Peggy S, Richmond Todd A, Selzer Rebecca R, Brent Michael, McLeod Howard L, Ley Timothy J, Graubert Timothy A

机构信息

Department of Internal Medicine and Department of Genetics, Division of Oncology, Stem Cell Biology Section, Washington University, St Louis, MO, USA.

出版信息

Nucleic Acids Res. 2008 Apr;36(7):e41. doi: 10.1093/nar/gkn110. Epub 2008 Mar 11.

DOI:10.1093/nar/gkn110
PMID:18334530
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2367727/
Abstract

Copy number variants (CNVs) are currently defined as genomic sequences that are polymorphic in copy number and range in length from 1000 to several million base pairs. Among current array-based CNV detection platforms, long-oligonucleotide arrays promise the highest resolution. However, the performance of currently available analytical tools suffers when applied to these data because of the lower signal:noise ratio inherent in oligonucleotide-based hybridization assays. We have developed wuHMM, an algorithm for mapping CNVs from array comparative genomic hybridization (aCGH) platforms comprised of 385 000 to more than 3 million probes. wuHMM is unique in that it can utilize sequence divergence information to reduce the false positive rate (FPR). We apply wuHMM to 385K-aCGH, 2.1M-aCGH and 3.1M-aCGH experiments comparing the 129X1/SvJ and C57BL/6J inbred mouse genomes. We assess wuHMM's performance on the 385K platform by comparison to the higher resolution platforms and we independently validate 10 CNVs. The method requires no training data and is robust with respect to changes in algorithm parameters. At a FPR of <10%, the algorithm can detect CNVs with five probes on the 385K platform and three on the 2.1M and 3.1M platforms, resulting in effective resolutions of 24 kb, 2-5 kb and 1 kb, respectively.

摘要

拷贝数变异(CNV)目前被定义为拷贝数呈多态性且长度范围从1000个碱基对到数百万个碱基对的基因组序列。在当前基于阵列的CNV检测平台中,长寡核苷酸阵列有望实现最高分辨率。然而,由于基于寡核苷酸的杂交检测中固有的较低信噪比,当前可用的分析工具在应用于这些数据时性能不佳。我们开发了wuHMM,这是一种用于从由385,000个到超过300万个探针组成的阵列比较基因组杂交(aCGH)平台映射CNV的算法。wuHMM的独特之处在于它可以利用序列差异信息来降低假阳性率(FPR)。我们将wuHMM应用于比较129X1/SvJ和C57BL/6J近交小鼠基因组的385K-aCGH、2.1M-aCGH和3.1M-aCGH实验。我们通过与更高分辨率平台比较来评估wuHMM在385K平台上的性能,并独立验证了10个CNV。该方法不需要训练数据,并且在算法参数变化方面具有鲁棒性。在FPR<10%时,该算法在385K平台上可以用五个探针检测CNV,在2.1M和3.1M平台上可以用三个探针检测CNV,有效分辨率分别为24 kb、2 - 5 kb和1 kb。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/3bf27abdbde1/gkn110f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/262a7553a1cf/gkn110f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/0017a2d5678f/gkn110f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/245f53b0f0f5/gkn110f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/f3e73ca4f8ab/gkn110f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/3bf27abdbde1/gkn110f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/262a7553a1cf/gkn110f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/0017a2d5678f/gkn110f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/245f53b0f0f5/gkn110f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/f3e73ca4f8ab/gkn110f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81f2/2367727/3bf27abdbde1/gkn110f5.jpg

相似文献

1
wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data.wuHMM:一种利用长寡核苷酸微阵列数据检测DNA拷贝数变异的稳健算法。
Nucleic Acids Res. 2008 Apr;36(7):e41. doi: 10.1093/nar/gkn110. Epub 2008 Mar 11.
2
Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans.用于人类全基因组拷贝数变异(CNV)分析的高分辨率阵列平台的综合性能比较
BMC Genomics. 2017 Apr 24;18(1):321. doi: 10.1186/s12864-017-3658-x.
3
Gene-resolution analysis of DNA copy number variation using oligonucleotide expression microarrays.使用寡核苷酸表达微阵列对DNA拷贝数变异进行基因分辨率分析。
BMC Genomics. 2007 Apr 30;8:111. doi: 10.1186/1471-2164-8-111.
4
Improved detection of global copy number variation using high density, non-polymorphic oligonucleotide probes.使用高密度、非多态性寡核苷酸探针改进全基因组拷贝数变异检测。
BMC Genet. 2008 Mar 28;9:27. doi: 10.1186/1471-2156-9-27.
5
A multi-sample based method for identifying common CNVs in normal human genomic structure using high-resolution aCGH data.基于多样本的方法,利用高分辨率 aCGH 数据识别正常人类基因组结构中的常见 CNV。
PLoS One. 2011;6(10):e26975. doi: 10.1371/journal.pone.0026975. Epub 2011 Oct 31.
6
Microarray-based maps of copy-number variant regions in European and sub-Saharan populations.基于微阵列的欧洲和撒哈拉以南人群拷贝数变异区域图谱。
PLoS One. 2010 Dec 16;5(12):e15246. doi: 10.1371/journal.pone.0015246.
7
Optimizing copy number variation analysis using genome-wide short sequence oligonucleotide arrays.利用全基因组短序列寡核苷酸阵列优化拷贝数变异分析。
Nucleic Acids Res. 2010 Jun;38(10):3275-86. doi: 10.1093/nar/gkq073. Epub 2010 Feb 15.
8
Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays.使用高密度DNA寡核苷酸阵列进行全基因组人类拷贝数变异检测。
Genome Res. 2006 Dec;16(12):1575-84. doi: 10.1101/gr.5629106. Epub 2006 Nov 22.
9
A high-resolution map of segmental DNA copy number variation in the mouse genome.小鼠基因组中DNA片段拷贝数变异的高分辨率图谱。
PLoS Genet. 2007 Jan 5;3(1):e3. doi: 10.1371/journal.pgen.0030003. Epub 2006 Nov 22.
10
Detection of pathogenic gene copy number variations in patients with mental retardation by genomewide oligonucleotide array comparative genomic hybridization.采用全基因组寡核苷酸阵列比较基因组杂交技术检测智力发育迟缓患者的致病基因拷贝数变异
Hum Mutat. 2007 Nov;28(11):1124-32. doi: 10.1002/humu.20581.

引用本文的文献

1
Fast Bayesian Inference of Copy Number Variants using Hidden Markov Models with Wavelet Compression.使用具有小波压缩的隐马尔可夫模型进行拷贝数变异的快速贝叶斯推断。
PLoS Comput Biol. 2016 May 13;12(5):e1004871. doi: 10.1371/journal.pcbi.1004871. eCollection 2016 May.
2
Multiple samples aCGH analysis for rare CNVs detection.用于罕见拷贝数变异检测的多样本阵列比较基因组杂交分析。
J Clin Bioinforma. 2013 Jun 11;3(1):12. doi: 10.1186/2043-9113-3-12.
3
Copy number variation signature to predict human ancestry.拷贝数变异特征预测人类起源。

本文引用的文献

1
A sequence-based variation map of 8.27 million SNPs in inbred mouse strains.近交系小鼠品系中827万个单核苷酸多态性(SNP)的基于序列的变异图谱。
Nature. 2007 Aug 30;448(7157):1050-3. doi: 10.1038/nature06067. Epub 2007 Jul 29.
2
Methods and strategies for analyzing copy number variation using DNA microarrays.使用DNA微阵列分析拷贝数变异的方法和策略。
Nat Genet. 2007 Jul;39(7 Suppl):S16-21. doi: 10.1038/ng2028.
3
Systematic prediction and validation of breakpoints associated with copy-number variants in the human genome.人类基因组中与拷贝数变异相关断点的系统预测与验证
BMC Bioinformatics. 2012 Dec 27;13:336. doi: 10.1186/1471-2105-13-336.
4
Fast detection of de novo copy number variants from SNP arrays for case-parent trios.基于 SNP 芯片的先证者-父母三体型检测新发拷贝数变异的快速方法。
BMC Bioinformatics. 2012 Dec 12;13:330. doi: 10.1186/1471-2105-13-330.
5
Parsimonious higher-order hidden Markov models for improved array-CGH analysis with applications to Arabidopsis thaliana.简约高阶隐马尔可夫模型提高了微阵列比较基因组杂交分析的性能,并在拟南芥中得到了应用。
PLoS Comput Biol. 2012 Jan;8(1):e1002286. doi: 10.1371/journal.pcbi.1002286. Epub 2012 Jan 12.
6
Sequencing a mouse acute promyelocytic leukemia genome reveals genetic events relevant for disease progression.对小鼠急性早幼粒细胞白血病基因组进行测序揭示了与疾病进展相关的遗传事件。
J Clin Invest. 2011 Apr;121(4):1445-55. doi: 10.1172/JCI45284. Epub 2011 Mar 23.
7
Genome-wide studies of copy number variation and exome sequencing identify rare variants in BAG3 as a cause of dilated cardiomyopathy.全基因组拷贝数变异和外显子组测序研究发现 BAG3 中的罕见变异是扩张型心肌病的一个原因。
Am J Hum Genet. 2011 Mar 11;88(3):273-82. doi: 10.1016/j.ajhg.2011.01.016. Epub 2011 Feb 25.
8
Microarray-based maps of copy-number variant regions in European and sub-Saharan populations.基于微阵列的欧洲和撒哈拉以南人群拷贝数变异区域图谱。
PLoS One. 2010 Dec 16;5(12):e15246. doi: 10.1371/journal.pone.0015246.
9
Integrated genomics of susceptibility to alkylator-induced leukemia in mice.小鼠烷化剂诱导性白血病易感性的综合基因组学研究。
BMC Genomics. 2010 Nov 17;11:638. doi: 10.1186/1471-2164-11-638.
10
Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian model.基于逐步贝叶斯模型,利用阵列强度和测序读取深度检测拷贝数变异。
BMC Bioinformatics. 2010 Oct 31;11:539. doi: 10.1186/1471-2105-11-539.
Proc Natl Acad Sci U S A. 2007 Jun 12;104(24):10110-5. doi: 10.1073/pnas.0703834104. Epub 2007 Jun 5.
4
A new look towards BAC-based array CGH through a comprehensive comparison with oligo-based array CGH.通过与基于寡核苷酸的阵列比较基因组杂交进行全面比较,重新审视基于细菌人工染色体的阵列比较基因组杂交。
BMC Genomics. 2007 Mar 29;8:84. doi: 10.1186/1471-2164-8-84.
5
Strong association of de novo copy number mutations with autism.新发拷贝数突变与自闭症的强关联。
Science. 2007 Apr 20;316(5823):445-9. doi: 10.1126/science.1138659. Epub 2007 Mar 15.
6
Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia.急性淋巴细胞白血病基因改变的全基因组分析
Nature. 2007 Apr 12;446(7137):758-64. doi: 10.1038/nature05690.
7
Mapping autism risk loci using genetic linkage and chromosomal rearrangements.利用遗传连锁和染色体重排绘制自闭症风险基因座图谱。
Nat Genet. 2007 Mar;39(3):319-28. doi: 10.1038/ng1985. Epub 2007 Feb 18.
8
Relative impact of nucleotide and copy number variation on gene expression phenotypes.核苷酸和拷贝数变异对基因表达表型的相对影响。
Science. 2007 Feb 9;315(5813):848-53. doi: 10.1126/science.1136678.
9
A high-resolution map of segmental DNA copy number variation in the mouse genome.小鼠基因组中DNA片段拷贝数变异的高分辨率图谱。
PLoS Genet. 2007 Jan 5;3(1):e3. doi: 10.1371/journal.pgen.0030003. Epub 2006 Nov 22.
10
A comprehensive analysis of common copy-number variations in the human genome.对人类基因组中常见拷贝数变异的综合分析。
Am J Hum Genet. 2007 Jan;80(1):91-104. doi: 10.1086/510560. Epub 2006 Dec 5.