• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于逐步贝叶斯模型,利用阵列强度和测序读取深度检测拷贝数变异。

Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian model.

机构信息

Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA.

出版信息

BMC Bioinformatics. 2010 Oct 31;11:539. doi: 10.1186/1471-2105-11-539.

DOI:10.1186/1471-2105-11-539
PMID:21034510
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2992546/
Abstract

BACKGROUND

Copy number variants (CNVs) have been demonstrated to occur at a high frequency and are now widely believed to make a significant contribution to the phenotypic variation in human populations. Array-based comparative genomic hybridization (array-CGH) and newly developed read-depth approach through ultrahigh throughput genomic sequencing both provide rapid, robust, and comprehensive methods to identify CNVs on a whole-genome scale.

RESULTS

We developed a Bayesian statistical analysis algorithm for the detection of CNVs from both types of genomic data. The algorithm can analyze such data obtained from PCR-based bacterial artificial chromosome arrays, high-density oligonucleotide arrays, and more recently developed high-throughput DNA sequencing. Treating parameters--e.g., the number of CNVs, the position of each CNV, and the data noise level--that define the underlying data generating process as random variables, our approach derives the posterior distribution of the genomic CNV structure given the observed data. Sampling from the posterior distribution using a Markov chain Monte Carlo method, we get not only best estimates for these unknown parameters but also Bayesian credible intervals for the estimates. We illustrate the characteristics of our algorithm by applying it to both synthetic and experimental data sets in comparison to other segmentation algorithms.

CONCLUSIONS

In particular, the synthetic data comparison shows that our method is more sensitive than other approaches at low false positive rates. Furthermore, given its Bayesian origin, our method can also be seen as a technique to refine CNVs identified by fast point-estimate methods and also as a framework to integrate array-CGH and sequencing data with other CNV-related biological knowledge, all through informative priors.

摘要

背景

已证实拷贝数变异(CNVs)频繁发生,并被广泛认为对人类群体的表型变异有重要贡献。基于阵列的比较基因组杂交(array-CGH)和新开发的超高通量基因组测序的读深度方法都提供了快速、稳健和全面的方法,可在全基因组范围内识别 CNVs。

结果

我们开发了一种贝叶斯统计分析算法,用于从这两种类型的基因组数据中检测 CNVs。该算法可以分析基于 PCR 的细菌人工染色体阵列、高密度寡核苷酸阵列以及最近开发的高通量 DNA 测序获得的数据。将定义潜在数据生成过程的参数(例如,CNV 的数量、每个 CNV 的位置以及数据噪声水平)视为随机变量,我们的方法可以从观察到的数据中推导出基因组 CNV 结构的后验分布。通过使用马尔可夫链蒙特卡罗方法从后验分布中进行抽样,我们不仅可以得到这些未知参数的最佳估计值,还可以得到估计值的贝叶斯可信区间。我们通过将其应用于合成数据集和实验数据集并与其他分割算法进行比较,展示了我们算法的特点。

结论

特别是,与其他方法相比,合成数据的比较表明我们的方法在低假阳性率下更敏感。此外,由于其贝叶斯起源,我们的方法还可以被视为一种通过快速点估计方法来改进 CNV 的技术,也可以作为一种框架,通过信息丰富的先验知识将 array-CGH 和测序数据与其他与 CNV 相关的生物学知识整合在一起。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/42d9f4844c2c/1471-2105-11-539-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/1b6d862b38bf/1471-2105-11-539-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/51b707cf1f20/1471-2105-11-539-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/6b7c90902bb3/1471-2105-11-539-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/148c619d550a/1471-2105-11-539-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/b53575dc6b8b/1471-2105-11-539-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/3859c9633d19/1471-2105-11-539-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/42d9f4844c2c/1471-2105-11-539-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/1b6d862b38bf/1471-2105-11-539-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/51b707cf1f20/1471-2105-11-539-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/6b7c90902bb3/1471-2105-11-539-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/148c619d550a/1471-2105-11-539-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/b53575dc6b8b/1471-2105-11-539-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/3859c9633d19/1471-2105-11-539-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d44/2992546/42d9f4844c2c/1471-2105-11-539-7.jpg

相似文献

1
Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian model.基于逐步贝叶斯模型,利用阵列强度和测序读取深度检测拷贝数变异。
BMC Bioinformatics. 2010 Oct 31;11:539. doi: 10.1186/1471-2105-11-539.
2
Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion.使用贝叶斯信息准则检测全基因组测序数据中的拷贝数变异。
Proc Natl Acad Sci U S A. 2011 Nov 15;108(46):E1128-36. doi: 10.1073/pnas.1110574108. Epub 2011 Nov 7.
3
Reference-unbiased copy number variant analysis using CGH microarrays.基于 CGH 微阵列的无参考偏倚拷贝数变异分析。
Nucleic Acids Res. 2010 Nov;38(20):e190. doi: 10.1093/nar/gkq730. Epub 2010 Aug 27.
4
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
5
Detection of copy number variations in rice using array-based comparative genomic hybridization.利用基于阵列的比较基因组杂交技术检测水稻中的拷贝数变异。
BMC Genomics. 2011 Jul 20;12:372. doi: 10.1186/1471-2164-12-372.
6
CONY: A Bayesian procedure for detecting copy number variations from sequencing read depths.CONY:一种基于测序深度的拷贝数变异检测的贝叶斯方法。
Sci Rep. 2020 Jun 26;10(1):10493. doi: 10.1038/s41598-020-64353-1.
7
Optimization of Signal Decomposition Matched Filtering (SDMF) for Improved Detection of Copy-Number Variations.用于改进拷贝数变异检测的信号分解匹配滤波(SDMF)优化
IEEE/ACM Trans Comput Biol Bioinform. 2016 May-Jun;13(3):584-91. doi: 10.1109/TCBB.2015.2448077.
8
Evaluation of copy number variation detection for a SNP array platform.SNP 芯片平台拷贝数变异检测评估。
BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50.
9
Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.人类基因组拷贝数变异的全基因组图谱绘制:高分辨率阵列平台的比较分析。
PLoS One. 2011;6(11):e27859. doi: 10.1371/journal.pone.0027859. Epub 2011 Nov 30.
10
Assessing copy number from exome sequencing and exome array CGH based on CNV spectrum in a large clinical cohort.基于大型临床队列中的拷贝数变异谱,评估外显子组测序和外显子组阵列比较基因组杂交的拷贝数。
Genet Med. 2015 Aug;17(8):623-9. doi: 10.1038/gim.2014.160. Epub 2014 Nov 6.

引用本文的文献

1
The mutation rate of mycobacterial repetitive unit loci in strains of M. tuberculosis from cynomolgus macaque infection.从食蟹猴感染的结核分枝杆菌菌株中分枝杆菌重复单元基因座的突变率。
BMC Genomics. 2013 Mar 5;14:145. doi: 10.1186/1471-2164-14-145.
2
A genetic model for neurodevelopmental disease.神经发育疾病的遗传模型。
Curr Opin Neurobiol. 2012 Oct;22(5):829-36. doi: 10.1016/j.conb.2012.04.007. Epub 2012 May 2.
3
DNA sequencing: clinical applications of new DNA sequencing technologies.DNA测序:新型DNA测序技术的临床应用

本文引用的文献

1
Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.随机松弛,吉布斯分布,以及贝叶斯图像恢复。
IEEE Trans Pattern Anal Mach Intell. 1984 Jun;6(6):721-41. doi: 10.1109/tpami.1984.4767596.
2
A Bayesian segmentation approach to ascertain copy number variations at the population level.一种用于在群体水平上确定拷贝数变异的贝叶斯分割方法。
Bioinformatics. 2009 Jul 1;25(13):1669-79. doi: 10.1093/bioinformatics/btp270. Epub 2009 Apr 23.
3
Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA.
Circulation. 2012 Feb 21;125(7):931-44. doi: 10.1161/CIRCULATIONAHA.110.972828.
4
Identification of genomic indels and structural variations using split reads.利用拆分读识别基因组插入/缺失和结构变异。
BMC Genomics. 2011 Jul 25;12:375. doi: 10.1186/1471-2164-12-375.
使用GADA对多个DNA阵列上的拷贝数变异和参考强度进行联合估计。
Bioinformatics. 2009 May 15;25(10):1223-30. doi: 10.1093/bioinformatics/btp119. Epub 2009 Mar 10.
4
A fast and flexible method for the segmentation of aCGH data.一种用于阵列比较基因组杂交(aCGH)数据分割的快速灵活方法。
Bioinformatics. 2008 Aug 15;24(16):i139-45. doi: 10.1093/bioinformatics/btn272.
5
A fast Bayesian change point analysis for the segmentation of microarray data.一种用于微阵列数据分割的快速贝叶斯变化点分析方法。
Bioinformatics. 2008 Oct 1;24(19):2143-8. doi: 10.1093/bioinformatics/btn404. Epub 2008 Jul 29.
6
wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data.wuHMM:一种利用长寡核苷酸微阵列数据检测DNA拷贝数变异的稳健算法。
Nucleic Acids Res. 2008 Apr;36(7):e41. doi: 10.1093/nar/gkn110. Epub 2008 Mar 11.
7
Flexible and accurate detection of genomic copy-number changes from aCGH.从比较基因组杂交(aCGH)中灵活且准确地检测基因组拷贝数变化。
PLoS Comput Biol. 2007 Jun;3(6):e122. doi: 10.1371/journal.pcbi.0030122. Epub 2007 May 16.
8
Global variation in copy number in the human genome.人类基因组中拷贝数的全球变异。
Nature. 2006 Nov 23;444(7118):444-54. doi: 10.1038/nature05329.
9
Integrating copy number polymorphisms into array CGH analysis using a robust HMM.使用稳健的隐马尔可夫模型将拷贝数多态性整合到阵列比较基因组杂交分析中。
Bioinformatics. 2006 Jul 15;22(14):e431-9. doi: 10.1093/bioinformatics/btl238.
10
A Bayes regression approach to array-CGH data.一种用于阵列比较基因组杂交(array-CGH)数据的贝叶斯回归方法。
Stat Appl Genet Mol Biol. 2006;5:Article3. doi: 10.2202/1544-6115.1149. Epub 2006 Feb 16.