• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用GADA对多个DNA阵列上的拷贝数变异和参考强度进行联合估计。

Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA.

作者信息

Pique-Regi Roger, Ortega Antonio, Asgharzadeh Shahab

机构信息

Signal and Image Processing Institute, Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, EEB 400, 3740 McClintock Ave, Los Angeles, CA 90089-2564, USA.

出版信息

Bioinformatics. 2009 May 15;25(10):1223-30. doi: 10.1093/bioinformatics/btp119. Epub 2009 Mar 10.

DOI:10.1093/bioinformatics/btp119
PMID:19276152
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2732310/
Abstract

MOTIVATION

The complexity of a large number of recently discovered copy number polymorphisms is much higher than initially thought, thus making it more difficult to detect them in the presence of significant measurement noise. In this scenario, separate normalization and segmentation is prone to lead to many false detections of changes in copy number. New approaches capable of jointly modeling the copy number and the non-copy number (noise) hybridization effects across multiple samples will potentially lead to more accurate results.

METHODS

In this article, the genome alteration detection analysis (GADA) approach introduced in our previous work is extended to a multiple sample model. The copy number component is independent for each sample and uses a sparse Bayesian prior, while the reference hybridization level is not necessarily sparse but identical on all samples. The expectation maximization (EM) algorithm used to fit the model iteratively determines whether the observed hybridization levels are more likely due to a copy number variation or to a shared hybridization bias.

RESULTS

The new proposed approach is compared with the currently used strategy of separate normalization followed by independent segmentation of each array. Real microarray data obtained from HapMap samples are randomly partitioned to create different reference sets. Using the new approach, copy number and reference intensity estimates are significantly less variable if the reference set changes; and a higher consistency on copy numbers detected within HapMap family trios is obtained. Finally, the running time to fit the model grows linearly in the number samples and probes.

AVAILABILITY

http://biron.usc.edu/~piquereg/GADA.

摘要

动机

大量最近发现的拷贝数多态性的复杂性比最初设想的要高得多,因此在存在显著测量噪声的情况下更难检测到它们。在这种情况下,单独的归一化和分割容易导致许多拷贝数变化的误检测。能够跨多个样本联合建模拷贝数和非拷贝数(噪声)杂交效应的新方法可能会带来更准确的结果。

方法

在本文中,我们先前工作中引入的基因组改变检测分析(GADA)方法被扩展为一个多样本模型。拷贝数成分对于每个样本是独立的,并使用稀疏贝叶斯先验,而参考杂交水平不一定稀疏但在所有样本上是相同的。用于拟合模型迭代地确定观察到的杂交水平更可能是由于拷贝数变异还是由于共享杂交偏差的期望最大化(EM)算法。

结果

将新提出的方法与当前使用的先单独归一化然后对每个阵列进行独立分割的策略进行比较。从HapMap样本获得的真实微阵列数据被随机划分以创建不同的参考集。使用新方法,如果参考集改变,拷贝数和参考强度估计的变异性显著更小;并且在HapMap家系三联体中检测到的拷贝数上获得了更高的一致性。最后,拟合模型的运行时间随样本和探针数量线性增长。

可用性

http://biron.usc.edu/~piquereg/GADA。

相似文献

1
Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA.使用GADA对多个DNA阵列上的拷贝数变异和参考强度进行联合估计。
Bioinformatics. 2009 May 15;25(10):1223-30. doi: 10.1093/bioinformatics/btp119. Epub 2009 Mar 10.
2
Sparse representation and Bayesian detection of genome copy number alterations from microarray data.基于微阵列数据的基因组拷贝数变异的稀疏表示与贝叶斯检测
Bioinformatics. 2008 Feb 1;24(3):309-18. doi: 10.1093/bioinformatics/btm601. Epub 2008 Jan 18.
3
Segmentation and estimation for SNP microarrays: a Bayesian multiple change-point approach.SNP微阵列的分割与估计:一种贝叶斯多重变化点方法。
Biometrics. 2010 Sep;66(3):675-83. doi: 10.1111/j.1541-0420.2009.01328.x.
4
Estimation and assessment of raw copy numbers at the single locus level.单基因座水平上原始拷贝数的估计与评估。
Bioinformatics. 2008 Mar 15;24(6):759-67. doi: 10.1093/bioinformatics/btn016. Epub 2008 Jan 19.
5
Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios.Illumina Infinium全基因组单核苷酸多态性(SNP)数据的标准化可改善拷贝数估计和等位基因强度比。
BMC Bioinformatics. 2008 Oct 2;9:409. doi: 10.1186/1471-2105-9-409.
6
An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays.一种使用新一代群体优化人类阵列分析杂交强度和基因型的综合分析工具。
BMC Genomics. 2016 Mar 31;17:266. doi: 10.1186/s12864-016-2478-8.
7
Accurate detection of aneuploidies in array CGH and gene expression microarray data.在阵列比较基因组杂交和基因表达微阵列数据中准确检测非整倍体。
Bioinformatics. 2004 Dec 12;20(18):3533-43. doi: 10.1093/bioinformatics/bth440. Epub 2004 Jul 29.
8
CNAnova: a new approach for finding recurrent copy number abnormalities in cancer SNP microarray data.CNAnova:一种在癌症 SNP 微阵列数据中寻找反复出现的拷贝数异常的新方法。
Bioinformatics. 2010 Jun 1;26(11):1395-402. doi: 10.1093/bioinformatics/btq145. Epub 2010 Apr 18.
9
R-Gada: a fast and flexible pipeline for copy number analysis in association studies.R-Gada:关联研究中用于拷贝数分析的快速灵活的管道。
BMC Bioinformatics. 2010 Jul 16;11:380. doi: 10.1186/1471-2105-11-380.
10
Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data.寡核苷酸微阵列数据中基因组拷贝数变异高通量检测算法的评估
BMC Bioinformatics. 2007 Oct 2;8:368. doi: 10.1186/1471-2105-8-368.

引用本文的文献

1
Gene copy number variation in natural populations of Plasmodium falciparum in Eastern Africa.东非地区恶性疟原虫自然种群中的基因拷贝数变异。
BMC Genomics. 2018 May 21;19(1):372. doi: 10.1186/s12864-018-4689-7.
2
Genome instability model of metastatic neuroblastoma tumorigenesis by a dictionary learning algorithm.基于字典学习算法的转移性神经母细胞瘤肿瘤发生的基因组不稳定性模型
BMC Med Genomics. 2015 Sep 10;8:57. doi: 10.1186/s12920-015-0132-y.
3
HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.单倍型拷贝数:利用隐马尔可夫模型和局部单倍型聚类进行拷贝数单倍型推断
PLoS One. 2014 May 21;9(5):e96841. doi: 10.1371/journal.pone.0096841. eCollection 2014.
4
A genome-wide study of de novo deletions identifies a candidate locus for non-syndromic isolated cleft lip/palate risk.全基因组范围内的新发缺失研究确定了非综合征性孤立性唇腭裂风险的候选基因座。
BMC Genet. 2014 Feb 14;15:24. doi: 10.1186/1471-2156-15-24.
5
Novel multisample scheme for inferring phylogenetic markers from whole genome tumor profiles.从全基因组肿瘤图谱推断系统发育标记的新型多样本方案。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Nov-Dec;10(6):1422-31. doi: 10.1109/TCBB.2013.33.
6
Structural polymorphism in the promoter of pfmrp2 confers Plasmodium falciparum tolerance to quinoline drugs.pfmrp2 启动子中的结构多态性使恶性疟原虫对奎宁类药物产生耐受性。
Mol Microbiol. 2014 Mar;91(5):918-34. doi: 10.1111/mmi.12505. Epub 2014 Jan 23.
7
Genome wide analysis reveals single nucleotide polymorphisms associated with fatness and putative novel copy number variants in three pig breeds.全基因组分析揭示了三个猪品种中与肥胖相关的单核苷酸多态性以及假定的新型拷贝数变异。
BMC Genomics. 2013 Nov 13;14:784. doi: 10.1186/1471-2164-14-784.
8
Multisample aCGH data analysis via total variation and spectral regularization.基于全变差和谱正则化的多重样本 aCGH 数据分析。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Jan-Feb;10(1):230-5. doi: 10.1109/TCBB.2012.166.
9
Human spermatogenic failure purges deleterious mutation load from the autosomes and both sex chromosomes, including the gene DMRT1.人类精子发生失败会从常染色体和两条性染色体(包括 DMRT1 基因)中清除有害的突变负荷。
PLoS Genet. 2013 Mar;9(3):e1003349. doi: 10.1371/journal.pgen.1003349. Epub 2013 Mar 21.
10
Copy number variation signature to predict human ancestry.拷贝数变异特征预测人类起源。
BMC Bioinformatics. 2012 Dec 27;13:336. doi: 10.1186/1471-2105-13-336.

本文引用的文献

1
Adjustment of genomic waves in signal intensities from whole-genome SNP genotyping platforms.全基因组SNP基因分型平台信号强度中基因组波的调整。
Nucleic Acids Res. 2008 Nov;36(19):e126. doi: 10.1093/nar/gkn556. Epub 2008 Sep 10.
2
Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.单核苷酸多态性(SNPs)、常见拷贝数多态性和罕见拷贝数变异(CNVs)的整合基因型分型与关联分析。
Nat Genet. 2008 Oct;40(10):1253-60. doi: 10.1038/ng.237. Epub 2008 Sep 7.
3
Integrated detection and population-genetic analysis of SNPs and copy number variation.单核苷酸多态性(SNPs)与拷贝数变异的综合检测及群体遗传分析
Nat Genet. 2008 Oct;40(10):1166-74. doi: 10.1038/ng.238. Epub 2008 Sep 7.
4
The fine-scale and complex architecture of human copy-number variation.人类拷贝数变异的精细尺度与复杂结构。
Am J Hum Genet. 2008 Mar;82(3):685-95. doi: 10.1016/j.ajhg.2007.12.010. Epub 2008 Jan 24.
5
ITALICS: an algorithm for normalization and DNA copy number calling for Affymetrix SNP arrays.斜体:一种用于Affymetrix SNP阵列归一化和DNA拷贝数调用的算法。
Bioinformatics. 2008 Mar 15;24(6):768-74. doi: 10.1093/bioinformatics/btn048. Epub 2008 Feb 5.
6
Estimation and assessment of raw copy numbers at the single locus level.单基因座水平上原始拷贝数的估计与评估。
Bioinformatics. 2008 Mar 15;24(6):759-67. doi: 10.1093/bioinformatics/btn016. Epub 2008 Jan 19.
7
Sparse representation and Bayesian detection of genome copy number alterations from microarray data.基于微阵列数据的基因组拷贝数变异的稀疏表示与贝叶斯检测
Bioinformatics. 2008 Feb 1;24(3):309-18. doi: 10.1093/bioinformatics/btm601. Epub 2008 Jan 18.
8
Breaking the waves: improved detection of copy number variation from microarray-based comparative genomic hybridization.突破浪潮:基于微阵列比较基因组杂交技术提高拷贝数变异检测
Genome Biol. 2007;8(10):R228. doi: 10.1186/gb-2007-8-10-r228.
9
Global variation in copy number in the human genome.人类基因组中拷贝数的全球变异。
Nature. 2006 Nov 23;444(7118):444-54. doi: 10.1038/nature05329.
10
Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays.使用高密度DNA寡核苷酸阵列进行全基因组人类拷贝数变异检测。
Genome Res. 2006 Dec;16(12):1575-84. doi: 10.1101/gr.5629106. Epub 2006 Nov 22.