Breaking the waves: improved detection of copy number variation from microarray-based comparative genomic hybridization.

Authors

Marioni John C, Thorne Natalie P, Valsesia Armand, Fitzgerald Tomas, Redon Richard, Fiegler Heike, Andrews T Daniel, Stranger Barbara E, Lynch Andrew G, Dermitzakis Emmanouil T, Carter Nigel P, Tavaré Simon, Hurles Matthew E

Affiliation

Computational Biology Group, Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Centre for Mathematical Sciences, Wilberforce Road, Cambridge CB3 0WA, UK.

Publication information

Genome Biol. 2007;8(10):R228. doi: 10.1186/gb-2007-8-10-r228.

DOI: 10.1186/gb-2007-8-10-r228
PMID: 17961237
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2246302/

Abstract

BACKGROUND

Large-scale high throughput studies using microarray technology have established that copy number variation (CNV) throughout the genome is more frequent than previously thought. Such variation is known to play an important role in the presence and development of phenotypes such as HIV-1 infection and Alzheimer's disease. However, methods for analyzing the complex data produced and identifying regions of CNV are still being refined.

RESULTS

We describe the presence of a genome-wide technical artifact, spatial autocorrelation or 'wave', which occurs in a large dataset used to determine the location of CNV across the genome. By removing this artifact we are able to obtain both a more biologically meaningful clustering of the data and an increase in the number of CNVs identified by current calling methods without a major increase in the number of false positives detected. Moreover, removing this artifact is critical for the development of a novel model-based CNV calling algorithm - CNVmix - that uses cross-sample information to identify regions of the genome where CNVs occur. For regions of CNV that are identified by both CNVmix and current methods, we demonstrate that CNVmix is better able to categorize samples into groups that represent copy number gains or losses.
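
As a rough illustration of what a 'wave' looks like numerically, the sketch below simulates one log2-ratio profile with a slow sinusoidal trend, measures its lag-1 spatial autocorrelation, and then subtracts a running-median trend. This is not the procedure used in the paper: the running median is swapped in as a simple smoother, and the function names, window size, and simulation parameters are all assumptions made for the example.

```python
import math
import random
import statistics


def lag1_autocorrelation(values):
    """Correlation between each probe and its right-hand neighbour."""
    mean = statistics.fmean(values)
    num = sum((a - mean) * (b - mean) for a, b in zip(values, values[1:]))
    den = sum((v - mean) ** 2 for v in values)
    return num / den


def running_median(values, half_window):
    """Median over a centred window, truncated at the ends of the profile."""
    n = len(values)
    return [
        statistics.median(values[max(0, i - half_window):min(n, i + half_window + 1)])
        for i in range(n)
    ]


def remove_wave(log2_ratios, half_window=25):
    """Subtract a smooth trend (assumed here to be the 'wave') from a profile."""
    trend = running_median(log2_ratios, half_window)
    return [x - t for x, t in zip(log2_ratios, trend)]


def simulate_profile(n_probes=1000, amplitude=0.2, period=200, noise_sd=0.1, seed=0):
    """Hypothetical profile: sinusoidal wave plus Gaussian noise, no real CNVs."""
    rng = random.Random(seed)
    return [
        amplitude * math.sin(2 * math.pi * i / period) + rng.gauss(0, noise_sd)
        for i in range(n_probes)
    ]


profile = simulate_profile()
corrected = remove_wave(profile)
before = lag1_autocorrelation(profile)
after = lag1_autocorrelation(corrected)
print(f"lag-1 autocorrelation before: {before:.2f}, after: {after:.2f}")
```

A smoother like this would also flatten any genuine CNV wider than the window, so the window is assumed to be much wider than the variants of interest.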

CONCLUSION

Removing artifactual 'waves' (which appear to be a general feature of array comparative genomic hybridization (aCGH) datasets) and using cross-sample information when identifying CNVs enables more biological information to be extracted from aCGH experiments designed to investigate copy number variation in normal individuals.
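
The idea of using cross-sample information can be sketched with a small mixture model. The abstract does not give the details of CNVmix, so the example below is not that algorithm: it assumes a single clone measured across many samples, fits a three-component Gaussian mixture with a shared variance by EM, and assigns each sample to a 'loss', 'neutral' or 'gain' group. The state names, component count, and simulated values are all hypothetical.

```python
import math
import random
import statistics

STATE_NAMES = ("loss", "neutral", "gain")  # illustrative labels, not from the paper


def simulate_clone(seed=1):
    """Made-up log2 ratios for one clone across 90 samples."""
    rng = random.Random(seed)
    design = (("loss", -0.4, 20), ("neutral", 0.0, 55), ("gain", 0.35, 15))
    values, truth = [], []
    for name, mean, count in design:
        for _ in range(count):
            values.append(rng.gauss(mean, 0.06))
            truth.append(name)
    return values, truth


def fit_mixture(values, n_iter=200):
    """EM for a three-component Gaussian mixture with one shared variance."""
    n = len(values)
    means = [min(values), statistics.median(values), max(values)]
    weights = [1.0 / 3.0] * 3
    variance = statistics.pvariance(values)
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each sample.
        # The Gaussian normalising constant cancels because the variance is shared.
        resp = []
        for x in values:
            dens = [w * math.exp(-((x - m) ** 2) / (2.0 * variance))
                    for w, m in zip(weights, means)]
            total = sum(dens)
            resp.append([d / total for d in dens])
        # M-step: update weights, means and the shared variance.
        counts = [sum(r[k] for r in resp) for k in range(3)]
        weights = [c / n for c in counts]
        means = [sum(r[k] * x for r, x in zip(resp, values)) / counts[k]
                 for k in range(3)]
        variance = sum(r[k] * (x - means[k]) ** 2
                       for r, x in zip(resp, values) for k in range(3)) / n
    return means, resp  # resp comes from the final E-step


def call_states(means, resp):
    """Name components by the rank of their mean, then take the most probable one."""
    order = sorted(range(3), key=lambda k: means[k])
    name_of = {component: STATE_NAMES[rank] for rank, component in enumerate(order)}
    return [name_of[max(range(3), key=lambda k: r[k])] for r in resp]


values, truth = simulate_clone()
means, resp = fit_mixture(values)
states = call_states(means, resp)
accuracy = sum(s == t for s, t in zip(states, truth)) / len(truth)
print("component means:", [round(m, 2) for m in sorted(means)])
print("agreement with simulated truth:", round(accuracy, 3))
```

Pooling samples is what lets the groups separate: a single sample's ratio at this clone is only interpretable relative to where the other samples fall.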

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26a9/2246302/b2dd2b0e4542/gb-2007-8-10-r228-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26a9/2246302/0b0580a4deb6/gb-2007-8-10-r228-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26a9/2246302/307e6f263db5/gb-2007-8-10-r228-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26a9/2246302/7b6c4839a663/gb-2007-8-10-r228-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26a9/2246302/1476de70f24e/gb-2007-8-10-r228-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26a9/2246302/c64333b9bafd/gb-2007-8-10-r228-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26a9/2246302/613bfb8f3e12/gb-2007-8-10-r228-7.jpg