• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于检测人类基因组中拷贝数变异的新型信号处理方法。

A novel signal processing approach for the detection of copy number variations in the human genome.

机构信息

Department of Radiology, Harvard School of Public Health, Boston, MA 02115, USA.

出版信息

Bioinformatics. 2011 Sep 1;27(17):2338-45. doi: 10.1093/bioinformatics/btr402. Epub 2011 Jul 12.

DOI:10.1093/bioinformatics/btr402
PMID:21752800
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3157926/
Abstract

MOTIVATION

Human genomic variability occurs at different scales, from single nucleotide polymorphisms (SNPs) to large DNA segments. Copy number variations (CNVs) represent a significant part of our genetic heterogeneity and have also been associated with many diseases and disorders. Short, localized CNVs, which may play an important role in human disease, may be undetectable in noisy genomic data. Therefore, robust methodologies are needed for their detection. Furthermore, for meaningful identification of pathological CNVs, estimation of normal allelic aberrations is necessary.

RESULTS

We developed a signal processing-based methodology for sequence denoising followed by pattern matching, to increase SNR in genomic data and improve CNV detection. We applied this signal-decomposition-matched filtering (SDMF) methodology to 429 normal genomic sequences, and compared detected CNVs to those in the Database of Genomic Variants. SDMF successfully detected a significant number of previously identified CNVs with frequencies of occurrence ≥10%, as well as unreported short CNVs. Its performance was also compared to circular binary segmentation (CBS). through simulations. SDMF had a significantly lower false detection rate and was significantly faster than CBS, an important advantage for handling large datasets generated with high-resolution arrays. By focusing on improving SNR (instead of the robustness of the detection algorithm), SDMF is a very promising methodology for identifying CNVs at all genomic spatial scales.

AVAILABILITY

The data are available at http://tcga-data.nci.nih.gov/tcga/ The software and list of analyzed sequence IDs are available at http://www.hsph.harvard.edu/~betensky/ A Matlab code for Empirical Mode Decomposition may be found at: http://www.clear.rice.edu/elec301/Projects02/empiricalMode/code.html

CONTACT

caterina@mit.edu.

摘要

动机

人类基因组的变异性发生在不同的尺度上,从单核苷酸多态性 (SNP) 到大片段 DNA。拷贝数变异 (CNV) 代表了我们遗传异质性的重要组成部分,也与许多疾病和障碍有关。短的、局部的 CNV 可能在人类疾病中发挥重要作用,但在嘈杂的基因组数据中可能无法检测到。因此,需要稳健的方法来检测它们。此外,为了对病理性 CNV 进行有意义的识别,需要估计正常等位基因的异常。

结果

我们开发了一种基于信号处理的序列去噪方法,然后进行模式匹配,以提高基因组数据的信噪比,从而提高 CNV 检测的准确性。我们将这种信号分解匹配滤波 (SDMF) 方法应用于 429 个正常基因组序列,并将检测到的 CNV 与基因组变异数据库中的 CNV 进行比较。SDMF 成功地检测到了大量以前已确定的、出现频率≥10%的 CNV,以及未报告的短 CNV。它的性能也与循环二进制分割 (CBS) 进行了比较。通过模拟。SDMF 的假阳性率显著降低,并且比 CBS 快得多,这对于处理使用高分辨率阵列生成的大型数据集来说是一个重要的优势。通过专注于提高信噪比 (而不是检测算法的稳健性),SDMF 是一种非常有前途的方法,可以在所有基因组空间尺度上识别 CNV。

可用性

数据可在 http://tcga-data.nci.nih.gov/tcga/ 获得。软件和分析序列 ID 列表可在 http://www.hsph.harvard.edu/~betensky/ 获得。经验模态分解的 Matlab 代码可在以下网址找到:http://www.clear.rice.edu/elec301/Projects02/empiricalMode/code.html

联系方式

caterina@mit.edu.

相似文献

1
A novel signal processing approach for the detection of copy number variations in the human genome.一种用于检测人类基因组中拷贝数变异的新型信号处理方法。
Bioinformatics. 2011 Sep 1;27(17):2338-45. doi: 10.1093/bioinformatics/btr402. Epub 2011 Jul 12.
2
Optimization of Signal Decomposition Matched Filtering (SDMF) for Improved Detection of Copy-Number Variations.用于改进拷贝数变异检测的信号分解匹配滤波(SDMF)优化
IEEE/ACM Trans Comput Biol Bioinform. 2016 May-Jun;13(3):584-91. doi: 10.1109/TCBB.2015.2448077.
3
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
4
Modified screening and ranking algorithm for copy number variation detection.用于拷贝数变异检测的改进筛选与排序算法
Bioinformatics. 2015 May 1;31(9):1341-8. doi: 10.1093/bioinformatics/btu850. Epub 2014 Dec 25.
5
Copy number variations in the genome of the Qatari population.卡塔尔人群基因组中的拷贝数变异
BMC Genomics. 2015 Oct 22;16:834. doi: 10.1186/s12864-015-1991-5.
6
Robust regression analysis of copy number variation data based on a univariate score.基于单变量评分的拷贝数变异数据稳健回归分析。
PLoS One. 2014 Feb 7;9(2):e86272. doi: 10.1371/journal.pone.0086272. eCollection 2014.
7
A multi-sample based method for identifying common CNVs in normal human genomic structure using high-resolution aCGH data.基于多样本的方法,利用高分辨率 aCGH 数据识别正常人类基因组结构中的常见 CNV。
PLoS One. 2011;6(10):e26975. doi: 10.1371/journal.pone.0026975. Epub 2011 Oct 31.
8
Integrating genomic correlation structure improves copy number variations detection.整合基因组相关结构可改善拷贝数变异检测。
Bioinformatics. 2021 Apr 20;37(3):312-317. doi: 10.1093/bioinformatics/btaa737.
9
An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays.一种使用新一代群体优化人类阵列分析杂交强度和基因型的综合分析工具。
BMC Genomics. 2016 Mar 31;17:266. doi: 10.1186/s12864-016-2478-8.
10
Exploiting sequence similarity to validate the sensitivity of SNP arrays in detecting fine-scaled copy number variations.利用序列相似性验证 SNP 芯片检测精细尺度拷贝数变异的灵敏度。
Bioinformatics. 2010 Apr 15;26(8):1007-14. doi: 10.1093/bioinformatics/btq088. Epub 2010 Feb 25.

引用本文的文献

1
Fuzzy methods for the detection of copy number variations in comparative genomic hybridization arrays.用于检测比较基因组杂交阵列中拷贝数变异的模糊方法。
Saudi J Biol Sci. 2020 Dec;27(12):3647-3654. doi: 10.1016/j.sjbs.2020.08.007. Epub 2020 Aug 13.
2
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
3
Preprocessing Sequence Coverage Data for More Precise Detection of Copy Number Variations.预处理序列覆盖数据以更精确地检测拷贝数变异。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):868-876. doi: 10.1109/TCBB.2018.2869738. Epub 2018 Sep 12.
4
A gene interaction network‑based method to measure the common and heterogeneous mechanisms of gynecological cancer.基于基因交互网络的方法来测量妇科癌症的共同和异质机制。
Mol Med Rep. 2018 Jul;18(1):230-242. doi: 10.3892/mmr.2018.8961. Epub 2018 May 3.
5
Neuronal networks in the developing brain are adversely modulated by early psychosocial neglect.发育中的大脑中的神经网络会受到早期心理社会忽视的不利调节。
J Neurophysiol. 2017 Oct 1;118(4):2275-2288. doi: 10.1152/jn.00014.2017. Epub 2017 Jul 5.
6
Optimization of Signal Decomposition Matched Filtering (SDMF) for Improved Detection of Copy-Number Variations.用于改进拷贝数变异检测的信号分解匹配滤波(SDMF)优化
IEEE/ACM Trans Comput Biol Bioinform. 2016 May-Jun;13(3):584-91. doi: 10.1109/TCBB.2015.2448077.
7
Early Psychosocial Neglect Adversely Impacts Developmental Trajectories of Brain Oscillations and Their Interactions.早期社会心理忽视对脑振荡的发育轨迹及其相互作用产生不利影响。
J Cogn Neurosci. 2015 Dec;27(12):2512-28. doi: 10.1162/jocn_a_00877. Epub 2015 Sep 9.
8
Function of cancer associated genes revealed by modern univariate and multivariate association tests.现代单变量和多变量关联测试揭示的癌症相关基因的功能
PLoS One. 2015 May 12;10(5):e0126544. doi: 10.1371/journal.pone.0126544. eCollection 2015.
9
Identification of under-detected periodicity in time-series microarray data by using empirical mode decomposition.利用经验模态分解识别时间序列微阵列数据中未被检测到的周期性
PLoS One. 2014 Nov 5;9(11):e111719. doi: 10.1371/journal.pone.0111719. eCollection 2014.
10
Use of autocorrelation scanning in DNA copy number analysis.使用自相关扫描进行 DNA 拷贝数分析。
Bioinformatics. 2013 Nov 1;29(21):2678-82. doi: 10.1093/bioinformatics/btt479. Epub 2013 Sep 16.

本文引用的文献

1
Statistical issues in the analysis of DNA Copy Number Variations.DNA拷贝数变异分析中的统计学问题
Int J Comput Biol Drug Des. 2008;1(4):368-95. doi: 10.1504/IJCBDD.2008.022208.
2
Copy number variation in human health, disease, and evolution.人类健康、疾病与进化中的拷贝数变异
Annu Rev Genomics Hum Genet. 2009;10:451-81. doi: 10.1146/annurev.genom.9.081307.164217.
3
Smoothing waves in array CGH tumor profiles.平滑阵列比较基因组杂交肿瘤图谱中的波形。
Bioinformatics. 2009 May 1;25(9):1099-104. doi: 10.1093/bioinformatics/btp132. Epub 2009 Mar 10.
4
Adjustment of genomic waves in signal intensities from whole-genome SNP genotyping platforms.全基因组SNP基因分型平台信号强度中基因组波的调整。
Nucleic Acids Res. 2008 Nov;36(19):e126. doi: 10.1093/nar/gkn556. Epub 2008 Sep 10.
5
Integrated detection and population-genetic analysis of SNPs and copy number variation.单核苷酸多态性(SNPs)与拷贝数变异的综合检测及群体遗传分析
Nat Genet. 2008 Oct;40(10):1166-74. doi: 10.1038/ng.238. Epub 2008 Sep 7.
6
On the frequency of copy number variants.关于拷贝数变异的频率。
Bioinformatics. 2008 Oct 15;24(20):2350-5. doi: 10.1093/bioinformatics/btn421. Epub 2008 Aug 8.
7
The fine-scale and complex architecture of human copy-number variation.人类拷贝数变异的精细尺度与复杂结构。
Am J Hum Genet. 2008 Mar;82(3):685-95. doi: 10.1016/j.ajhg.2007.12.010. Epub 2008 Jan 24.
8
Genotype, haplotype and copy-number variation in worldwide human populations.全球人类群体中的基因型、单倍型和拷贝数变异。
Nature. 2008 Feb 21;451(7181):998-1003. doi: 10.1038/nature06742.
9
Sparse representation and Bayesian detection of genome copy number alterations from microarray data.基于微阵列数据的基因组拷贝数变异的稀疏表示与贝叶斯检测
Bioinformatics. 2008 Feb 1;24(3):309-18. doi: 10.1093/bioinformatics/btm601. Epub 2008 Jan 18.
10
Assessing the significance of chromosomal aberrations in cancer: methodology and application to glioma.评估癌症中染色体畸变的意义:方法及在胶质瘤中的应用
Proc Natl Acad Sci U S A. 2007 Dec 11;104(50):20007-12. doi: 10.1073/pnas.0710052104. Epub 2007 Dec 6.