• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重尾噪声抑制和导数小波标度图在检测 DNA 拷贝数异常中的应用。

Heavy-Tailed Noise Suppression and Derivative Wavelet Scalogram for Detecting DNA Copy Number Aberrations.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1625-1635. doi: 10.1109/TCBB.2017.2723884. Epub 2017 Jul 6.

DOI:10.1109/TCBB.2017.2723884
PMID:28692986
Abstract

Most existing array comparative genomic hybridization (array CGH) data processing methods and evaluation models assumed that the probability density function (pdf) of noise in array CGH data is a Gaussian distribution. However, in practice, such noise distribution is peaky and heavy-tailed. Therefore, a Gaussian pdf is not adequate to approximate the noise in array CGH data and hence introduces wrong detections of chromosomal aberrations and leads misunderstanding on disease pathogenesis. A more accurate and sufficient model of noise in array CGH data is necessary and beneficial to the detection of DNA copy number variations. We analyze the real array CGH data from different platforms and show that the distribution of noise in array CGH data is fitted very well by generalized Gaussian distribution (GGD). Based on our new noise model, we propose a novel array CGH processing method combining the advantages of both the smoothing and segmentation approaches. The new method uses generalized Gaussian bivariate shrinkage function and one-directional derivative wavelet scalogram in generalized Gaussian noise. In the smoothing step, with the new generalized Gaussian noise model, we derive the heavy-tailed noise suppression algorithm in stationary wavelet domain. In the segmentation step, the 1D Gaussian derivative wavelet scalogram is employed to detect break points. Both real and simulated array CGH data with different noises (such as Gaussian noise, GGD noise, and real noise) are used in our experiments. We demonstrate that our new method outperforms other state-of-the-art methods, in terms of both root mean squared errors and receiver operating characteristic curves.

摘要

大多数现有的阵列比较基因组杂交(array CGH)数据处理方法和评估模型都假设 array CGH 数据中的噪声概率密度函数(pdf)是高斯分布。然而,在实践中,这种噪声分布是峰值和重尾的。因此,高斯 pdf 不足以近似 array CGH 数据中的噪声,从而导致染色体异常的错误检测,并导致对疾病发病机制的误解。array CGH 数据中噪声的更准确和充分的模型是必要的,并且有利于检测 DNA 拷贝数变异。我们分析了来自不同平台的真实 array CGH 数据,并表明 array CGH 数据中的噪声分布非常适合广义高斯分布(GGD)。基于我们的新噪声模型,我们提出了一种新的 array CGH 处理方法,结合了平滑和分割方法的优点。该新方法使用广义高斯双变量收缩函数和广义高斯噪声中的单向导数小波谱图。在平滑步骤中,使用新的广义高斯噪声模型,我们推导出了平稳小波域中重尾噪声抑制算法。在分割步骤中,使用 1D 高斯导数小波谱图来检测断点。我们在实验中使用了不同噪声(如高斯噪声、GGD 噪声和真实噪声)的真实和模拟 array CGH 数据。结果表明,我们的新方法在均方根误差和接收器工作特征曲线方面都优于其他最先进的方法。

相似文献

1
Heavy-Tailed Noise Suppression and Derivative Wavelet Scalogram for Detecting DNA Copy Number Aberrations.重尾噪声抑制和导数小波标度图在检测 DNA 拷贝数异常中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1625-1635. doi: 10.1109/TCBB.2017.2723884. Epub 2017 Jul 6.
2
Stationary wavelet packet transform and dependent laplacian bivariate shrinkage estimator for array-CGH data smoothing.用于阵列比较基因组杂交(array-CGH)数据平滑的平稳小波包变换和相关拉普拉斯双变量收缩估计器
J Comput Biol. 2010 Feb;17(2):139-52. doi: 10.1089/cmb.2009.0013.
3
Array CGH data modeling and smoothing in Stationary Wavelet Packet Transform domain.基于平稳小波包变换域的阵列比较基因组杂交数据建模与平滑处理
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S17. doi: 10.1186/1471-2164-9-S2-S17.
4
Exploiting noise in array CGH data to improve detection of DNA copy number change.利用阵列比较基因组杂交数据中的噪声来改善DNA拷贝数变化的检测。
Nucleic Acids Res. 2007;35(5):e35. doi: 10.1093/nar/gkl730. Epub 2007 Feb 1.
5
The use of ultra-dense array CGH analysis for the discovery of micro-copy number alterations and gene fusions in the cancer genome.超高密度阵列 CGH 分析在癌症基因组中发现微小拷贝数改变和基因融合。
BMC Med Genomics. 2011 Jan 27;4:16. doi: 10.1186/1755-8794-4-16.
6
Optimization of Signal Decomposition Matched Filtering (SDMF) for Improved Detection of Copy-Number Variations.用于改进拷贝数变异检测的信号分解匹配滤波(SDMF)优化
IEEE/ACM Trans Comput Biol Bioinform. 2016 May-Jun;13(3):584-91. doi: 10.1109/TCBB.2015.2448077.
7
Comparative Genomic Hybridization (CGH) in Genotoxicology.
Methods Mol Biol. 2019;2031:209-234. doi: 10.1007/978-1-4939-9646-9_11.
8
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
9
Identification of De Novo and Rare Inherited Copy Number Variants in Children with Syndromic Congenital Heart Defects.患有综合征型先天性心脏病儿童中从头和罕见遗传性拷贝数变异的鉴定
Pediatr Cardiol. 2018 Jun;39(5):924-940. doi: 10.1007/s00246-018-1842-7. Epub 2018 Mar 14.
10
Comparative genomic hybridization to detect variation in the copy number of large DNA segments.比较基因组杂交用于检测大DNA片段拷贝数的变异。
Cold Spring Harb Protoc. 2011 Nov 1;2011(11):1323-33. doi: 10.1101/pdb.top066589.

引用本文的文献

1
TBscreen: A passive cough classifier for tuberculosis screening with a controlled dataset.TBscreen:一个使用受控数据集的结核病筛查的被动咳嗽分类器。
Sci Adv. 2024 Jan 5;10(1):eadi0282. doi: 10.1126/sciadv.adi0282. Epub 2024 Jan 3.