MGP-HMM: Detecting genome-wide CNVs using an HMM for modeling mate pair insertion sizes and read counts.

Authors

Malekpour Seyed Amir, Pezeshk Hamid, Sadeghi Mehdi

Affiliations

School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, Tehran, Iran.

School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, Tehran, Iran; School of Biological Sciences, Institute for Research in Fundamental Sciences, Tehran, Iran.

Publication information

Math Biosci. 2016 Sep;279:53-62. doi: 10.1016/j.mbs.2016.07.006. Epub 2016 Jul 16.

DOI: 10.1016/j.mbs.2016.07.006
PMID: 27424951

Abstract

MOTIVATION

The association of copy number variation (CNV) with schizophrenia, autism, developmental disabilities and fatal diseases such as cancer has been established. Recent developments in next-generation sequencing (NGS) have facilitated CNV studies. However, many current CNV detection tools cannot discriminate tandem duplications from non-tandem duplications.

RESULTS

In this study, we propose MGP-HMM, a tool that, besides detecting genome-wide deletions, discriminates tandem duplications from non-tandem duplications. MGP-HMM takes mate pair abnormalities into account and predicts the digitized number of tandem or non-tandem copies. Abnormalities in the directions and insertion sizes of mate pairs, after the reads are mapped to the reference genome, are modeled using a Hidden Markov Model (HMM). For this purpose, a Gaussian mixture density with time-dependent parameters is used to emit mate pair insertion sizes from the HMM states. Depending on the observed abnormalities in mate pair insertion size or orientation, each component of the mixture density has different parameters. MGP-HMM also applies a Poisson distribution to model read depth data. This parametric modeling of the mate pair reads makes it possible to estimate the length of CNVs precisely, an advantage over methods that rely only on read depth for CNV detection. The hidden state of the proposed HMM is the digitized copy number of a genomic segment, and the states correspond to the multipliers of the Gaussian mixture components. The accuracy of the model is validated on real and simulated next-generation sequencing data and compared with that of other tools.
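
The modeling idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not give the parameterization, so every name and number below (`READS_PER_COPY`, `SHIFT`, `ABNORMAL_WEIGHT`, the transition probability `stay`) is a hypothetical choice made for illustration. The sketch assumes fixed genomic windows, a Poisson read count whose mean scales with copy number, and a two-component Gaussian mixture over insert sizes whose weights depend on the hidden copy-number state. It handles only the insert-size signal of deletions and ignores mate pair orientation, so it cannot separate tandem from non-tandem duplications.

```python
import math

def log_poisson(k, lam):
    """Log-probability of k reads under a Poisson with mean lam."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)

def log_normal(x, mu, sigma):
    """Log-density of a Gaussian with mean mu and standard deviation sigma."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (x - mu) ** 2 / (2 * sigma ** 2)

# Hypothetical parameters, chosen only for illustration.
STATES = [0, 1, 2, 3]        # candidate copy numbers per window
READS_PER_COPY = 20.0        # expected read count contributed by one copy
MU, SIGMA = 3000.0, 300.0    # insert size of normally mapped mate pairs (bp)
SHIFT = 2000.0               # extra apparent insert size across a deletion
ABNORMAL_WEIGHT = {0: 0.9, 1: 0.5, 2: 0.01, 3: 0.01}  # mixture weight per state

def log_emit(state, obs):
    """Log-likelihood of one window: Poisson read count plus mixture insert sizes."""
    count, inserts = obs
    lam = max(state, 0.05) * READS_PER_COPY  # small floor keeps the mean positive
    total = log_poisson(count, lam)
    w = ABNORMAL_WEIGHT[state]
    for x in inserts:
        a = math.log(1 - w) + log_normal(x, MU, SIGMA)
        b = math.log(w) + log_normal(x, MU + SHIFT, SIGMA)
        top = max(a, b)  # log-sum-exp of the two mixture components
        total += top + math.log(math.exp(a - top) + math.exp(b - top))
    return total

def viterbi(observations, states, emit, stay=0.99):
    """Most likely state path, with uniform start and symmetric transitions."""
    n = len(states)
    log_stay = math.log(stay)
    log_switch = math.log((1 - stay) / (n - 1))
    score = [-math.log(n) + emit(s, observations[0]) for s in states]
    back = []
    for obs in observations[1:]:
        new_score, pointers = [], []
        for j, s in enumerate(states):
            cands = [score[i] + (log_stay if i == j else log_switch) for i in range(n)]
            best = max(range(n), key=cands.__getitem__)
            pointers.append(best)
            new_score.append(cands[best] + emit(s, obs))
        score = new_score
        back.append(pointers)
    idx = max(range(n), key=score.__getitem__)
    path = [idx]
    for pointers in reversed(back):  # trace back from the last window
        idx = pointers[idx]
        path.append(idx)
    path.reverse()
    return [states[i] for i in path]

# Toy data: (read count, insert sizes) per window.
normal = (40, [3000, 3050, 2950])
deleted = (20, [3000, 5000, 5050])  # half the depth, stretched mate pairs
windows = [normal] * 5 + [deleted] * 5 + [normal] * 5
path = viterbi(windows, STATES, log_emit)
print(path)
```

On this toy input, the decoded path assigns copy number 2 to the first and last five windows and copy number 1 to the middle five. The stretched insert sizes and the halved read depth both favor the single-copy state there, which is the kind of combined evidence the abstract describes.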

Similar articles

1
MGP-HMM: Detecting genome-wide CNVs using an HMM for modeling mate pair insertion sizes and read counts.
Math Biosci. 2016 Sep;279:53-62. doi: 10.1016/j.mbs.2016.07.006. Epub 2016 Jul 16.
2
PSE-HMM: genome-wide CNV detection from NGS data using an HMM with Position-Specific Emission probabilities.
BMC Bioinformatics. 2016 Nov 3;18(1):30. doi: 10.1186/s12859-016-1296-y.
3
MSeq-CNV: accurate detection of Copy Number Variation from Sequencing of Multiple samples.
Sci Rep. 2018 Mar 5;8(1):4009. doi: 10.1038/s41598-018-22323-8.
4
Copy number variation detection using next generation sequencing read counts.
BMC Bioinformatics. 2014 Apr 14;15:109. doi: 10.1186/1471-2105-15-109.
5
Quantifying copy number variations using a hidden Markov model with inhomogeneous emission distributions.
Biostatistics. 2013 Jul;14(3):600-11. doi: 10.1093/biostatistics/kxt003. Epub 2013 Feb 20.
6
Copy number variant analysis using genome-wide mate-pair sequencing.
Genes Chromosomes Cancer. 2018 Sep;57(9):459-470. doi: 10.1002/gcc.5. Epub 2018 Jul 30.
7
On the core segmentation algorithms of copy number variation detection tools.
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae022.
8
Noise cancellation using total variation for copy number variation detection.
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
9
Algorithmic improvements for discovery of germline copy number variants in next-generation sequencing data.
BMC Bioinformatics. 2022 Jul 19;23(1):285. doi: 10.1186/s12859-022-04820-w.
10
Family-Based Benchmarking of Copy Number Variation Detection Software.
PLoS One. 2015 Jul 21;10(7):e0133465. doi: 10.1371/journal.pone.0133465. eCollection 2015.

Cited by

1
iCopyDAV: Integrated platform for copy number variations-Detection, annotation and visualization.
PLoS One. 2018 Apr 5;13(4):e0195334. doi: 10.1371/journal.pone.0195334. eCollection 2018.
2
MSeq-CNV: accurate detection of Copy Number Variation from Sequencing of Multiple samples.
Sci Rep. 2018 Mar 5;8(1):4009. doi: 10.1038/s41598-018-22323-8.