• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用于哺乳动物基因组拷贝数变异分析的贝叶斯非参数隐马尔可夫模型

Bayesian Nonparametric Hidden Markov Models with application to the analysis of copy-number-variation in mammalian genomes.

作者信息

Yau C, Papaspiliopoulos O, Roberts G O, Holmes C

机构信息

Department of Statistics and the Oxford-Man Institute for Quantitative Finance, University of Oxford,

出版信息

J R Stat Soc Series B Stat Methodol. 2011 Jan 1;73(1):37-57. doi: 10.1111/j.1467-9868.2010.00756.x.

DOI:10.1111/j.1467-9868.2010.00756.x
PMID:21687778
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3116623/
Abstract

We consider the development of Bayesian Nonparametric methods for product partition models such as Hidden Markov Models and change point models. Our approach uses a Mixture of Dirichlet Process (MDP) model for the unknown sampling distribution (likelihood) for the observations arising in each state and a computationally efficient data augmentation scheme to aid inference. The method uses novel MCMC methodology which combines recent retrospective sampling methods with the use of slice sampler variables. The methodology is computationally efficient, both in terms of MCMC mixing properties, and robustness to the length of the time series being investigated. Moreover, the method is easy to implement requiring little or no user-interaction. We apply our methodology to the analysis of genomic copy number variation.

摘要

我们考虑为诸如隐马尔可夫模型和变点模型等乘积划分模型开发贝叶斯非参数方法。我们的方法使用狄利克雷过程混合(MDP)模型来处理每个状态下观测值的未知抽样分布(似然),并采用一种计算效率高的数据增强方案来辅助推理。该方法使用了新颖的马尔可夫链蒙特卡罗(MCMC)方法,该方法将最近的回顾性抽样方法与切片采样器变量的使用相结合。该方法在计算效率方面表现出色,无论是在MCMC混合特性方面,还是对所研究时间序列长度的稳健性方面。此外,该方法易于实现,几乎不需要用户交互。我们将我们的方法应用于基因组拷贝数变异的分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/238942526aa7/ukmss-35695-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/38d79877c41b/ukmss-35695-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/0102c8377949/ukmss-35695-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/2abc492e20c2/ukmss-35695-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/34a6c3f9a065/ukmss-35695-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/83368c0b961c/ukmss-35695-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/94d696fa468c/ukmss-35695-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/50659e0118f6/ukmss-35695-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/238942526aa7/ukmss-35695-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/38d79877c41b/ukmss-35695-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/0102c8377949/ukmss-35695-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/2abc492e20c2/ukmss-35695-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/34a6c3f9a065/ukmss-35695-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/83368c0b961c/ukmss-35695-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/94d696fa468c/ukmss-35695-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/50659e0118f6/ukmss-35695-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc1/3116623/238942526aa7/ukmss-35695-f0008.jpg

相似文献

1
Bayesian Nonparametric Hidden Markov Models with application to the analysis of copy-number-variation in mammalian genomes.应用于哺乳动物基因组拷贝数变异分析的贝叶斯非参数隐马尔可夫模型
J R Stat Soc Series B Stat Methodol. 2011 Jan 1;73(1):37-57. doi: 10.1111/j.1467-9868.2010.00756.x.
2
Sampling from Dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations.从具有未知浓度参数的狄利克雷过程混合模型中进行采样:大数据实现中的混合问题。
Stat Comput. 2015;25(5):1023-1037. doi: 10.1007/s11222-014-9471-3. Epub 2014 May 3.
3
Scalable Bayesian Inference for Coupled Hidden Markov and Semi-Markov Models.耦合隐马尔可夫模型和半马尔可夫模型的可扩展贝叶斯推理
J Comput Graph Stat. 2019 Sep 18;29(2):238-249. doi: 10.1080/10618600.2019.1654880. eCollection 2020.
4
A Bayesian nonparametric approach for uncovering rat hippocampal population codes during spatial navigation.一种用于揭示空间导航过程中大鼠海马体群体编码的贝叶斯非参数方法。
J Neurosci Methods. 2016 Apr 1;263:36-47. doi: 10.1016/j.jneumeth.2016.01.022. Epub 2016 Feb 5.
5
Inference of regulatory networks with a convergence improved MCMC sampler.使用收敛性改进的马尔可夫链蒙特卡罗采样器推断调控网络。
BMC Bioinformatics. 2015 Sep 24;16:306. doi: 10.1186/s12859-015-0734-6.
6
Identifying the Recurrence of Sleep Apnea Using A Harmonic Hidden Markov Model.使用谐波隐马尔可夫模型识别睡眠呼吸暂停的复发情况。
Ann Appl Stat. 2021 Sep;15(3):1171-1193. doi: 10.1214/21-AOAS1455.
7
Generalized species sampling priors with latent Beta reinforcements.具有潜在贝塔增强的广义物种抽样先验。
J Am Stat Assoc. 2014 Dec 1;109(508):1466-1480. doi: 10.1080/01621459.2014.950735.
8
Fast Bayesian Inference in Dirichlet Process Mixture Models.狄利克雷过程混合模型中的快速贝叶斯推理
J Comput Graph Stat. 2011 Jan 1;20(1). doi: 10.1198/jcgs.2010.07081.
9
A menu-driven software package of Bayesian nonparametric (and parametric) mixed models for regression analysis and density estimation.一个用于回归分析和密度估计的贝叶斯非参数(和参数)混合模型的菜单驱动软件包。
Behav Res Methods. 2017 Feb;49(1):335-362. doi: 10.3758/s13428-016-0711-7.
10
Laplacian-P-splines for Bayesian inference in the mixture cure model.拉普拉斯样条在混合治愈模型中贝叶斯推断的应用。
Stat Med. 2022 Jun 30;41(14):2602-2626. doi: 10.1002/sim.9373. Epub 2022 Mar 14.

引用本文的文献

1
A semiparametric Bayesian model for comparing DNA copy numbers.一种用于比较DNA拷贝数的半参数贝叶斯模型。
Braz J Probab Stat. 2016 Aug;30(3):345-365. doi: 10.1214/15-bjps283. Epub 2016 Jul 29.
2
Identifying the Recurrence of Sleep Apnea Using A Harmonic Hidden Markov Model.使用谐波隐马尔可夫模型识别睡眠呼吸暂停的复发情况。
Ann Appl Stat. 2021 Sep;15(3):1171-1193. doi: 10.1214/21-AOAS1455.
3
Uncovering ecological state dynamics with hidden Markov models.利用隐马尔可夫模型揭示生态状态动态。

本文引用的文献

1
A segmental maximum a posteriori approach to genome-wide copy number profiling.一种用于全基因组拷贝数分析的分段最大后验概率方法。
Bioinformatics. 2008 Mar 15;24(6):751-8. doi: 10.1093/bioinformatics/btn003. Epub 2008 Jan 19.
2
QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data.QuantiSNP:一种使用单核苷酸多态性(SNP)基因分型数据来检测和精确绘制拷贝数变异图谱的客观贝叶斯隐马尔可夫模型。
Nucleic Acids Res. 2007;35(6):2013-25. doi: 10.1093/nar/gkm076. Epub 2007 Mar 6.
3
Continuous-index hidden Markov modelling of array CGH copy number data.
Ecol Lett. 2020 Dec;23(12):1878-1903. doi: 10.1111/ele.13610. Epub 2020 Oct 19.
4
A hidden Markov modeling approach for identifying tumor subclones in next-generation sequencing studies.一种用于鉴定下一代测序研究中肿瘤亚克隆的隐马尔可夫模型方法。
Biostatistics. 2022 Jan 13;23(1):69-82. doi: 10.1093/biostatistics/kxaa013.
5
Bayesian adaptive group lasso with semiparametric hidden Markov models.贝叶斯自适应分组 lasso 与半参数隐马尔可夫模型。
Stat Med. 2019 Apr 30;38(9):1634-1650. doi: 10.1002/sim.8051. Epub 2018 Nov 28.
6
An Introduction to Infinite HMMs for Single-Molecule Data Analysis.用于单分子数据分析的无限隐马尔可夫模型简介。
Biophys J. 2017 May 23;112(10):2021-2029. doi: 10.1016/j.bpj.2017.04.027.
7
PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes.PReMiuM:一个使用狄利克雷过程的轮廓回归混合模型的R包。
J Stat Softw. 2015 Mar 20;64(7):1-30. doi: 10.18637/jss.v064.i07.
8
A Bayesian nonparametric approach for uncovering rat hippocampal population codes during spatial navigation.一种用于揭示空间导航过程中大鼠海马体群体编码的贝叶斯非参数方法。
J Neurosci Methods. 2016 Apr 1;263:36-47. doi: 10.1016/j.jneumeth.2016.01.022. Epub 2016 Feb 5.
9
Prior Design for Dependent Dirichlet Processes: An Application to Marathon Modeling.相依狄利克雷过程的先验设计:在马拉松建模中的应用
PLoS One. 2016 Jan 28;11(1):e0147402. doi: 10.1371/journal.pone.0147402. eCollection 2016.
10
Sampling from Dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations.从具有未知浓度参数的狄利克雷过程混合模型中进行采样:大数据实现中的混合问题。
Stat Comput. 2015;25(5):1023-1037. doi: 10.1007/s11222-014-9471-3. Epub 2014 May 3.
阵列比较基因组杂交拷贝数数据的连续索引隐马尔可夫模型
Bioinformatics. 2007 Apr 15;23(8):1006-14. doi: 10.1093/bioinformatics/btm059. Epub 2007 Feb 19.
4
Exploiting noise in array CGH data to improve detection of DNA copy number change.利用阵列比较基因组杂交数据中的噪声来改善DNA拷贝数变化的检测。
Nucleic Acids Res. 2007;35(5):e35. doi: 10.1093/nar/gkl730. Epub 2007 Feb 1.
5
Integrating copy number polymorphisms into array CGH analysis using a robust HMM.使用稳健的隐马尔可夫模型将拷贝数多态性整合到阵列比较基因组杂交分析中。
Bioinformatics. 2006 Jul 15;22(14):e431-9. doi: 10.1093/bioinformatics/btl238.
6
Mouse genomic representational oligonucleotide microarray analysis: detection of copy number variations in normal and tumor specimens.小鼠基因组代表性寡核苷酸微阵列分析:检测正常和肿瘤标本中的拷贝数变异
Proc Natl Acad Sci U S A. 2006 Jul 25;103(30):11234-9. doi: 10.1073/pnas.0602984103. Epub 2006 Jul 14.
7
BioHMM: a heterogeneous hidden Markov model for segmenting array CGH data.BioHMM:一种用于分割阵列比较基因组杂交数据的异构隐马尔可夫模型。
Bioinformatics. 2006 May 1;22(9):1144-6. doi: 10.1093/bioinformatics/btl089. Epub 2006 Mar 13.