Suppr超能文献

基于行列式点过程(DPP)的潜在生物结构的贝叶斯推断。

Bayesian inference for latent biologic structure with determinantal point processes (DPP).

作者信息

Xu Yanxun, Müller Peter, Telesca Donatello

机构信息

Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, Texas, U.S.A..

Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, Maryland, U.S.A..

出版信息

Biometrics. 2016 Sep;72(3):955-64. doi: 10.1111/biom.12482. Epub 2016 Feb 12.

Abstract

We discuss the use of the determinantal point process (DPP) as a prior for latent structure in biomedical applications, where inference often centers on the interpretation of latent features as biologically or clinically meaningful structure. Typical examples include mixture models, when the terms of the mixture are meant to represent clinically meaningful subpopulations (of patients, genes, etc.). Another class of examples are feature allocation models. We propose the DPP prior as a repulsive prior on latent mixture components in the first example, and as prior on feature-specific parameters in the second case. We argue that the DPP is in general an attractive prior model for latent structure when biologically relevant interpretation of such structure is desired. We illustrate the advantages of DPP prior in three case studies, including inference in mixture models for magnetic resonance images (MRI) and for protein expression, and a feature allocation model for gene expression using data from The Cancer Genome Atlas. An important part of our argument are efficient and straightforward posterior simulation methods. We implement a variation of reversible jump Markov chain Monte Carlo simulation for inference under the DPP prior, using a density with respect to the unit rate Poisson process.

摘要

我们讨论了行列式点过程(DPP)作为生物医学应用中潜在结构的先验分布的使用情况,在这些应用中,推理通常集中于将潜在特征解释为具有生物学或临床意义的结构。典型的例子包括混合模型,其中混合项旨在表示具有临床意义的亚群(患者、基因等)。另一类例子是特征分配模型。在第一个例子中,我们提出将DPP先验作为潜在混合成分上的排斥先验;在第二种情况下,作为特征特定参数上的先验。我们认为,当需要对这种结构进行生物学相关解释时,DPP总体上是一种有吸引力的潜在结构先验模型。我们在三个案例研究中说明了DPP先验的优势,包括对磁共振图像(MRI)和蛋白质表达的混合模型进行推理,以及使用来自癌症基因组图谱的数据对基因表达进行特征分配模型。我们论证的一个重要部分是高效且直接的后验模拟方法。我们实现了一种可逆跳跃马尔可夫链蒙特卡罗模拟的变体,用于在DPP先验下进行推理,使用相对于单位速率泊松过程的密度。

相似文献

1
Bayesian inference for latent biologic structure with determinantal point processes (DPP).
Biometrics. 2016 Sep;72(3):955-64. doi: 10.1111/biom.12482. Epub 2016 Feb 12.
2
BAREB: A Bayesian repulsive biclustering model for periodontal data.
Stat Med. 2020 Jul 20;39(16):2139-2151. doi: 10.1002/sim.8536. Epub 2020 Apr 3.
3
A Nonparametric Multidimensional Latent Class IRT Model in a Bayesian Framework.
Psychometrika. 2017 Dec;82(4):952-978. doi: 10.1007/s11336-017-9576-7. Epub 2017 Sep 12.
4
Bayesian inference for Markov jump processes with informative observations.
Stat Appl Genet Mol Biol. 2015 Apr;14(2):169-88. doi: 10.1515/sagmb-2014-0070.
5
PHAISTOS: a framework for Markov chain Monte Carlo simulation and inference of protein structure.
J Comput Chem. 2013 Jul 15;34(19):1697-705. doi: 10.1002/jcc.23292. Epub 2013 Apr 26.
7
Bayesian semiparametric intensity estimation for inhomogeneous spatial point processes.
Biometrics. 2011 Sep;67(3):937-46. doi: 10.1111/j.1541-0420.2010.01531.x. Epub 2010 Dec 22.
8
A full bayesian approach for boolean genetic network inference.
PLoS One. 2014 Dec 31;9(12):e115806. doi: 10.1371/journal.pone.0115806. eCollection 2014.
9
Bayesian mixture modeling using a hybrid sampler with application to protein subfamily identification.
Biostatistics. 2010 Jan;11(1):18-33. doi: 10.1093/biostatistics/kxp033. Epub 2009 Aug 20.
10
Bayesian mixture models of variable dimension for image segmentation.
Comput Methods Programs Biomed. 2009 Apr;94(1):1-14. doi: 10.1016/j.cmpb.2008.05.010. Epub 2008 Nov 25.

引用本文的文献

1
Flexible Regularized Estimation in High-Dimensional Mixed Membership Models.
Comput Stat Data Anal. 2024 Jun;194. doi: 10.1016/j.csda.2024.107931. Epub 2024 Feb 9.
2
A Bayesian feature allocation model for identifying cell subpopulations using CyTOF data.
J R Stat Soc Ser C Appl Stat. 2023 Apr 25;72(3):718-738. doi: 10.1093/jrsssc/qlad029. eCollection 2023 Jun.
3
Bayesian cluster analysis.
Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220149. doi: 10.1098/rsta.2022.0149. Epub 2023 Mar 27.
4
Latent Nested Nonparametric Priors (with Discussion).
Bayesian Anal. 2019 Dec;14(4):1303-1356. doi: 10.1214/19-BA1169. Epub 2019 Jun 27.
5
BAREB: A Bayesian repulsive biclustering model for periodontal data.
Stat Med. 2020 Jul 20;39(16):2139-2151. doi: 10.1002/sim.8536. Epub 2020 Apr 3.

本文引用的文献

1
Posterior Contraction Rates of the Phylogenetic Indian Buffet Processes.
Bayesian Anal. 2016 Jun;11(2):477-497. doi: 10.1214/15-BA958. Epub 2015 Jun 5.
2
MAD Bayes for Tumor Heterogeneity - Feature Allocation with Exponential Family Sampling.
J Am Stat Assoc. 2015 Mar 1;110(510):503-514. doi: 10.1080/01621459.2014.995794.
3
Assessing the clinical utility of cancer genomic and proteomic data across tumor types.
Nat Biotechnol. 2014 Jul;32(7):644-52. doi: 10.1038/nbt.2940. Epub 2014 Jun 22.
4
Comprehensive molecular portraits of human breast tumours.
Nature. 2012 Oct 4;490(7418):61-70. doi: 10.1038/nature11412. Epub 2012 Sep 23.
5
Supervised risk predictor of breast cancer based on intrinsic subtypes.
J Clin Oncol. 2009 Mar 10;27(8):1160-7. doi: 10.1200/JCO.2008.18.1370. Epub 2009 Feb 9.
6
Method for quantification of brain, ventricular, and subarachnoid CSF volumes from MR images.
J Comput Assist Tomogr. 1992 Mar-Apr;16(2):274-84. doi: 10.1097/00004728-199203000-00018.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验