Department of Statistics, Virginia Tech, Blacksburg, Virginia, USA.
Stat Med. 2022 Jun 30;41(14):2557-2573. doi: 10.1002/sim.9370. Epub 2022 Mar 9.
We propose a new approach to test associations between binary trees and covariates. In this approach, binary-tree structured data are treated as sample paths of binary fission Markov branching processes (bMBP). We propose a generalized linear regression model and developed inference procedures for association testing, including variable selection and estimation of covariate effects. Simulation studies show that these procedures are able to accurately identify covariates that are associated with the binary tree structure by impacting the rate parameter of the bMBP. The problem of association testing on binary trees is motivated by modeling hierarchical clustering dendrograms of pixel intensities in biomedical images. By using semi-synthetic data generated from a real brain-tumor image, our simulation studies show that the bMBP model is able to capture the characteristics of dendrogram trees in brain-tumor images. Our final analysis of the glioblastoma multiforme brain-tumor data from The Cancer Imaging Archive identified multiple clinical and genetic variables that are potentially associated with brain-tumor heterogeneity.
我们提出了一种新的方法来检验二元树和协变量之间的关联。在这种方法中,二元树结构的数据被视为二元裂变马尔可夫分支过程(bMBP)的样本路径。我们提出了一种广义线性回归模型,并为关联检验开发了推断程序,包括变量选择和协变量效应的估计。模拟研究表明,这些程序能够通过影响 bMBP 的率参数准确识别与二元树结构相关的协变量。二元树关联检验的问题源于对生物医学图像中像素强度的层次聚类树状图的建模。通过使用从真实脑肿瘤图像生成的半合成数据,我们的模拟研究表明,bMBP 模型能够捕捉脑肿瘤图像中树状图的特征。我们对来自癌症成像档案的多形性胶质母细胞瘤脑肿瘤数据的最终分析确定了多个可能与脑肿瘤异质性相关的临床和遗传变量。