Yen A M-F, Liou H-H, Lin H-L, Chen T H-H
Institute of Preventive Medicine, College of Public Health, National Taiwan University, Taipei, 100, Taiwan.
Methods Inf Med. 2006;45(6):631-7.
The study aimed to develop a predictive model to deal with data fraught with heterogeneity that cannot be explained by sampling variation or measured covariates.
The random-effect Poisson regression model was first proposed to deal with over-dispersion for data fraught with heterogeneity after making allowance for measured covariates. Bayesian acyclic graphic model in conjunction with Markov Chain Monte Carlo (MCMC) technique was then applied to estimate the parameters of both relevant covariates and random effect. Predictive distribution was then generated to compare the predicted with the observed for the Bayesian model with and without random effect. Data from repeated measurement of episodes among 44 patients with intractable epilepsy were used as an illustration.
The application of Poisson regression without taking heterogeneity into account to epilepsy data yielded a large value of heterogeneity (heterogeneity factor = 17.90, deviance = 1485, degree of freedom (df) = 83). After taking the random effect into account, the value of heterogeneity factor was greatly reduced (heterogeneity factor = 0.52, deviance = 42.5, df = 81). The Pearson chi2 for the comparison between the expected seizure frequencies and the observed ones at two and three months of the model with and without random effect were 34.27 (p = 1.00) and 1799.90 (p < 0.0001), respectively.
The Bayesian acyclic model using the MCMC method was demonstrated to have great potential for disease prediction while data show over-dispersion attributed either to correlated property or to subject-to-subject variability.
本研究旨在开发一种预测模型,以处理充满异质性的数据,这种异质性无法通过抽样变异或测量的协变量来解释。
首先提出随机效应泊松回归模型,在考虑测量的协变量后处理充满异质性的数据的过度离散问题。然后应用贝叶斯无环图模型结合马尔可夫链蒙特卡罗(MCMC)技术来估计相关协变量和随机效应的参数。接着生成预测分布,以比较有无随机效应的贝叶斯模型的预测值与观测值。以44例难治性癫痫患者发作的重复测量数据为例进行说明。
将未考虑异质性的泊松回归应用于癫痫数据时,得到了较大的异质性值(异质性因子 = 17.90,偏差 = 1485,自由度(df) = 83)。考虑随机效应后,异质性因子的值大幅降低(异质性因子 = 0.52,偏差 = 42.5,df = 81)。有无随机效应的模型在两个月和三个月时预期癫痫发作频率与观测值比较的Pearson卡方值分别为34.27(p = 1.00)和1799.90(p < 0.0001)。
当数据显示由于相关性或个体间变异性导致过度离散时,使用MCMC方法的贝叶斯无环模型在疾病预测方面具有很大潜力。