Slate E H, Turnbull B W
School of Operations Research and Industrial Engineering and Department of Statistical Science, Cornell University, Ithaca, NY 14853-3801, USA.
Stat Med. 2000 Feb 29;19(4):617-37. doi: 10.1002/(sici)1097-0258(20000229)19:4<617::aid-sim360>3.0.co;2-r.
We consider the analysis of serial biomarkers to screen and monitor individuals in a given population for onset of a specific disease of interest. The biomarker readings are subject to error. We survey some of the existing literature and concentrate on two recently proposed models. The first is a fully Bayesian hierarchical structure for a mixed effects segmented regression model. Posterior estimates of the changepoint (onset time) distribution are obtained by Gibbs sampling. The second is a hidden changepoint model in which the onset time distribution is estimated by maximum likelihood using the EM algorithm. Both methods lead to a dynamic index that represents a strength of evidence that onset has occurred by the current time in an individual subject. The methods are applied to some large data sets concerning prostate specific antigen (PSA) as a serial marker for prostate cancer. Rules based on the indices are compared to standard diagnostic criteria through the use of ROC curves adapted for longitudinal data.
我们考虑对系列生物标志物进行分析,以筛查和监测特定人群中感兴趣的特定疾病的发病情况。生物标志物读数存在误差。我们查阅了一些现有文献,并重点关注最近提出的两种模型。第一种是用于混合效应分段回归模型的全贝叶斯层次结构。通过吉布斯采样获得变化点(发病时间)分布的后验估计。第二种是隐藏变化点模型,其中发病时间分布通过使用期望最大化(EM)算法的最大似然估计来确定。这两种方法都得出一个动态指标,该指标表示个体受试者到当前时间已发病的证据强度。这些方法应用于一些关于前列腺特异性抗原(PSA)作为前列腺癌系列标志物的大型数据集。通过使用适用于纵向数据的ROC曲线,将基于这些指标的规则与标准诊断标准进行比较。