Hougaard P, Myglegaard P, Borch-Johnsen K
Biostatistical Department, Novo Nordisk, Bagsvaerd, Denmark.
Biometrics. 1994 Dec;50(4):1178-88.
It is not, in general, possible to include all relevant risk factors in a model of survival or disease incidence. This heterogeneity must be accounted for in the interpretation, as it can imply otherwise unexpected results. This is illustrated by diabetic nephropathy, a serious complication experienced by some diabetic patients. A mathematical model with varying susceptibility can explain that the incidence increases until 20 years duration of diabetes and later decreases. The hospital-based data cover patients diagnosed during 1933-1972. They are interval censored, because early detection of nephropathy requires chemical analysis of urine samples. The data are consistent with a model where less than half of the patients are susceptible, and for each of these the hazard is increasing. The estimated degree of heterogeneity markedly depends on the assumed model. The dependence on age at onset and calendar time of onset is examined. The highest risk is seen at onset age 13-17 years, and the risk decreases with calendar time. The effect of covariates on the hazard is markedly different for the various models, but this is partly a matter of parametrization, as the disagreement is reduced by a reparametrization inspired by accelerated failure time models.
一般来说,在生存或疾病发病率模型中纳入所有相关风险因素是不可能的。在解释过程中必须考虑到这种异质性,因为它可能意味着意想不到的结果。糖尿病肾病就是一个例子,这是一些糖尿病患者会经历的严重并发症。一个具有不同易感性的数学模型可以解释发病率在糖尿病病程达到20年之前会上升,之后会下降。基于医院的数据涵盖了1933年至1972年期间诊断出的患者。这些数据是区间删失的,因为肾病的早期检测需要对尿液样本进行化学分析。数据与一个模型相符,即不到一半的患者具有易感性,而且对于每一个这样的患者,风险都在增加。估计的异质性程度明显取决于所假设的模型。研究了发病年龄和发病日历时间的依赖性。发病年龄在13 - 17岁时风险最高,且风险随日历时间降低。协变量对风险的影响在不同模型中明显不同,但这部分是参数化的问题,因为通过受加速失效时间模型启发的重新参数化,分歧有所减少。