Eckert R S, Carroll R J, Wang N
Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, Indiana 46285, USA.
Biometrics. 1997 Mar;53(1):262-72.
In many problems, one wants to model the relationship between a response Y and a covariate X. Sometimes it is difficult, expensive, or even impossible to observe X directly, but one can instead observe a substitute variable W that is easier to obtain. By far, the most common model for the relationship between the actual covariate of interest X and the substitute W is W = X + U, where the variable U represents measurement error. This assumption of additive measurement error may be unreasonable for certain data sets. We propose a new model, namely h(W) = h(X) + U, where h(.) is a monotone transformation function selected from some family H of monotone functions. The idea of the new model is that, in the correct scale, measurement error is additive. We propose two possible transformation families H. One is based on selecting a transformation that makes the within-sample mean and standard deviation of replicated W's uncorrelated. The second is based on selecting the transformation so that the errors (U's) fit a prespecified distribution. Transformation families used are the parametric power transformations and a cubic spline family. Several data examples are presented to illustrate the methods.
在许多问题中,人们希望对响应变量Y和协变量X之间的关系进行建模。有时,直接观测X是困难的、昂贵的,甚至是不可能的,但人们可以转而观测一个更容易获得的替代变量W。到目前为止,感兴趣的实际协变量X与替代变量W之间关系最常见的模型是W = X + U,其中变量U表示测量误差。对于某些数据集而言,这种加性测量误差的假设可能并不合理。我们提出一种新模型,即h(W) = h(X) + U,其中h(.)是从某个单调函数族H中选取的单调变换函数。新模型的理念是,在正确的尺度下,测量误差是可加的。我们提出了两个可能的变换族H。一个基于选择一种变换,使得重复的W的样本内均值和标准差不相关。另一个基于选择变换,使得误差(U)符合预先指定的分布。所使用的变换族是参数化幂变换和三次样条族。给出了几个数据示例来说明这些方法。