Tellinghuisen Joel
Department of Chemistry, Vanderbilt University, Nashville, Tennessee, USA.
Methods Enzymol. 2009;454:259-85. doi: 10.1016/S0076-6879(08)03810-X.
In the least-squares fitting of data, there is a unique answer to the question of how the data should be weighted: inversely as their variance. Any other weighting gives less than optimal efficiency and leads to unreliable estimates of the parameter uncertainties. In calibration, knowledge of the data variance permits exact prediction of the precision of calibration, empowering the analyst to critically examine different response functions and different data structures. These points are illustrated here with a nonlinear response function that exhibits a type of saturation curvature at large signal like that observed in a number of detection methods. Exact error propagation is used to compute the uncertainty in the fitted response function and to treat common data transformations designed to reduce or eliminate the effects of data heteroscedasticity. Data variance functions can be estimated with adequate reliability from remarkably small data sets, illustrated here with three replicates at each of seven calibration values. As a quantitative goodness-of-fit indicator, chi(2) is better than the widely used R(2); in one application it shows clearly that the dominant source of uncertainty is not the measurement but the preparation of the calibration samples, forcing the conclusion that the calibration regression should be reversed.
在数据的最小二乘拟合中,对于数据应如何加权的问题有一个唯一的答案:与它们的方差成反比。任何其他加权方式都会导致效率低于最优水平,并导致参数不确定性的估计不可靠。在校准中,了解数据方差可以精确预测校准的精度,使分析人员能够严格审查不同的响应函数和不同的数据结构。这里用一个非线性响应函数来说明这些要点,该函数在大信号时呈现出一种饱和曲率,类似于在许多检测方法中观察到的情况。使用精确的误差传播来计算拟合响应函数中的不确定性,并处理旨在减少或消除数据异方差性影响的常见数据变换。数据方差函数可以从非常小的数据集中以足够的可靠性进行估计,这里以七个校准值中的每一个值进行三次重复为例进行说明。作为一种定量的拟合优度指标,卡方比广泛使用的R²更好;在一个应用中,它清楚地表明不确定性的主要来源不是测量,而是校准样品的制备,从而得出校准回归应该颠倒的结论。