Bardsley W G, McGinlay P B
Department of Obstetrics and Gynaecology, University of Manchester, St Mary's Hospital, U.K.
J Theor Biol. 1987 May 21;126(2):183-201. doi: 10.1016/s0022-5193(87)80228-x.
Computer fitting of binding data is discussed and it is concluded that the main problem is the choice of starting estimates and internal scaling parameters, not the optimization software. Solving linear overdetermined systems of equations for starting estimates is investigated. A function, Q, is introduced to study model discrimination with binding isotherms and the behaviour of Q as a function of model parameters is calculated for the case of 2 and 3 sites. The power function of the F test is estimated for models with 2 to 5 binding sites and necessary constraints on parameters for correct model discrimination are given. The sampling distribution of F test statistics is compared to an exact F distribution using the Chi-squared and Kolmogorov-Smirnov tests. For low order modes (n less than 3) the F test statistics are approximately F distributed but for higher order models the test statistics are skewed to the left of the F distribution. The parameter covariance matrix obtained by inverting the Hessian matrix of the objective function is shown to be a good approximation to the estimate obtained by Monte Carlo sampling for low order models (n less than 3). It is concluded that analysis of up to 2 or 3 binding sites presents few problems and linear, normal statistical results are valid. To identify correctly 4 sites is much more difficult, requiring very precise data and extreme parameter values. Discrimination of 5 from 4 sites is an upper limit to the usefulness of the F test.
本文讨论了结合数据的计算机拟合,并得出结论:主要问题在于初始估计值和内部缩放参数的选择,而非优化软件。研究了求解初始估计值的线性超定方程组的方法。引入了一个函数Q来研究结合等温线的模型判别,并针对两个和三个位点的情况计算了Q作为模型参数函数的行为。估计了具有2至5个结合位点的模型的F检验的功效函数,并给出了正确进行模型判别所需的参数约束。使用卡方检验和柯尔莫哥洛夫-斯米尔诺夫检验将F检验统计量的抽样分布与精确的F分布进行比较。对于低阶模型(n小于3),F检验统计量近似服从F分布,但对于高阶模型,检验统计量向左偏离F分布。对于低阶模型(n小于3),通过对目标函数的海森矩阵求逆得到的参数协方差矩阵被证明是对通过蒙特卡罗抽样获得的估计值的良好近似。得出的结论是,分析多达2或3个结合位点几乎没有问题,线性、正态统计结果是有效的。正确识别4个位点要困难得多,需要非常精确的数据和极端的参数值。从4个位点中判别出5个位点是F检验有用性的上限。