Long T
Department of Biostatistics, College of American Pathologists, Northfield, Ill. 60093-2750.
Arch Pathol Lab Med. 1993 Apr;117(4):387-92.
Matrix-induced bias can adversely affect the performance evaluation a clinical laboratory receives on proficiency testing results. Therefore, it is vital that matrix effects from a matrix-biased system are detected and that laboratories using these systems are not falsely penalized on their proficiency testing results. The College of American Pathologists has developed an experimental protocol to test whether an observed bias is, in fact, due to the proficiency testing sample matrix rather than true performance or calibration problems. The probability of detecting a matrix effect using this protocol given a matrix-biased system is defined as statistical power. Five parameters are known to affect the probability of detection: (1) size of the bias in the proficiency testing material, (2) lack of fit coefficient of variation-natural variation in patient samples used to define the relationship between the test and reference methods, (3) pure error coefficient of variation-random variation in the test method, (4) the number of fresh patient samples, and (5) the number of replicates for each sample. The level of significance of the statistical test will also affect the probability of detection. Power curves are generated to show the effect these five parameters have on the determination of power. With the exception of bias, power is most influenced by the components of variance, lack of fit (nonlinear), and pure (random) error. Of these two components, the lack of fit error, which represents an uncontrollable source of error, is usually more influential than pure error, which can be reduced by a larger number of replicates. A large increase in power will result from an increase of fresh patient samples from 10 to 20, and a moderate increase in power will result from an increase of fresh patient samples from 20 to 40; no noticeable increase in power is seen with greater than 40 fresh patient samples. Large increases in power were observed for increases in the number of replicates per sample from one to two, two to three, and three to five. To markedly increase power further, 10 replicates would have to be assayed.
基质诱导偏差会对临床实验室在能力验证结果方面的性能评估产生不利影响。因此,检测来自基质偏差系统的基质效应至关重要,并且使用这些系统的实验室不会因其能力验证结果而被错误扣分。美国病理学家学会已制定了一项实验方案,以测试观察到的偏差实际上是否归因于能力验证样本基质,而非真实性能或校准问题。在给定基质偏差系统的情况下,使用该方案检测基质效应的概率被定义为统计效能。已知有五个参数会影响检测概率:(1)能力验证材料中的偏差大小;(2)失拟变异系数——用于定义检测方法与参考方法之间关系的患者样本中的自然变异;(3)纯误差变异系数——检测方法中的随机变异;(4)新鲜患者样本数量;(5)每个样本的重复次数。统计检验的显著性水平也会影响检测概率。生成效能曲线以显示这五个参数对效能测定的影响。除偏差外,效能受方差成分、失拟(非线性)和纯(随机)误差的影响最大。在这两个成分中,代表不可控误差源的失拟误差通常比纯误差更具影响力,纯误差可通过增加重复次数来降低。新鲜患者样本从10个增加到20个会导致效能大幅增加,从20个增加到40个会导致效能适度增加;新鲜患者样本数量超过40个时,效能无明显增加。当每个样本的重复次数从1次增加到2次、2次增加到3次以及3次增加到5次时,观察到效能有大幅增加。要进一步显著提高效能,则必须测定10次重复。