O'Neill Søren, O'Neill Lotte
Spine Center of Southern Denmark, Lillebælt Hospital, Middelfart, Denmark; Institute of Regional Health Science Research, University of Southern Denmark, Odense, Denmark.
Centre for Medical Education, Aarhus University, INCUBA Science Park Skejby, Århus N, Denmark.
J Pain. 2015 May;16(5):454-62. doi: 10.1016/j.jpain.2015.01.476. Epub 2015 Feb 13.
The reliability of quantitative sensory testing (QST) is affected by the error attributable to both test occasion and rater (examiner) and the interactions between them. Most reliability studies account for only 1 source of error. The present study employed a fully crossed, multivariate generalizability design to account for rater and occasion variance simultaneously. Nineteen healthy volunteers were examined with a battery of 7 QST procedures 4 times on 2 occasions by 2 raters. The QST battery was composed to include a mix of different pain stimuli and response domains, including threshold, intensity, tolerance, and modulation with mechanical, thermal, and chemical stimuli. The classical test-retest and interrater reliability (.19 < intraclass correlation coefficient <.92) was in line with the literature, and generalizability analysis indicated that the universe score was generally the dominant source of variation (relative contribution = 19%, 78%). Error attributable to the interaction between study participant and occasion was also influential. Dependability coefficients indicated that a substantial increase in reliability and feasibility could be achieved by employing a composite QST battery compared to single QST procedures. Reliability was improved more by repeated testing on separate occasions than by repeated testing by different raters.
When balancing reliability and feasibility, the current findings suggest that a carefully selected battery of QST procedures repeated on a few occasions may be optimal.
定量感觉测试(QST)的可靠性受测试时机和评估者(检查者)误差以及二者之间相互作用的影响。大多数可靠性研究仅考虑一种误差来源。本研究采用完全交叉的多元概化设计,同时考虑评估者和时机差异。19名健康志愿者由两名评估者在两个时机进行4次测试,每次测试采用7种QST程序组成的测试组。该测试组包括不同疼痛刺激和反应域的组合,涵盖阈值、强度、耐受性以及机械、热和化学刺激的调制。经典的重测信度和评估者间信度(组内相关系数为0.19<ICC<0.92)与文献一致,概化分析表明全域分数通常是变异的主要来源(相对贡献率=19%,78%)。研究参与者与测试时机之间相互作用产生的误差也有影响。可靠性系数表明,与单一QST程序相比,采用复合QST测试组可显著提高可靠性和可行性。与不同评估者重复测试相比,在不同时机重复测试能更大程度地提高可靠性。
在平衡可靠性和可行性时,当前研究结果表明,精心挑选的QST程序测试组在少数几个时机重复测试可能是最佳选择。