Gorga M P, Neely S T, Dorn P A
Boys Town National Research Hospital, Omaha, Nebraska 68131, USA.
Ear Hear. 1999 Aug;20(4):345-62. doi: 10.1097/00003446-199908000-00007.
DPOAE and audiometric data were analyzed from 1267 ears of 806 subjects. These data were evaluated for three different frequency combinations (2, 3, 4 kHz; 2, 3, 4, 6 kHz; 1.5, 2, 3, 4, 6 kHz). DPOAE data were collected for each of the f2 frequencies listed above, using primary levels (L1/L2) of 65/55 dB SPL and a primary ratio (f2/f1) of 1.22. Sensitivity and specificity were evaluated for signal to noise ratios (SNRs) of 3, 6, and 9 dB, which are in common clinical use. In addition, test performance was evaluated using clinical decision theory, following the convention we have used in previous reports on otoacoustic emission test performance. Both univariate and multivariate analyses techniques were applied to the data. In addition to evaluating DPOAE test performance for the case when audiometric and f2 frequency were equal, multifrequency gold standards and multifrequency criterion responses were evaluated. Three new gold standards were used to assess test performance: average pure-tone thresholds, extrema thresholds that took into account both the magnitude of the loss and the number of frequencies at which hearing loss existed, and a combination of the two. These new gold standards were applied to each of the three frequency groups described above.
As expected, SNR criteria of 3, 6, and 9 dB never resulted in perfect DPOAE test performance. Even the most stringent of these criteria (9 dB SNR) did not result in a sensitivity of 100%. This result suggests that caution should be exercised in the interpretation of DPOAE test results when these a priori criteria are used clinically. Excellent test performance was achieved when auditory status was classified on the basis of the new gold standards and when either SNR or the output of multivariate logistic regressions (LRs) were used as criterion measures. Invariably, the LR resulted in superior test performance compared with what was achieved by the SNR. For SNR criteria of 3, 6, and 9 dB and (by definition) for the LR, specificity, in general, exceeded 80% and often was greater than 90%. Sensitivity, however, depended on the magnitude of hearing loss. Diagnostic errors, when they occurred, were more common for patients with mild hearing losses (21 to 40 dB HL); sensitivity approached 100% once the hearing loss exceeded 40 dB HL. The largest differences between test performance based on SNR or LR occurred for the ears with mild hearing loss, where the LR resulted in more accurate diagnoses.
It should not be assumed that the use of a priori response criteria, such as SNRs of 3, 6, or 9 dB, will identify all ears with hearing loss. Test performance when multifrequency gold standards are used to define an ear as normal or impaired and when data from multiple f2 frequencies are used to make a diagnosis, resulted in excellent test performance, especially when the LR was used. When predicting auditory status with multifrequency gold standards, the LR resulted in relative operating characteristic curve areas of 0.95 or 0.96. An output from the LR can be selected that results in a specificity of 90% or better. When the loss exceeded 40 dB HL, the same output from the LR resulted in test sensitivity of nearly 100%. These were the best test results that were achieved. (ABSTRACT TRUNCATED)
1)描述将先验反应标准应用于大量畸变产物耳声发射(DPOAE)数据时DPOAE测试的性能。2)描述使用听觉功能的多频定义时DPOAE测试的性能。3)根据来自多个频率的DPOAE数据对一只耳朵的听觉状态做出单一判断时,确定DPOAE测试的性能。4)当将多频金标准定义和反应标准应用于DPOAE数据时,比较单变量和多变量测试的性能。
分析了806名受试者1267只耳朵的DPOAE和听力测定数据。对三种不同的频率组合(2、3、4kHz;2、3、4、6kHz;1.5、2、3、4、6kHz)进行了评估。使用65/55dB SPL的初级水平(L1/L2)和1.22的初级比率(f2/f1),针对上述每个f2频率收集DPOAE数据。对临床常用的3、6和9dB的信噪比(SNR)评估了敏感性和特异性。此外,按照我们先前关于耳声发射测试性能报告中使用的惯例,使用临床决策理论评估测试性能。将单变量和多变量分析技术应用于数据。除了评估听力测定和f2频率相等时的DPOAE测试性能外,还评估了多频金标准和多频标准反应。使用了三种新的金标准来评估测试性能:平均纯音阈值、考虑到听力损失程度和存在听力损失的频率数量的极值阈值,以及两者的组合。将这些新的金标准应用于上述三个频率组中的每一组。
正如预期的那样,3、6和9dB的SNR标准从未导致完美的DPOAE测试性能。即使是这些标准中最严格的(9dB SNR)也未导致100%的敏感性。这一结果表明,在临床使用这些先验标准时,对DPOAE测试结果的解释应谨慎。当根据新的金标准对听觉状态进行分类,并且使用SNR或多变量逻辑回归(LR)的输出作为标准测量时,实现了出色的测试性能。与SNR相比,LR总是导致更好的测试性能。对于3、6和9dB的SNR标准以及(根据定义)对于LR,特异性通常超过80%,并且经常大于90%。然而,敏感性取决于听力损失的程度。诊断错误在轻度听力损失(21至40dB HL)患者中更常见;一旦听力损失超过40dB HL,敏感性接近100%。基于SNR或LR的测试性能之间的最大差异发生在轻度听力损失的耳朵中,其中LR导致更准确的诊断。
不应假定使用先验反应标准,如3、6或9dB的SNR,将识别出所有听力损失的耳朵。当使用多频金标准将一只耳朵定义为正常或受损,并且使用来自多个f2频率的数据进行诊断时,测试性能出色,特别是在使用LR时。当使用多频金标准预测听觉状态时,LR导致相对操作特征曲线面积为0.95或0.96。可以选择LR的一个输出,其导致特异性为90%或更好。当损失超过40dB HL时,LR的相同输出导致测试敏感性接近100%。这些是获得的最佳测试结果。(摘要截断)