Department of Radiology, Seoul St. Mary's Hospital, The Catholic University of Korea College of Medicine, Seoul, Korea.
Ultrasonography. 2014 Apr;33(2):91-7. doi: 10.14366/usg.13021. Epub 2014 Feb 26.
To evaluate intra- and interobserver reliability of the gray scale/dynamic range of the phantom image evaluation of ultrasonography using a standardized phantom, and to assess the effect of interactive education on the reliability.
Three radiologists (a resident, and two board-certified radiologists with 2 and 7 years of experience in evaluating ultrasound phantom images) performed the gray scale/dynamic range test for an ultrasound machine using a standardized phantom. They scored the number of visible cylindrical structures of varying degrees of brightness and made a 'pass or fail' decision. First, they scored 49 phantom images twice from a 2010 survey with limited knowledge of phantom images. After this, the radiologists underwent two hours of interactive education for the phantom images and scored another 91 phantom images from a 2011 survey twice. Intra- and interobserver reliability before and after the interactive education session were analyzed using K analyses.
Before education, the K-value for intraobserver reliability for the radiologist with 7 years of experience, 2 years of experience, and the resident was 0.386, 0.469, and 0.465, respectively. After education, the K-values were improved (0.823, 0.611, and 0.711, respectively). For interobserver reliability, the K-value was also better after the education for the 3 participants (0.067, 0.002, and 0.547 before education; 0.635, 0.667, and 0.616 after education, respectively).
The intra- and interobserver reliability of the gray scale/dynamic range was fair to substantial. Interactive education can improve reliability. For more reliable results, double- checking of phantom images by multiple reviewers is recommended.
使用标准化的体模评估超声灰阶/动态范围的体模图像,评价其观察者内和观察者间的可靠性,并评估交互式教育对可靠性的影响。
3 名放射科医师(一名住院医师和两名具有 2 年和 7 年超声体模图像评估经验的认证放射科医师)使用标准化体模对超声机的灰阶/动态范围进行测试。他们对不同亮度的可见圆柱结构的数量进行评分,并做出“通过或失败”的判断。首先,他们在对体模图像的知识有限的情况下,对 2010 年的 49 个体模图像进行了两次评分。之后,放射科医师接受了两小时的体模图像交互式教育,并在 2011 年的调查中对另外 91 个体模图像进行了两次评分。使用 K 分析对交互式教育前后的观察者内和观察者间可靠性进行分析。
在教育之前,具有 7 年经验、2 年经验和住院医师的观察者内可靠性的 K 值分别为 0.386、0.469 和 0.465。在教育之后,K 值有所提高(分别为 0.823、0.611 和 0.711)。对于观察者间可靠性,3 名参与者在接受教育后 K 值也更好(教育前分别为 0.067、0.002 和 0.547;教育后分别为 0.635、0.667 和 0.616)。
灰阶/动态范围的观察者内和观察者间的可靠性为一般到较好。交互式教育可以提高可靠性。为了获得更可靠的结果,建议由多名审阅者对体模图像进行双次检查。