Burgess A E, Jacobson F L, Judy P F
Radiology Department, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA.
Med Phys. 2001 Apr;28(4):419-37. doi: 10.1118/1.1355308.
We determined contrast thresholds for lesion detection as a function of lesion size in both mammograms and filtered noise backgrounds with the same average power spectrum, P(f)=B/f3. Experiments were done using hybrid images with digital images of tumors added to digitized normal backgrounds, displayed on a monochrome monitor. Four tumors were extracted from digitized specimen radiographs. The lesion sizes were varied by digital rescaling to cover the range from 0.5 to 16 mm. Amplitudes were varied to determine the value required for 92% correct detection in two-alternative forced-choice (2AFC) and 90% for search experiments. Three observers participated, two physicists and a radiologist. The 2AFC mammographic results demonstrated a novel contrast-detail (CD) diagram with threshold amplitudes that increased steadily (with slope of 0.3) with increasing size for lesions larger than 1 mm. The slopes for prewhitening model observers were about 0.4. Human efficiency relative to these models was as high as 90%. The CD diagram slopes for the 2AFC experiments with filtered noise were 0.44 for humans and 0.5 for models. Human efficiency relative to the ideal observer was about 40%. The difference in efficiencies for the two types of backgrounds indicates that breast structure cannot be considered to be pure random noise for 2AFC experiments. Instead, 2AFC human detection with mammographic backgrounds is limited by a combination of noise and deterministic masking effects. The search experiments also gave thresholds that increased with lesion size. However, there was no difference in human results for mammographic and filtered noise backgrounds, suggesting that breast structure can be considered to be pure random noise for this task. Our conclusion is that, in spite of the fact that mammographic backgrounds have nonstationary statistics, models based on statistical decision theory can still be applied successfully to estimate human performance.
我们在具有相同平均功率谱(P(f)=B/f^3)的乳腺造影片和滤波噪声背景中,确定了作为病变大小函数的病变检测对比度阈值。实验使用了混合图像,即将肿瘤的数字图像添加到数字化的正常背景上,并显示在单色显示器上。从数字化的标本射线照片中提取了四个肿瘤。通过数字缩放改变病变大小,范围从(0.5)到(16)毫米。改变振幅以确定在二择一强制选择(2AFC)中92%正确检测所需的值以及在搜索实验中90%正确检测所需的值。三名观察者参与了实验,两名物理学家和一名放射科医生。2AFC乳腺造影结果显示了一种新颖的对比度-细节(CD)图,对于大于1毫米的病变,阈值振幅随着大小的增加而稳步增加(斜率为0.3)。预白化模型观察者的斜率约为0.4。相对于这些模型,人类效率高达90%。在滤波噪声的2AFC实验中,人类的CD图斜率为0.44,模型为0.5。相对于理想观察者,人类效率约为40%。两种背景下效率的差异表明,在2AFC实验中,乳腺结构不能被视为纯随机噪声。相反,乳腺造影背景下的2AFC人类检测受到噪声和确定性掩蔽效应的综合限制。搜索实验也给出了随病变大小增加的阈值。然而,乳腺造影和滤波噪声背景下的人类结果没有差异,这表明对于这项任务,乳腺结构可以被视为纯随机噪声。我们的结论是,尽管乳腺造影背景具有非平稳统计特性,但基于统计决策理论的模型仍然可以成功应用于估计人类性能。