Kavuri Amar, Das Mini
University of Houston, Department of Biomedical Engineering, Houston, Texas, United States.
University of Houston, Department of Physics, Houston, Texas, United States.
J Med Imaging (Bellingham). 2025 Jan;12(1):015501. doi: 10.1117/1.JMI.12.1.015501. Epub 2024 Dec 31.
Digital phantoms are a key component of virtual imaging trials (VITs), which aim to assess and optimize new medical imaging systems and algorithms. However, these phantoms vary in their voxel resolution, appearance, and structural details. We investigate whether and how variations between digital phantoms influence system optimization, using digital breast tomosynthesis (DBT) as the chosen modality.
We selected widely used and open-access digital breast phantoms created with different methods and generated an ensemble of DBT images to test acquisition strategies. Human observer performance was evaluated using localization receiver operating characteristic (LROC) studies for each phantom type. Noise power spectrum and gaze metrics were also employed to compare phantoms and generated images.
Our LROC results show that the arc samplings for peak performance were … and 6 deg in the Bakic and XCAT breast phantoms, respectively, for the 3-mm lesion detection task. This indicates that system optimization outcomes from VITs can vary with phantom type and structural frequency components. In addition, a significant correlation between gaze metrics and diagnostic performance suggests that gaze analysis can be used to understand and evaluate task difficulty in VITs.
Our results point to the critical need to evaluate realism in digital phantoms and ensure sufficient structural variations at spatial frequencies relevant to the intended task. Standardizing phantom generation and validation tools may help reduce discrepancies among independently conducted VITs for system or algorithmic optimizations.