Physical Sciences, Sunnybrook Research Institute, Toronto, Ontario, Canada.
Department of Radiology, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA.
Med Phys. 2023 Dec;50(12):7441-7461. doi: 10.1002/mp.16786. Epub 2023 Oct 13.
The Tomosynthesis Mammography Imaging Screening Trial (TMIST), EA1151 conducted by the Eastern Cooperative Oncology Group (ECOG)/American College of Radiology Imaging Network (ACRIN) is a randomized clinical trial designed to assess the effectiveness for breast cancer screening of digital breast tomosynthesis (TM) compared to digital mammography (DM). Equipment from multiple vendors is being used in the study.
For the findings of the study to be valid and capture the true capacities of the two technology types, it is important that all equipment is operated within appropriate parameters with regard to image quality and dose. A harmonized QC program was established by a core physics team. Since there are over 120 trial sites, a centralized, automated QC program was chosen as the most practical design. This report presents results of the weekly QC testing program. A companion paper will review quality monitoring based on data from the headers of the patient images.
Study images are collected centrally after de-identification using the "TRIAD" application developed by ACR. The core physics team devised and implemented a minimal set of quality control (QC) tests to evaluate the tomosynthesis and 2D mammography systems. Weekly, monthly and annual testing is performed by the site mammography technologists with images submitted directly to the physics core. The weekly physics QC tests are described: SDNR of a low-contrast mass object, artifact spread, spatial resolution, tracking of technical factors, and in-slice noise power spectra.
As of December 31, 2022 (5 years), 145 sites with 411 machines had submitted QC data. A total of 136 742 TMIST participant screening imaging studies had been performed. The 5th and 95th percentile mean glandular doses for a single tomosynthesis exposure to a 4.0 cm thick PMMA phantom ("standard breast phantom") were 1.24 and 1.68 mGy respectively. The largest sources of QC non-conformance were: operator error, not following the QC protocol exactly, unreported software updates and preventive maintenance activities that affected QC setpoints. Noise power spectra were measured, however, standardization of performance targets across machine types and software revisions was difficult. Nevertheless, for each machine type, test measurement results were very consistent when the protocol was followed. Deviations in test results were mostly related to software and hardware changes.
Most systems performed very consistently. Although this is a harmonized program using identical phantoms and testing protocols, it is not appropriate to apply universal threshold or target metrics across the machine types because the systems have different non-linear reconstruction algorithms and image display filters. It was found to be more useful to assess pass/fail criteria in terms of relative deviations from baseline values established when a system is first characterized and after equipment is changed. Generally, systems which needed repair failed suddenly, but in retrospect, for a few cases, drops in SDNR and increases in mAs were observed prior to tube failure. TMIST is registered as NCT03233191 by Clinicaltrials.gov.
东部合作肿瘤学组(ECOG)/美国放射学院成像网络(ACRIN)进行的断层合成乳腺成像筛查试验(TMIST),EA1151,是一项旨在评估数字乳腺断层合成术(TM)与数字乳腺摄影术(DM)相比用于乳腺癌筛查的有效性的随机临床试验。研究中使用了来自多个供应商的设备。
为了使研究结果有效并捕捉两种技术类型的真实能力,重要的是所有设备的图像质量和剂量参数都在适当的范围内运行。一个核心物理团队建立了一个统一的 QC 计划。由于有超过 120 个试验站点,因此选择了集中式自动化 QC 计划作为最实用的设计。本报告介绍了每周 QC 测试计划的结果。另一篇论文将回顾基于患者图像头文件数据的质量监测。
研究图像使用 ACR 开发的“TRIAD”应用程序进行去识别后集中收集。核心物理团队设计并实施了一组最小的质量控制(QC)测试,以评估断层合成术和二维乳腺摄影系统。由站点乳腺摄影技术人员每周、每月和每年进行测试,并直接向物理核心提交图像。描述了每周的物理 QC 测试:低对比度肿块物体的 SDNR、伪影扩散、空间分辨率、技术因素的跟踪以及切片内噪声功率谱。
截至 2022 年 12 月 31 日(5 年),已有 145 个站点的 411 台设备提交了 QC 数据。共有 136,742 名 TMIST 参与者进行了筛查成像研究。单次断层合成术对 4.0cm 厚 PMMA 体模(“标准乳腺体模”)的平均腺体剂量的第 5 个和第 95 个百分位数分别为 1.24 和 1.68 mGy。QC 不符合的最大来源是:操作人员错误、没有完全按照 QC 协议操作、未报告软件更新以及影响 QC 设定值的预防性维护活动。然而,已经测量了噪声功率谱,但跨机器类型和软件版本标准化性能目标是困难的。尽管如此,当遵循协议时,每种机器类型的测试测量结果都非常一致。测试结果的偏差主要与软件和硬件变化有关。
大多数系统表现非常一致。尽管这是一个使用相同的体模和测试协议的统一计划,但由于系统具有不同的非线性重建算法和图像显示滤波器,不适合在机器类型之间应用通用的阈值或目标指标。评估系统首次特征化和设备更换后的基线值的相对偏差的通过/失败标准更为有用。一般来说,需要维修的系统会突然出现故障,但回想起来,在少数情况下,在管失效之前,SDNR 下降和 mAs 增加。TMIST 在 Clinicaltrials.gov 上注册为 NCT03233191。