Clinical Imaging Research Centre, Yong Loo Lin School of Medicine, National University of Singapore, 14 Medical Drive, #B1-01, Singapore, 117599, Singapore.
Department of Diagnostic Imaging, National University Hospital, Singapore, Singapore.
Jpn J Radiol. 2022 Dec;40(12):1290-1299. doi: 10.1007/s11604-022-01311-z. Epub 2022 Jul 9.
To compare the performances of machine learning (ML) and deep learning (DL) in improving the quality of low dose (LD) lung cancer PET images and the minimum counts required.
33 standard dose (SD) PET images, were used to simulate LD PET images at seven-count levels of 0.25, 0.5, 1, 2, 5, 7.5 and 10 million (M) counts. Image quality transfer (IQT), a ML algorithm that uses decision tree and patch-sampling was compared to two DL networks-HighResNet (HRN) and deep-boosted regression (DBR). Supervised training was performed by training the ML and DL algorithms with matched-pair SD and LD images. Image quality evaluation and clinical lesion detection tasks were performed by three readers. Bias in 53 radiomic features, including mean SUV, was evaluated for all lesions.
ML- and DL-estimated images showed higher signal and smaller error than LD images with optimal image quality recovery achieved using LD down to 5 M counts. True positive rate and false discovery rate were fairly stable beyond 5 M counts for the detection of small and large true lesions. Readers rated average or higher ratings to images estimated from LD images of count levels above 5 M only, with higher confidence in detecting true lesions.
LD images with a minimum of 5 M counts (8.72 MBq for 10 min scan or 25 MBq for 3 min scan) are required for optimal clinical use of ML and DL, with slightly better but more varied performance shown by DL.
比较机器学习(ML)和深度学习(DL)在提高低剂量(LD)肺癌 PET 图像质量和所需最小计数方面的性能。
使用 33 个标准剂量(SD)PET 图像,模拟了 7 个计数水平(0.25、0.5、1、2、5、7.5 和 1000 万[M])的 LD PET 图像。与两种 DL 网络(HighResNet[HRN]和深度增强回归[DBR])相比,使用决策树和补丁采样的 ML 算法“图像质量转移(IQT)”。通过用匹配的 SD 和 LD 图像对 ML 和 DL 算法进行监督训练,进行图像质量评估和临床病变检测任务。对所有病变进行了 53 个放射组学特征(包括平均 SUV)的偏倚评估。
ML 和 DL 估计图像的信号高于 LD 图像,误差较小,在使用低至 5 M 计数的情况下可以恢复最佳图像质量。在检测小和大真实病变时,真阳性率和假阳性率在 5 M 计数以上相当稳定。读者对计数水平高于 5 M 的 LD 图像的估计图像给予了平均或更高的评分,对检测真实病变的信心更高。
对于 ML 和 DL 的最佳临床应用,需要使用至少 5 M 计数(10 分钟扫描为 8.72 MBq,3 分钟扫描为 25 MBq)的 LD 图像,DL 表现稍好但变化更多。