Division of Oral and Maxillofacial Radiology, Department of Oral Biology and Diagnostic Sciences, Faculty of Dentistry, Chiang Mai University, Suthep Road, Suthep, Mueang Chiang Mai District, Chiang Mai, 50200, Thailand.
Department of Radiologic Technology, Faculty of Associated Medical Sciences, Chiang Mai University, Suthep Road, Suthep, Mueang Chiang Mai District, Chiang Mai, 50200, Thailand.
J Digit Imaging. 2023 Dec;36(6):2635-2647. doi: 10.1007/s10278-023-00871-4. Epub 2023 Aug 28.
The study aimed to evaluate the impact of image size, area of detection (IoU) thresholds and confidence thresholds on the performance of the YOLO models in the detection of dental caries in bitewing radiographs. A total of 2575 bitewing radiographs were annotated with seven classes according to the ICCMS radiographic scoring system. YOLOv3 and YOLOv7 models were employed with different configurations, and their performances were evaluated based on precision, recall, F1-score and mean average precision (mAP). Results showed that YOLOv7 with 640 × 640 pixel images exhibited significantly superior performance compared to YOLOv3 in terms of precision (0.557 vs. 0.268), F1-score (0.555 vs. 0.375) and mAP (0.562 vs. 0.458), while the recall was significantly lower (0.552 vs. 0.697). The following experiment found that the overall mAPs did not significantly differ between 640 × 640 pixel and 1280 × 1280 pixel images, for YOLOv7 with an IoU of 50% and a confidence threshold of 0.001 (p = 0.866). The last experiment revealed that the precision significantly increased from 0.570 to 0.593 for YOLOv7 with an IoU of 75% and a confidence threshold of 0.5, but the mean-recall significantly decreased and led to lower mAPs in both IoUs. In conclusion, YOLOv7 outperformed YOLOv3 in caries detection and increasing the image size did not enhance the model's performance. Elevating the IoU from 50% to 75% and confidence threshold from 0.001 to 0.5 led to a reduction of the model's performance, while simultaneously improving precision and reducing recall (minimizing false positives and negatives) for carious lesion detection in bitewing radiographs.
本研究旨在评估图像大小、检测面积(IoU)阈值和置信度阈值对 YOLO 模型在口内片龋齿检测中的性能的影响。根据 ICCMS 射线照相评分系统,对总共 2575 张口内片进行了七种类别的标注。使用不同配置的 YOLOv3 和 YOLOv7 模型,并根据精度、召回率、F1 分数和平均精度(mAP)来评估它们的性能。结果表明,与 YOLOv3 相比,使用 640×640 像素图像的 YOLOv7 在精度(0.557 对 0.268)、F1 分数(0.555 对 0.375)和 mAP(0.562 对 0.458)方面表现出显著优势,而召回率则明显较低(0.552 对 0.697)。后续实验发现,对于 IoU 为 50%且置信度阈值为 0.001 的 YOLOv7,在 640×640 像素和 1280×1280 像素图像之间,整体 mAP 没有显著差异(p=0.866)。最后一个实验表明,对于 IoU 为 75%且置信度阈值为 0.5 的 YOLOv7,精度从 0.570 显著增加到 0.593,但平均召回率显著下降,导致两种 IoU 下的 mAP 降低。总之,YOLOv7 在龋齿检测中优于 YOLOv3,增加图像大小并没有提高模型的性能。将 IoU 从 50%提高到 75%,置信度阈值从 0.001 提高到 0.5,会降低模型的性能,同时提高精度,降低口内片龋齿检测的召回率(最小化假阳性和假阴性)。