Yamana Masashi, Bise Ryoma, Endo Makoto, Matsunobu Tomoya, Setsu Nokitaka, Yokoyama Nobuhiko, Nakashima Yasuharu, Uchida Seiichi
Department of Orthopaedic Surgery, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan.
Department of Advanced Information Technology, Kyushu University, Fukuoka, Japan.
Bone Joint Res. 2025 Sep 2;14(9):760-768. doi: 10.1302/2046-3758.149.BJR-2024-0505.R1.
We aim to develop a fully automated deep-learning model to detect and classify benign/malignant bone tumours in full-field limb radiographs using an object detection model. The secondary aim is to identify differences in classification characteristics between the developed automated model, three orthopaedic oncologists, and three general orthopaedic surgeons.
This retrospective analysis included 642 limb bone tumours with 40 diagnoses confirmed pathologically from three institutions (378 benign, 264 malignant including intermediate types). We employed end-to-end object Detection with transformers with Improved deNoising anchOr boxes (DINO) and You Only Look Once (YOLO) models. We performed five-fold cross validation on the collected radiographs, using the training data to train the models, validation data to optimize the models' parameters, and independent test data for final performance evaluation. Firstly, we confirmed DINO achieves a higher detection rate than YOLO. Secondly, we compared the classification performance of DINO with those of doctors, using various metrics such as accuracy, sensitivity, specificity, precision, and F-measure.
The DINO model achieved a higher mean tumour detection rate (85.7% (95% CI 81.5 to 89.8)) than the YOLO model (80.1% (95% CI 77.2 to 82.9)). For the evaluation of classification performance, we used 113 cases that DINO detected out of 128 randomly selected cases as the evaluation test set. The accuracy and sensitivity of the DINO model, as a superior model, were significantly higher than those of general orthopaedic surgeons. The DINO model correctly classified 78.6% (22 out of 28 cases) of the challenging cases that two or more doctors misclassified. However, DINO's diagnostic errors primarily occurred with tumours that were diagnostically challenging for orthopaedic oncologists or present in unusual sites.
The DINO model automatically detects bone tumours better than the YOLO model, and may assist doctors in detecting tumours and classifying malignant/benign bone tumours in clinical practice.
我们旨在开发一种全自动深度学习模型,使用目标检测模型在全视野肢体X光片中检测和分类良性/恶性骨肿瘤。次要目的是确定所开发的自动化模型、三名骨肿瘤学家和三名普通骨科医生在分类特征上的差异。
这项回顾性分析纳入了642例肢体骨肿瘤,其中40例诊断经三家机构病理证实(378例良性,264例恶性,包括中间类型)。我们采用了带有改进去噪锚框的变压器端到端目标检测(DINO)和你只看一次(YOLO)模型。我们对收集到的X光片进行了五折交叉验证,使用训练数据训练模型,验证数据优化模型参数,并使用独立测试数据进行最终性能评估。首先,我们证实DINO的检测率高于YOLO。其次,我们使用准确率、灵敏度、特异性、精确率和F值等各种指标,将DINO的分类性能与医生的分类性能进行了比较。
DINO模型的平均肿瘤检测率(85.7%(95%CI 81.5至89.8))高于YOLO模型(80.1%(95%CI 77.2至82.9))。为了评估分类性能,我们将DINO在128例随机选择的病例中检测出的113例病例用作评估测试集。作为一个 superior 模型,DINO模型的准确率和灵敏度显著高于普通骨科医生。DINO模型正确分类了78.6%(共28例中的22例)两名或更多医生误分类的具有挑战性的病例。然而,DINO的诊断错误主要发生在对骨肿瘤学家来说诊断具有挑战性的肿瘤或位于不寻常部位的肿瘤上。
DINO模型在自动检测骨肿瘤方面比YOLO模型更好,并且在临床实践中可能有助于医生检测肿瘤并对恶性/良性骨肿瘤进行分类。