Development and external validation of automated detection, classification, and localization of ankle fractures: inside the black box of a convolutional neural network (CNN).

Affiliations

Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, The Netherlands.

Department of Surgery, Groningen University Medical Centre, Groningen, The Netherlands.

Publication information

Eur J Trauma Emerg Surg. 2023 Apr;49(2):1057-1069. doi: 10.1007/s00068-022-02136-1. Epub 2022 Nov 14.

DOI:10.1007/s00068-022-02136-1

PMID:36374292

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10175446/

Abstract

PURPOSE

Convolutional neural networks (CNNs) are increasingly being developed for automated fracture detection in orthopaedic trauma surgery. Studies to date, however, are limited to classification based on the entire image and produce only heatmaps for approximate fracture localization rather than delineating exact fracture morphology. We therefore aimed to answer two questions: (1) what is the performance of a CNN that detects, classifies, localizes, and segments an ankle fracture, and (2) is that performance externally valid?

METHODS

The training set included 326 isolated fibula fractures and 423 non-fracture radiographs. The Detectron2 implementation of the Mask R-CNN was trained with labelled and annotated radiographs. The internal validation set (the 'test set') and the external validation set consisted of 300 and 334 radiographs, respectively. Consensus among three experienced fellowship-trained trauma surgeons was defined as the ground truth label. Diagnostic accuracy and area under the receiver operating characteristic curve (AUC) were used to assess classification performance. The Intersection over Union (IoU) was used to quantify the accuracy of the CNN's segmentation predictions; a value of 0.5 is generally considered an adequate segmentation.
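To make the IoU metric concrete, the following is a minimal Python sketch of how it can be computed for a bounding box and for a segmentation mask. It is an illustration only and not the authors' evaluation code: the function names `box_iou` and `mask_iou`, the `(x_min, y_min, x_max, y_max)` box convention, and the toy inputs are assumptions introduced here.

```python
from typing import Sequence, Tuple

# Assumed box convention: (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[float, float, float, float]


def box_iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned bounding boxes."""
    # Width and height of the overlap, clamped to zero when the boxes are disjoint.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def mask_iou(pred: Sequence[Sequence[int]], truth: Sequence[Sequence[int]]) -> float:
    """Intersection over Union of two binary masks of identical shape."""
    inter = 0
    union = 0
    for row_p, row_t in zip(pred, truth):
        for p, t in zip(row_p, row_t):
            inter += 1 if (p and t) else 0  # pixel marked in both masks
            union += 1 if (p or t) else 0   # pixel marked in at least one mask
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    # Toy inputs (hypothetical, not study data).
    print(box_iou((0, 0, 2, 2), (1, 1, 3, 3)))          # overlap 1, union 7
    print(mask_iou([[1, 1], [0, 0]], [[1, 0], [1, 0]]))  # overlap 1, union 3
```

Under this definition, an IoU of 0.5 means the overlap between prediction and ground truth covers half of their combined area.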

RESULTS

The final CNN was able to classify fibula fractures according to four classes (Danis-Weber A, B, C and No Fracture) with AUC values ranging from 0.93 to 0.99. Diagnostic accuracy was 89% on the test set, with an average sensitivity of 89% and specificity of 96%. On external validation, accuracy was 89-90% on a set of radiographs from a different hospital. Per-class accuracy and AUC were 100% and 0.99 for the 'No Fracture' class, 92% and 0.99 for 'Weber B', 88% and 0.93 for 'Weber C', and 76% and 0.97 for 'Weber A'. For the fracture bounding box prediction by the CNN, a mean IoU of 0.65 (SD ± 0.16) was observed. The fracture segmentation predictions by the CNN resulted in a mean IoU of 0.47 (SD ± 0.17).
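As a rough illustration of how per-class sensitivity and specificity can be derived for a four-class problem like this one, the sketch below treats each class one-versus-rest on a confusion matrix. The abstract does not state how the reported averages were computed, so the one-versus-rest approach is an assumption. The function names and the counts in `TOY_CONFUSION` are hypothetical and are not the study's data.

```python
from typing import Dict, List

LABELS = ["No Fracture", "Weber A", "Weber B", "Weber C"]

# Hypothetical counts for illustration only; rows are true classes,
# columns are predicted classes, in the order given by LABELS.
TOY_CONFUSION = [
    [10, 0, 0, 0],
    [1, 3, 1, 0],
    [0, 0, 5, 0],
    [0, 0, 1, 4],
]


def one_vs_rest_metrics(
    confusion: List[List[int]], labels: List[str]
) -> Dict[str, Dict[str, float]]:
    """Per-class sensitivity and specificity, treating each class as 'positive' in turn."""
    total = sum(sum(row) for row in confusion)
    metrics: Dict[str, Dict[str, float]] = {}
    for i, label in enumerate(labels):
        tp = confusion[i][i]
        fn = sum(confusion[i]) - tp                   # true class i, predicted as another class
        fp = sum(row[i] for row in confusion) - tp    # another class, predicted as class i
        tn = total - tp - fn - fp
        metrics[label] = {
            "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
            "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
        }
    return metrics


def overall_accuracy(confusion: List[List[int]]) -> float:
    """Fraction of all cases that lie on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total if total else 0.0


if __name__ == "__main__":
    for label, m in one_vs_rest_metrics(TOY_CONFUSION, LABELS).items():
        print(f"{label}: sensitivity={m['sensitivity']:.2f}, specificity={m['specificity']:.2f}")
    print(f"overall accuracy={overall_accuracy(TOY_CONFUSION):.2f}")
```

Averaging the per-class values gives a macro-averaged sensitivity and specificity, which is one common way to summarise multi-class performance in a single pair of figures.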

CONCLUSIONS

This study presents a look into the 'black box' of CNNs and represents the first automated delineation (segmentation) of fracture lines on (ankle) radiographs. The AUC values presented in this paper indicate good discriminatory capability of the CNN and substantiate further study of CNNs in detecting and classifying ankle fractures.

LEVEL OF EVIDENCE

II, Diagnostic imaging study.
