Department of Radiology, The First Affiliated Hospital of Xi'an Jiaotong University, Yanta West Road No. 277, Xi'an, 710061, China.
The Key Laboratory of Biomedical Information Engineering, Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, 710054, China.
BMC Med Imaging. 2023 Jan 30;23(1):18. doi: 10.1186/s12880-023-00975-x.
Chest radiography is the standard investigation for identifying rib fractures. The application of artificial intelligence (AI) for detecting rib fractures on chest radiographs is limited by image quality control and multilesion screening. To our knowledge, few studies have developed and verified the performance of an AI model for detecting rib fractures by using multi-center radiographs. And existing studies using chest radiographs for multiple rib fracture detection have used more complex and slower detection algorithms, so we aimed to create a multiple rib fracture detection model by using a convolutional neural network (CNN), based on multi-center and quality-normalised chest radiographs.
A total of 1080 radiographs with rib fractures were obtained and randomly divided into the training set (918 radiographs, 85%) and the testing set (162 radiographs, 15%). An object detection CNN, You Only Look Once v3 (YOLOv3), was adopted to build the detection model. Receiver operating characteristic (ROC) and free-response ROC (FROC) were used to evaluate the model's performance. A joint testing group of 162 radiographs with rib fractures and 233 radiographs without rib fractures was used as the internal testing set. Furthermore, an additional 201 radiographs, 121 with rib fractures and 80 without rib fractures, were independently validated to compare the CNN model performance with the diagnostic efficiency of radiologists.
The sensitivity of the model in the training and testing sets was 92.0% and 91.1%, respectively, and the precision was 68.0% and 81.6%, respectively. FROC in the testing set showed that the sensitivity for whole-lesion detection reached 91.3% when the false-positive of each case was 0.56. In the joint testing group, the case-level accuracy, sensitivity, specificity, and area under the curve were 85.1%, 93.2%, 79.4%, and 0.92, respectively. At the fracture level and the case level in the independent validation set, the accuracy and sensitivity of the CNN model were always higher or close to radiologists' readings.
The CNN model, based on YOLOv3, was sensitive for detecting rib fractures on chest radiographs and showed great potential in the preliminary screening of rib fractures, which indicated that CNN can help reduce missed diagnoses and relieve radiologists' workload. In this study, we developed and verified the performance of a novel CNN model for rib fracture detection by using radiography.
胸部 X 射线摄影是识别肋骨骨折的标准检查方法。人工智能(AI)在胸部 X 射线照片上检测肋骨骨折的应用受到图像质量控制和多部位筛查的限制。据我们所知,很少有研究开发和验证基于多中心射线照片的 AI 模型检测肋骨骨折的性能。并且,现有的使用胸部 X 射线照片进行多发性肋骨骨折检测的研究使用了更复杂和更慢的检测算法,因此,我们旨在基于多中心和质量标准化的胸部射线照片,使用卷积神经网络(CNN)创建一种多发性肋骨骨折检测模型。
共获得 1080 张带有肋骨骨折的射线照片,并将其随机分为训练集(918 张射线照片,占 85%)和测试集(162 张射线照片,占 15%)。采用目标检测 CNN(YOLOv3)构建检测模型。使用接收者操作特征(ROC)和自由响应 ROC(FROC)来评估模型性能。使用 162 张带有肋骨骨折的射线照片和 233 张没有肋骨骨折的射线照片的联合测试组作为内部测试集。此外,还独立验证了另外 201 张射线照片,其中 121 张带有肋骨骨折,80 张没有肋骨骨折,以比较 CNN 模型性能与放射科医生的诊断效率。
在训练集和测试集中,该模型的灵敏度分别为 92.0%和 91.1%,精确度分别为 68.0%和 81.6%。在测试集中的 FROC 显示,当每个病例的假阳性为 0.56 时,整体病变检测的灵敏度达到 91.3%。在联合测试组中,病例级别的准确性、灵敏度、特异性和曲线下面积分别为 85.1%、93.2%、79.4%和 0.92。在独立验证集的骨折级别和病例级别上,CNN 模型的准确性和灵敏度始终较高或接近放射科医生的读数。
基于 YOLOv3 的 CNN 模型对胸部 X 射线照片上的肋骨骨折检测具有较高的灵敏度,在肋骨骨折的初步筛查中具有很大的潜力,这表明 CNN 可以帮助减少漏诊并减轻放射科医生的工作量。在这项研究中,我们开发并验证了一种新型 CNN 模型用于肋骨骨折检测的性能,该模型基于射线照片。