Lee Meesun, Choi Young-Hun, Lee Seul-Bi, Choi Jae-Won, Lee Seunghyun, Hwang Jae-Yeon, Cheon Jung-Eun, Hong SungHyuk, Kim Jeonghoon, Cho Yeon-Jin
Department of Radiology, Seoul National University Hospital, 101 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea.
Department of Radiology, Seoul National University College of Medicine, Jongno-gu, Seoul 03080, Republic of Korea.
Diagnostics (Basel). 2025 Apr 14;15(8):993. doi: 10.3390/diagnostics15080993.
: To develop an automated deep learning-based bone age prediction model using the Tanner-Whitehouse (TW3) method and evaluate its feasibility by comparing its performance with that of pediatric radiologists. : The hand and wrist radiographs of 560 Korean children and adolescents (280 female, 280 male, mean age 9.43 ± 2.92 years) were evaluated using the TW3-based model and three pediatric radiologists. Images with bony destruction, congenital anomalies, or non-diagnostic quality were excluded. A commercialized AI solution built upon the Rotated Single Shot MultiBox Detector (SSD) and EfficientNet-B0 was used. Bone age measurements from the model and radiologists were compared using the paired -tests. Linear regression analysis was performed and the coefficient of determination (r²), mean absolute error (MAE), and root mean square error (RMSE) were measured. A Bland-Altman analysis was conducted and the proportion of bone age predictions within 0.6 years of the radiologists' assessments was calculated. : The TW3-based model demonstrated no significant differences between bone age measurements and radiologists, except for participants <6 and >13 years old (overall, = 0.874; 6-8 years, = 0.737; 8-9 years, = 0.093; 9-10 years, = 0.301; 10-11 years, = 0.584; 11-13 years, = 0.976; <6 or >13 years, < 0.001). There was a strong linear correlation between the model prediction and radiologist assessments (r = 0.977). The RMSE and MAE values of the model were 0.529 (95% CI, 0.482-0.575) and 0.388 (95% CI, 0.361-0.417) years. Overall, 82.3% of bone age model predictions were within 0.6 years of the radiologists' interpretation. : Automated deep learning-based bone age assessment has the potential to reduce radiologists' workload and provide standardized measurements for clinical decision making.
使用坦纳 - 怀特豪斯(TW3)方法开发一种基于深度学习的自动化骨龄预测模型,并通过将其性能与儿科放射科医生的性能进行比较来评估其可行性。
对560名韩国儿童和青少年(280名女性,280名男性,平均年龄9.43±2.92岁)的手部和腕部X光片使用基于TW3的模型和三名儿科放射科医生进行评估。排除有骨质破坏、先天性异常或诊断质量不佳的图像。使用基于旋转单阶段多框检测器(SSD)和高效网络B0构建的商业化人工智能解决方案。使用配对t检验比较模型和放射科医生的骨龄测量结果。进行线性回归分析并测量决定系数(r²)、平均绝对误差(MAE)和均方根误差(RMSE)。进行布兰德 - 奥特曼分析并计算骨龄预测在放射科医生评估的0.6年内的比例。
基于TW3的模型在骨龄测量和放射科医生之间没有显示出显著差异,除了年龄小于6岁和大于13岁的参与者(总体,r = 0.874;6 - 8岁,r = 0.737;8 - 9岁,r = 0.093;9 - 10岁,r = 0.301;10 - 11岁,r = 0.584;11 - 13岁,r = 0.976;小于6岁或大于13岁,p < 0.001)。模型预测和放射科医生评估之间存在很强的线性相关性(r = 0.977)。模型的RMSE和MAE值分别为0.529(95%置信区间,0.482 - 0.575)和0.388(95%置信区间,0.361 - 0.417)岁。总体而言,82.3%的骨龄模型预测在放射科医生解释的0.6年内。
基于深度学习的自动化骨龄评估有可能减轻放射科医生的工作量,并为临床决策提供标准化测量。