Suppr超能文献

人工智能驱动的自动 X 射线骨龄分析仪在中国儿童和青少年中的验证:与 Tanner-Whitehouse 3 法的比较。

Validation of an AI-Powered Automated X-ray Bone Age Analyzer in Chinese Children and Adolescents: A Comparison with the Tanner-Whitehouse 3 Method.

机构信息

Department of Pediatrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China.

Hubei Key Laboratory of Pediatric Genetic Metabolic and Endocrine Rare Diseases, Wuhan, 430030, China.

出版信息

Adv Ther. 2024 Sep;41(9):3664-3677. doi: 10.1007/s12325-024-02944-4. Epub 2024 Jul 31.

Abstract

INTRODUCTION

Automated bone age assessment (BAA) is of growing interest because of its accuracy and time efficiency in daily practice. In this study, we validated the clinical applicability of a commercially available artificial intelligence (AI)-powered X-ray bone age analyzer equipped with a deep learning-based automated BAA system and compared its performance with that of the Tanner-Whitehouse 3 (TW-3) method.

METHODS

Radiographs prospectively collected from 30 centers across various regions in China, including 900 Chinese children and adolescents, were assessed independently by six doctors (three experts and three residents) and an AI analyzer for TW3 radius, ulna, and short bones (RUS) and TW3 carpal bone age. The experts' mean estimates were accepted as the gold standard. The performance of the AI analyzer was compared with that of each resident.

RESULTS

For the estimation of TW3-RUS, the AI analyzer had a mean absolute error (MAE) of 0.48 ± 0.42. The percentage of patients with an absolute error of < 1.0 years was 86.78%. The MAE was significantly lower than that of rater 1 (0.54 ± 0.49, P = 0.0068); however, it was not significant for rater 2 (0.48 ± 0.48) or rater 3 (0.49 ± 0.46). For TW3 carpal, the AI analyzer had an MAE of 0.48 ± 0.65. The percentage of patients with an absolute error of < 1.0 years was 88.78%. The MAE was significantly lower than that of rater 2 (0.58 ± 0.67, P = 0.0018) and numerically lower for rater 1 (0.54 ± 0.64) and rater 3 (0.50 ± 0.53). These results were consistent for the subgroups according to sex, and differences between the age groups were observed.

CONCLUSION

In this comprehensive validation study conducted in China, an AI-powered X-ray bone age analyzer showed accuracies that matched or exceeded those of doctor raters. This method may improve the efficiency of clinical routines by reducing reading time without compromising accuracy.

摘要

简介

自动化骨龄评估(BAA)因其在日常实践中的准确性和效率而受到越来越多的关注。本研究旨在验证一种商用人工智能(AI)驱动的 X 射线骨龄分析仪的临床适用性,该分析仪配备了基于深度学习的自动 BAA 系统,并将其性能与 Tanner-Whitehouse 3(TW-3)方法进行比较。

方法

从中国各地 30 个中心前瞻性收集的包括 900 名中国儿童和青少年在内的 X 光片,由 6 名医生(3 名专家和 3 名住院医师)和 AI 分析仪分别独立评估 TW3 桡骨、尺骨和短骨(RUS)和 TW3 腕骨骨龄。专家的平均估计值被接受为金标准。比较 AI 分析仪与每位住院医师的性能。

结果

对于 TW3-RUS 的估计,AI 分析仪的平均绝对误差(MAE)为 0.48±0.42。绝对误差<1.0 岁的患者比例为 86.78%。MAE 明显低于评分者 1(0.54±0.49,P=0.0068);然而,对于评分者 2(0.48±0.48)或评分者 3(0.49±0.46),差异无统计学意义。对于 TW3 腕骨,AI 分析仪的 MAE 为 0.48±0.65。绝对误差<1.0 岁的患者比例为 88.78%。MAE 明显低于评分者 2(0.58±0.67,P=0.0018),与评分者 1(0.54±0.64)和评分者 3(0.50±0.53)相比,差异虽无统计学意义,但数值上有所降低。这些结果在根据性别分组的亚组中是一致的,并且观察到年龄组之间存在差异。

结论

在中国进行的这项综合验证研究中,人工智能驱动的 X 射线骨龄分析仪的准确性与医生评分者相匹配或超过了评分者。该方法可以通过减少阅读时间来提高临床常规的效率,而不会降低准确性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验