MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.

Affiliation

Columbia University Medical Center, PB 1-301, New York, NY, 10032, USA.

Publication information

J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3.

DOI:10.1007/s10278-018-0053-3

PMID:29404850

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6113150/

Abstract

Bone age assessment (BAA) is a commonly performed diagnostic study in pediatric radiology to assess skeletal maturity. The most commonly used method for BAA is the Greulich and Pyle atlas (Pediatr Radiol 46.9:1269-1274, 2016; Arch Dis Child 81.2:172-173, 1999). BAA can be a tedious and time-consuming process for the radiologist. As such, several computer-assisted detection/diagnosis (CAD) methods have been proposed to automate it. Classical CAD tools have traditionally relied on hard-coded algorithmic features, which suffer from a variety of drawbacks. Recently, convolutional neural networks (CNNs) have shown promise in a variety of medical imaging applications. There have been at least two published applications of deep learning to the evaluation of bone age (Med Image Anal 36:41-51, 2017; JDI 1-5, 2017). However, current implementations are limited by a combination of architecture design and relatively small datasets.

The purpose of this study is to demonstrate the benefits of a customized neural network algorithm carefully calibrated to the evaluation of bone age, using a relatively large institutional dataset. In doing so, this study aims to show that advanced architectures can be successfully trained from scratch in the medical imaging domain and can generate results that outperform any existing proposed algorithm.

The training data consisted of 10,289 images of different skeletal age examinations, 8909 from the hospital Picture Archiving and Communication System at our institution and 1383 from the public Digital Hand Atlas Database. The data were separated into four cohorts, one each for male and female children above the age of 8, and one each for male and female children below the age of 10. The testing set consisted of 20 radiographs for each 1-year age cohort, from 0-1 years through 14-15+ years, half male and half female.
The testing set included left-hand radiographs obtained for bone age assessment, trauma evaluation without significant findings, and skeletal surveys.

A customized neural network with 14 hidden layers was designed for this study. The network included several state-of-the-art techniques, including residual-style connections, inception layers, and spatial transformer layers. Data augmentation was applied to the network inputs to prevent overfitting. A linear regression output was used. Mean squared error was used as the network loss function, and mean absolute error (MAE) was used as the primary performance metric.

MAE on the validation and test sets for young females was 0.654 and 0.561, respectively. For older females, validation and test MAE were 0.662 and 0.497, respectively. For young males, validation and test MAE were 0.649 and 0.585, respectively. Finally, for older males, validation and test MAE were 0.581 and 0.501, respectively. The female cohorts were trained for 900 epochs each and the male cohorts were trained for 600 epochs. Eightfold cross-validation was employed for hyperparameter tuning. Test error was obtained after training on the full data set with the selected hyperparameters.

Using our proposed customized neural network architecture on our large available data, we achieved aggregate validation and test set mean absolute errors of 0.637 and 0.536, respectively. To date, this is the best published performance for deep learning applied to bone age assessment. Our results support our initial hypothesis that customized, purpose-built neural networks provide improved performance over networks derived from pre-trained imaging data sets. We build on that initial work by showing that the addition of state-of-the-art techniques such as residual connections and inception architecture further improves prediction accuracy.
This is important because the current assumption for use of residual and/or inception architectures is that a large pre-trained network is required for successful implementation, given the relatively small datasets in medical imaging. Instead, we show that a small, customized architecture incorporating advanced CNN strategies can indeed be trained from scratch, yielding significant improvements in algorithm accuracy.

It should be noted that for all four cohorts, test error was lower than validation error. One reason for this is that the ground truth for our test set was obtained by averaging the reads of two pediatric radiologists, whereas only a single read was used for our training data. This suggests that despite relatively noisy training data, the algorithm could successfully model the variation between observers and generate estimates that are close to the expected ground truth.
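
The metric arithmetic described in the abstract can be illustrated with a short Python sketch. It is not the authors' code. It defines mean squared error (the stated loss) and mean absolute error (the stated metric), averages two reads into a ground-truth value as described for the test set, and checks that the reported aggregate MAEs agree to three decimals with the unweighted mean of the four cohort values. The abstract does not say how the aggregates were computed, so the unweighted mean is an assumption, and the reads and predictions below are hypothetical values invented for illustration.

```python
# Illustrative sketch only; not the authors' implementation.

def mean_squared_error(y_true, y_pred):
    """Average squared difference (the abstract's stated loss function)."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference (the abstract's stated performance metric)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Per-cohort MAE values exactly as reported in the abstract.
reported_mae = {
    "young_female": {"validation": 0.654, "test": 0.561},
    "older_female": {"validation": 0.662, "test": 0.497},
    "young_male": {"validation": 0.649, "test": 0.585},
    "older_male": {"validation": 0.581, "test": 0.501},
}


def unweighted_mean(split):
    """Assumption: aggregate = simple mean of the four cohort values."""
    values = [cohort[split] for cohort in reported_mae.values()]
    return sum(values) / len(values)


aggregate_validation = unweighted_mean("validation")  # reported: 0.637
aggregate_test = unweighted_mean("test")              # reported: 0.536

# Hypothetical bone-age reads from two radiologists (made-up numbers).
read_a = [10.0, 12.5, 7.0]
read_b = [11.0, 12.0, 7.5]
# Test-set ground truth as described: the average of the two reads.
ground_truth = [(a + b) / 2 for a, b in zip(read_a, read_b)]

# Hypothetical model predictions (made-up numbers).
predictions = [10.4, 12.0, 7.5]

test_mse = mean_squared_error(ground_truth, predictions)
test_mae = mean_absolute_error(ground_truth, predictions)

if __name__ == "__main__":
    print(f"aggregate validation MAE (unweighted mean): {aggregate_validation:.4f}")
    print(f"aggregate test MAE (unweighted mean): {aggregate_test:.4f}")
    print(f"toy MSE: {test_mse:.4f}, toy MAE: {test_mae:.4f}")
```

Training on mean squared error penalizes large misses more heavily than small ones, while reporting mean absolute error keeps the headline number in the same units as the bone-age labels.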

Similar articles

Deep learning-based automated bone age estimation for Saudi patients on hand radiograph images: a retrospective study.

BMC Med Imaging. 2024 Aug 1;24(1):199. doi: 10.1186/s12880-024-01378-2.

Deep convolutional neural network and IoT technology for healthcare.

Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.

Fully Automated Deep Learning System for Bone Age Assessment.

J Digit Imaging. 2017 Aug;30(4):427-441. doi: 10.1007/s10278-017-9955-8.

A multi-scale data fusion framework for bone age assessment with convolutional neural networks.

Comput Biol Med. 2019 May;108:161-173. doi: 10.1016/j.compbiomed.2019.03.015. Epub 2019 Mar 19.

Digital hand atlas and web-based bone age assessment: system design and implementation.

Comput Med Imaging Graph. 2000 Sep-Oct;24(5):297-307. doi: 10.1016/s0895-6111(00)00026-4.

Bone age determination using only the index finger: a novel approach using a convolutional neural network compared with human radiologists.

Pediatr Radiol. 2020 Apr;50(4):516-523. doi: 10.1007/s00247-019-04587-y. Epub 2019 Dec 20.

Automated semantic labeling of pediatric musculoskeletal radiographs using deep learning.

Pediatr Radiol. 2019 Jul;49(8):1066-1070. doi: 10.1007/s00247-019-04408-2. Epub 2019 Apr 30.

Carpal Bone Segmentation Using Fully Convolutional Neural Network.

Curr Med Imaging Rev. 2019;15(10):983-989. doi: 10.2174/1573405615666190724101600.

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning.

IEEE Trans Med Imaging. 2016 May;35(5):1285-98. doi: 10.1109/TMI.2016.2528162. Epub 2016 Feb 11.

Cited by

Development of an age estimation method for the coxal bone and lumbar vertebrae obtained from post-mortem computed tomography images using a convolutional neural network.

Int J Legal Med. 2025 Sep 1. doi: 10.1007/s00414-025-03587-y.

A review of methods of age estimation based on postmortem computed tomography.

Forensic Sci Res. 2024 Jul 18;10(1):owae036. doi: 10.1093/fsr/owae036. eCollection 2025 Mar.

Charting the growth through intelligence: A SWOC analysis on AI-assisted radiologic bone age estimation.

Int J Legal Med. 2025 Mar;139(2):679-694. doi: 10.1007/s00414-024-03356-3. Epub 2024 Oct 26.

Deep learning-based automated bone age estimation for Saudi patients on hand radiograph images: a retrospective study.

BMC Med Imaging. 2024 Aug 1;24(1):199. doi: 10.1186/s12880-024-01378-2.

An artificial intelligence-based bone age assessment model for Han and Tibetan children.

Front Physiol. 2024 Feb 15;15:1329145. doi: 10.3389/fphys.2024.1329145. eCollection 2024.

Deep Learning-Assisted Identification of Femoroacetabular Impingement (FAI) on Routine Pelvic Radiographs.

J Imaging Inform Med. 2024 Feb;37(1):339-346. doi: 10.1007/s10278-023-00920-y. Epub 2024 Jan 11.

Skeletal age evaluation using hand X-rays to determine growth problems.

PeerJ Comput Sci. 2023 Nov 22;9:e1512. doi: 10.7717/peerj-cs.1512. eCollection 2023.

Artificial intelligence in forensic medicine and forensic dentistry.

J Forensic Odontostomatol. 2023 Aug 27;41(2):30-41.

The uncovered biases and errors in clinical determination of bone age by using deep learning models.

Eur Radiol. 2023 May;33(5):3544-3556. doi: 10.1007/s00330-022-09330-0. Epub 2022 Dec 20.

Development of a multi-stage model for intelligent and quantitative appraising of skeletal maturity using cervical vertebras cone-beam CT images of Chinese girls.

Int J Comput Assist Radiol Surg. 2022 Apr;17(4):761-773. doi: 10.1007/s11548-021-02550-7. Epub 2022 Jan 4.

References

A multi-resolution approach for spinal metastasis detection using deep Siamese neural networks.

Comput Biol Med. 2017 May 1;84:137-146. doi: 10.1016/j.compbiomed.2017.03.024. Epub 2017 Mar 27.

Fully Automated Deep Learning System for Bone Age Assessment.

J Digit Imaging. 2017 Aug;30(4):427-441. doi: 10.1007/s10278-017-9955-8.

Deep learning for automated skeletal bone age assessment in X-ray images.

Med Image Anal. 2017 Feb;36:41-51. doi: 10.1016/j.media.2016.10.010. Epub 2016 Oct 29.

Large scale deep learning for computer aided detection of mammographic lesions.

Med Image Anal. 2017 Jan;35:303-312. doi: 10.1016/j.media.2016.07.007. Epub 2016 Aug 2.

Bone age assessment practices in infants and older children among Society for Pediatric Radiology members.

Pediatr Radiol. 2016 Aug;46(9):1269-74. doi: 10.1007/s00247-016-3618-7. Epub 2016 May 12.

Brain Tumor Segmentation Using Convolutional Neural Networks in MRI Images.

IEEE Trans Med Imaging. 2016 May;35(5):1240-1251. doi: 10.1109/TMI.2016.2538465. Epub 2016 Mar 4.

Lung Pattern Classification for Interstitial Lung Diseases Using a Deep Convolutional Neural Network.

IEEE Trans Med Imaging. 2016 May;35(5):1207-1216. doi: 10.1109/TMI.2016.2535865. Epub 2016 Feb 29.

Automatic Detection of Cerebral Microbleeds From MR Images via 3D Convolutional Neural Networks.

IEEE Trans Med Imaging. 2016 May;35(5):1182-1195. doi: 10.1109/TMI.2016.2528129. Epub 2016 Feb 11.

Multi-scale Convolutional Neural Networks for Lung Nodule Classification.

Inf Process Med Imaging. 2015;24:588-99. doi: 10.1007/978-3-319-19992-4_46.

Validation and reference values of automated bone age determination for four ethnicities.

Acad Radiol. 2010 Nov;17(11):1425-32. doi: 10.1016/j.acra.2010.06.007. Epub 2010 Aug 6.
