Liu Bowen, Huang Yulin, Li Shaowei, He Jinshui, Zhang Dongxu
State Key Laboratory of Molecular Vaccinology and Molecular Diagnostics, School of Public Health, Xiamen University, Xiamen, China.
Department of Pediatrics, Zhangzhou Affiliated Hospital of Fujian Medical University, Zhangzhou, China.
Quant Imaging Med Surg. 2024 Aug 1;14(8):5902-5914. doi: 10.21037/qims-23-806. Epub 2024 Feb 19.
Bone age assessment (BAA) is crucial for the diagnosis of growth disorders and the optimization of treatments. However, the random error caused by different observers' experiences and the low consistency of repeated assessments harms the quality of such assessments. Thus, automated assessment methods are needed.
Previous research has sought to design localization modules in a strongly or weakly supervised fashion to aggregate part regions to better recognize subtle differences. Conversely, we sought to efficiently deliver information between multi-granularity regions for fine-grained feature learning and to directly model long-distance relationships for global understanding. The proposed method has been named the "Multi-Granularity and Multi-Attention Net (2M-Net)". Specifically, we first applied the jigsaw method to generate related tasks emphasizing regions with different granularities, and we then trained the model on these tasks using a hierarchical sharing mechanism. In effect, the training signals from the extra tasks created as an inductive bias, enabling 2M-Net to discover task relatedness without the need for annotations. Next, the self-attention mechanism acted as a plug-and-play module to effectively enhance the feature representation capabilities. Finally, multi-scale features were applied for prediction.
A public data set of 14,236 hand radiographs, provided by the Radiological Society of North America (RSNA), was used to develop and validate 2M-Net. In the public benchmark testing, the mean absolute error (MAE) between the bone age estimates of the model and of the reviewer was 3.98 months (3.89 months for males and 4.07 months for females).
By using the jigsaw method to construct a multi-task learning strategy and inserting the self-attention module for efficient global modeling, we established 2M-Net, which is comparable to the previous best method in terms of performance.
骨龄评估(BAA)对于生长障碍的诊断和治疗优化至关重要。然而,不同观察者的经验所导致的随机误差以及重复评估的低一致性损害了此类评估的质量。因此,需要自动化评估方法。
先前的研究试图以强监督或弱监督方式设计定位模块,以聚合部分区域,从而更好地识别细微差异。相反,我们试图在多粒度区域之间高效传递信息以进行细粒度特征学习,并直接对长距离关系进行建模以实现全局理解。所提出的方法被命名为“多粒度多注意力网络(2M-Net)”。具体而言,我们首先应用拼图方法生成强调不同粒度区域的相关任务,然后使用分层共享机制在这些任务上训练模型。实际上,来自额外任务的训练信号被创建为一种归纳偏差,使2M-Net能够在无需注释的情况下发现任务相关性。接下来,自注意力机制作为即插即用模块有效地增强了特征表示能力。最后,应用多尺度特征进行预测。
使用由北美放射学会(RSNA)提供的包含14236张手部X光片的公共数据集来开发和验证2M-Net。在公共基准测试中,模型的骨龄估计值与审阅者的骨龄估计值之间的平均绝对误差(MAE)为3.98个月(男性为3.89个月,女性为4.07个月)。
通过使用拼图方法构建多任务学习策略并插入自注意力模块进行高效全局建模,我们建立了2M-Net,其在性能方面与先前的最佳方法相当。