Department of Radiology, Brigham and Women's Hospital, Boston, MA, USA.
Yunu, Inc., Cary, NC, USA.
Comput Med Imaging Graph. 2024 Jan;111:102312. doi: 10.1016/j.compmedimag.2023.102312. Epub 2023 Dec 15.
Accurate lymph node size estimation is critical for staging cancer patients, initial therapeutic management, and assessing response to therapy. Current standard practice for quantifying lymph node size is based on a variety of criteria that use uni-directional or bi-directional measurements. Segmentation in 3D can provide more accurate evaluations of the lymph node size. Fully convolutional neural networks (FCNs) have achieved state-of-the-art results in segmentation for numerous medical imaging applications, including lymph node segmentation. Adoption of deep learning segmentation models in clinical trials often faces numerous challenges. These include lack of pixel-level ground truth annotations for training, generalizability of the models on unseen test domains due to the heterogeneity of test cases and variation of imaging parameters. In this paper, we studied and evaluated the performance of lymph node segmentation models on a dataset that was completely independent of the one used to create the models. We analyzed the generalizability of the models in the face of a heterogeneous dataset and assessed the potential effects of different disease conditions and imaging parameters. Furthermore, we systematically compared fully-supervised and weakly-supervised methods in this context. We evaluated the proposed methods using an independent dataset comprising 806 mediastinal lymph nodes from 540 unique patients. The results show that performance achieved on the independent test set is comparable to that on the training set. Furthermore, neither the underlying disease nor the heterogeneous imaging parameters impacted the performance of the models. Finally, the results indicate that our weakly-supervised method attains 90%- 91% of the performance achieved by the fully supervised training.
准确的淋巴结大小估测对于癌症患者的分期、初始治疗管理以及评估治疗反应至关重要。目前,量化淋巴结大小的标准实践基于各种标准,这些标准使用单向或双向测量。三维分割可以更准确地评估淋巴结大小。全卷积神经网络(FCNs)在包括淋巴结分割在内的许多医学成像应用中的分割中取得了最先进的结果。深度学习分割模型在临床试验中的采用经常面临许多挑战。这些挑战包括缺乏用于训练的像素级真实注释、由于测试用例的异质性和成像参数的变化,模型在未见测试域的通用性。在本文中,我们研究和评估了淋巴结分割模型在与创建模型完全独立的数据集上的性能。我们分析了模型在面对异构数据集时的通用性,并评估了不同疾病状况和成像参数的潜在影响。此外,我们在此背景下系统地比较了完全监督和弱监督方法。我们使用包含 540 个独特患者的 806 个纵隔淋巴结的独立数据集评估了所提出的方法。结果表明,在独立测试集上的性能与在训练集上的性能相当。此外,潜在疾病或异构成像参数均不会影响模型的性能。最后,结果表明,我们的弱监督方法达到了完全监督训练所达到的 90%-91%的性能。