Keuth Ron, Hansen Lasse, Balks Maren, Jäger Ronja, Schröder Anne-Nele, Tüshaus Ludger, Heinrich Mattias
Medical Informatics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany.
EchoScout GmbH, Maria-Goeppert-Str. 3, 23562, Lübeck, Germany.
Int J Comput Assist Radiol Surg. 2025 Mar;20(3):441-451. doi: 10.1007/s11548-024-03315-8. Epub 2025 Jan 23.
Semantic segmentation and landmark detection are fundamental tasks of medical image processing, facilitating further analysis of anatomical objects. Although deep learning-based pixel-wise classification has set a new-state-of-the-art for segmentation, it falls short in landmark detection, a strength of shape-based approaches.
In this work, we propose a dense image-to-shape representation that enables the joint learning of landmarks and semantic segmentation by employing a fully convolutional architecture. Our method intuitively allows the extraction of arbitrary landmarks due to its representation of anatomical correspondences. We benchmark our method against the state-of-the-art for semantic segmentation (nnUNet), a shape-based approach employing geometric deep learning and a convolutional neural network-based method for landmark detection.
We evaluate our method on two medical datasets: one common benchmark featuring the lungs, heart, and clavicle from thorax X-rays, and another with 17 different bones in the paediatric wrist. While our method is on par with the landmark detection baseline in the thorax setting (error in mm of vs. ), it substantially surpassed it in the more complex wrist setting ( vs. ).
We demonstrate that dense geometric shape representation is beneficial for challenging landmark detection tasks and outperforms previous state-of-the-art using heatmap regression. While it does not require explicit training on the landmarks themselves, allowing for the addition of new landmarks without necessitating retraining.
语义分割和地标检测是医学图像处理的基本任务,有助于对解剖对象进行进一步分析。尽管基于深度学习的逐像素分类为分割设定了新的技术水平,但在地标检测方面却有所不足,而地标检测是基于形状的方法的优势所在。
在这项工作中,我们提出了一种密集的图像到形状表示,通过采用全卷积架构实现地标和语义分割的联合学习。由于我们的方法对解剖对应关系的表示,它直观地允许提取任意地标。我们将我们的方法与语义分割的当前技术水平(nnUNet)、一种采用几何深度学习的基于形状的方法以及一种基于卷积神经网络的地标检测方法进行基准测试。
我们在两个医学数据集上评估我们的方法:一个是来自胸部X光片的肺部、心脏和锁骨的常见基准数据集,另一个是儿科手腕部有17块不同骨头的数据集。虽然我们的方法在胸部数据集的地标检测基准测试中与基线相当(误差以毫米计,分别为 与 ),但在更复杂的手腕数据集上它大幅超越了基线(分别为 与 )。
我们证明了密集几何形状表示有利于具有挑战性的地标检测任务,并且优于以前使用热图回归的技术水平。同时它不需要在地标本身进行明确训练,允许在无需重新训练的情况下添加新的地标。