Chen Chaoyu, Yang Xin, Huang Yuhao, Shi Wenlong, Cao Yan, Luo Mingyuan, Hu Xindi, Zhu Lei, Yu Lequan, Yue Kejuan, Zhang Yuanji, Xiong Yi, Ni Dong, Huang Weijun
National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China; Medical Ultrasound Image Computing (MUSIC) Laboratory, Shenzhen University, Shenzhen, China.
Shenzhen RayShape Medical Technology Co., Ltd, Shenzhen, China.
Med Image Anal. 2023 Oct 21;91:103013. doi: 10.1016/j.media.2023.103013.
Fetal pose estimation in 3D ultrasound (US) involves identifying a set of associated fetal anatomical landmarks. Its primary objective is to provide comprehensive information about the fetus through landmark connections, thus benefiting various critical applications, such as biometric measurements, plane localization, and fetal movement monitoring. However, accurately estimating the 3D fetal pose in US volume has several challenges, including poor image quality, limited GPU memory for tackling high dimensional data, symmetrical or ambiguous anatomical structures, and considerable variations in fetal poses. In this study, we propose a novel 3D fetal pose estimation framework (called FetusMapV2) to overcome the above challenges. Our contribution is three-fold. First, we propose a heuristic scheme that explores the complementary network structure-unconstrained and activation-unreserved GPU memory management approaches, which can enlarge the input image resolution for better results under limited GPU memory. Second, we design a novel Pair Loss to mitigate confusion caused by symmetrical and similar anatomical structures. It separates the hidden classification task from the landmark localization task and thus progressively eases model learning. Last, we propose a shape priors-based self-supervised learning by selecting the relatively stable landmarks to refine the pose online. Extensive experiments and diverse applications on a large-scale fetal US dataset including 1000 volumes with 22 landmarks per volume demonstrate that our method outperforms other strong competitors.
三维超声(US)中的胎儿姿势估计涉及识别一组相关的胎儿解剖标志点。其主要目标是通过标志点连接提供有关胎儿的全面信息,从而有益于各种关键应用,如生物特征测量、平面定位和胎儿运动监测。然而,在超声容积中准确估计三维胎儿姿势存在若干挑战,包括图像质量差、处理高维数据时GPU内存有限、解剖结构对称或模糊以及胎儿姿势存在相当大的变化。在本研究中,我们提出了一种新颖的三维胎儿姿势估计框架(称为FetusMapV2)来克服上述挑战。我们的贡献有三个方面。首先,我们提出了一种启发式方案,探索互补的网络结构——无约束和无保留激活的GPU内存管理方法,这可以在有限的GPU内存下扩大输入图像分辨率以获得更好的结果。其次,我们设计了一种新颖的配对损失来减轻由对称和相似解剖结构引起的混淆。它将隐藏的分类任务与标志点定位任务分开,从而逐步简化模型学习。最后,我们通过选择相对稳定的标志点提出基于形状先验的自监督学习,以在线优化姿势。在一个大规模胎儿超声数据集上进行的广泛实验和多样化应用,该数据集包括1000个容积,每个容积有22个标志点,表明我们的方法优于其他强大的竞争对手。