IEEE Trans Med Imaging. 2024 Nov;43(11):3895-3908. doi: 10.1109/TMI.2024.3405794. Epub 2024 Nov 4.
Probe-based confocal laser endomicroscopy (pCLE) has a role in characterising tissue intraoperatively to guide tumour resection during surgery. To capture good quality pCLE data which is important for diagnosis, the probe-tissue contact needs to be maintained within a working range of micrometre scale. This can be achieved through micro-surgical robotic manipulation which requires the automatic estimation of the probe-tissue distance. In this paper, we propose a novel deep regression framework composed of the Deep Regression Generative Adversarial Network (DR-GAN) and a Sequence Attention (SA) module. The aim of DR-GAN is to train the network using an enhanced image-based supervision approach. It extents the standard generator by using a well-defined function for image generation, instead of a learnable decoder. Also, DR-GAN uses a novel learnable neural perceptual loss which combines for the first time spatial and frequency domain features. This effectively suppresses the adverse effects of noise in the pCLE data. To incorporate temporal information, we've designed the SA module which is a cross-attention module, enhanced with Radial Basis Function based encoding (SA-RBF). Furthermore, to train the regression framework, we designed a multi-step training mechanism. During inference, the trained network is used to generate data representations which are fused along time in the SA-RBF module to boost the regression stability. Our proposed network advances SOTA networks by addressing the challenge of excessive noise in the pCLE data and enhancing regression stability. It outperforms SOTA networks applied on the pCLE Regression dataset (PRD) in terms of accuracy, data quality and stability.
基于探针的共聚焦激光内窥镜检查 (pCLE) 在术中对组织进行特征描述以指导手术中的肿瘤切除方面具有作用。为了捕获对诊断很重要的高质量 pCLE 数据,需要将探针-组织接触保持在微米级的工作范围内。这可以通过微外科机器人操作来实现,这需要自动估计探针-组织距离。在本文中,我们提出了一种新的深度回归框架,该框架由深度回归生成对抗网络 (DR-GAN) 和序列注意力 (SA) 模块组成。DR-GAN 的目的是使用基于图像的增强监督方法训练网络。它通过使用定义良好的图像生成函数来扩展标准生成器,而不是使用可学习的解码器。此外,DR-GAN 使用新的可学习神经感知损失,它首次结合了空间和频域特征。这有效地抑制了 pCLE 数据中噪声的不利影响。为了结合时间信息,我们设计了 SA 模块,它是一个带有基于径向基函数的编码 (SA-RBF) 的交叉注意模块。此外,为了训练回归框架,我们设计了一种多步训练机制。在推断过程中,使用训练有素的网络生成数据表示,然后在 SA-RBF 模块中沿时间融合,以提高回归稳定性。我们提出的网络通过解决 pCLE 数据中噪声过多的挑战并提高回归稳定性,在基于 SOTA 网络的基础上取得了进展。在 pCLE 回归数据集 (PRD) 上,它在准确性、数据质量和稳定性方面都优于 SOTA 网络。