Narayanan Ramanujam, Sundaresan Vaanathi
Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru, 560012, Karnataka, India.
Pattern Recognit Lett. 2025 Feb;188:155-163. doi: 10.1016/j.patrec.2024.12.011.
Training models for robust lesion segmentation in medical imaging relies on the availability of sufficiently large pathological datasets and high-quality manual annotations. Hence, training such models is challenging in low-data regimes, even for localised lesions with defined boundaries, due to the lack of representation of variations in contrast, texture and sizes. In this work, we proposed a lesion simulation method, MedLesSynth-LD, to overcome the lack of diversity in localised lesion characteristics for training robust segmentation models. In MedLesSynth-LD, we used noise models inherently based on the physics involved in the acquisition of modalities to generate sufficiently realistic lesion textures by perturbing healthy tissues. Later, we localised these perturbations within masks defined by composites of ellipsoids (thus forming random shapes) and blended them with the input image with varying contrast. The lesion simulation step does not require training and can be tailored to generate defined, localised lesions to introduce sufficient variability (in size, shape, texture and contrast) in the training data pool. We evaluated the performance of a downstream lesion segmentation task using simulated lesionsfor multiple publicly available datasets across imaging modalities and organs: Brain MRI for tumour and white matter hyperintensity segmentation, liver CT for tumour segmentation, breast ultrasound for tumour segmentation, and retinal fundus imaging for exudate segmentation. Using only 75% of labelled real-world data, the proposed method significantly improved lesion segmentation compared to real data-based fully supervised training with an 16% mean increase in the Dice score (DSC) and 33% mean decrease in the normalised 95th percentile of the Hausdorff distance (HD95 (norm)). The proposed method also performed better than state-of-the-art lesion segmentation methods in low-data regimes, with an 10% higher mean DSC and a 19% mean decrease in HD95 (norm). The source code is available at https://github.com/Ramanujam-N/MedLesSynth-LD [commit SHA cc2b15b].
训练用于医学成像中稳健病变分割的模型依赖于足够大的病理数据集和高质量的手动标注。因此,即使对于具有明确边界的局部病变,在低数据量情况下训练此类模型也具有挑战性,因为缺乏对比度、纹理和大小变化的表示。在这项工作中,我们提出了一种病变模拟方法MedLesSynth-LD,以克服用于训练稳健分割模型的局部病变特征缺乏多样性的问题。在MedLesSynth-LD中,我们使用基于模态采集所涉及物理原理的噪声模型,通过扰动健康组织来生成足够逼真的病变纹理。之后,我们将这些扰动定位在由椭球体合成定义的掩码内(从而形成随机形状),并将它们与具有不同对比度的输入图像混合。病变模拟步骤不需要训练,可以进行定制以生成明确的局部病变,从而在训练数据池中引入足够的变异性(在大小、形状、纹理和对比度方面)。我们使用模拟病变对多个跨成像模态和器官的公开可用数据集评估了下游病变分割任务的性能:用于肿瘤和白质高信号分割的脑部MRI、用于肿瘤分割的肝脏CT、用于肿瘤分割的乳腺超声以及用于渗出物分割的视网膜眼底成像。仅使用75%的标记真实世界数据,与基于真实数据的全监督训练相比,所提出的方法显著改善了病变分割,Dice分数(DSC)平均提高了16%,Hausdorff距离的归一化第95百分位数(HD95(norm))平均降低了33%。在所提出的方法在低数据量情况下也比现有最先进的病变分割方法表现更好,平均DSC高10%,HD95(norm)平均降低19%。源代码可在https://github.com/Ramanujam-N/MedLesSynth-LD [提交SHA cc2b15b]获取。