Choi Jiho, Hwang Gyeongyeon, Ji Yewon, Yoon Hakyoung, Lee Sang Jun
Division of Electronics and Information Engineering, College of Engineering, Jeonbuk National University, 567, Baekje-daero, Deokjin-gu, 54896, Jeonju, Republic of Korea.
Department of Veterinary Medical Imaging, College of Veterinary Medicine, Jeonbuk National University, 79, Gobong-ro, 56443, Iksan, Republic of Korea.
Comput Biol Med. 2025 Mar;186:109609. doi: 10.1016/j.compbiomed.2024.109609. Epub 2025 Jan 2.
Kidney stone is a common urological disease in dogs and can lead to serious complications such as pyelonephritis and kidney failure. However, manual diagnosis involves a lot of burdens on radiologists and may cause human errors due to fatigue. Automated methods using deep learning models have been explored to overcome this limitation. Veterinary images present additional challenges due to the various sizes of organs depending on different species, with particularly poor performance on smaller lesions. These challenges suggest the need for a robust deep learning model that can accurately detect various sizes of kidney stones and kidneys. Moreover, public datasets with high-quality CT annotations for dog lesions and organs are almost not available. To address these challenges, we introduce a parallel frequency-spatial hybrid network (PFSH-Net), specifically designed for detecting kidney stones in CT images of dogs. The PFSH-Net consists of an encoder-decoder architecture that simultaneously captures spatial and frequency domain features. Moreover, we propose a multi-scale fusion (MSF) module that integrates low-level and high-level representations in the spatial and frequency domains. We collected a veterinary CT dataset with high-quality labels annotated by expert veterinary radiologists, and this dataset is referred to as the JBNU-ACT dataset. The effectiveness of the proposed method is demonstrated using a real-world dataset, with performance improvements of 4.1366, and 0.6234 on the HD, and ASD metrics, respectively. Moreover, the generalization of the model is evaluated on the publicly available BTCV dataset by achieving the average DSC score of 0.7960, which outperforms the previous method. Our code is available at https://github.com/gyeongyeon-Hwang/veterinary-kidney-segmentation.
肾结石是犬类常见的泌尿系统疾病,可导致肾盂肾炎和肾衰竭等严重并发症。然而,人工诊断给放射科医生带来很大负担,且可能因疲劳导致人为错误。人们已探索使用深度学习模型的自动化方法来克服这一局限性。由于不同物种的器官大小各异,兽医图像带来了额外挑战,对较小病变的检测性能尤其不佳。这些挑战表明需要一个强大的深度学习模型,能够准确检测各种大小的肾结石和肾脏。此外,几乎没有针对犬类病变和器官的高质量CT标注的公共数据集。为应对这些挑战,我们引入了一种并行频率-空间混合网络(PFSH-Net),专门设计用于检测犬类CT图像中的肾结石。PFSH-Net由一个同时捕捉空间和频域特征的编码器-解码器架构组成。此外,我们提出了一种多尺度融合(MSF)模块,该模块在空间和频域中整合低层次和高层次表示。我们收集了一个由兽医放射科专家标注高质量标签的兽医CT数据集,该数据集被称为JBNU-ACT数据集。使用真实世界数据集证明了所提方法的有效性,在HD和ASD指标上的性能分别提高了4.1366和0.6234。此外,通过在公开可用的BTCV数据集上实现平均DSC分数0.7960来评估模型的泛化能力,该分数优于先前的方法。我们的代码可在https://github.com/gyeongyeon-Hwang/veterinary-kidney-segmentation获取。