Vasanthi Ponduri, Srinivasu Lingamallu Naga, Teju Ventrapragada, Sowmya Kambhampati Venkata, Stan Anca, Sita Valentin, Miclea Liviu, Stan Ovidiu
Eswar College of Engineering, India.
Dhanekula Institute of Engineering and Technology, India.
Comput Biol Med. 2025 May;190:110023. doi: 10.1016/j.compbiomed.2025.110023. Epub 2025 Mar 18.
Kidney stone detection (KSD) is a critical aspect of medical imaging that necessitates precise identification and classification through computed tomography (CT) scans. The precise identification of the specific type of kidney stone is imperative for the administration of effective treatment. Although contemporary models demonstrate proficiency in classification tasks, they often face challenges when multiple stones are present within a single CT slice and frequently suffer from slow prediction times. This study aims to address these challenges by introducing the RT-DETR model, a novel approach designed for efficient and accurate detection of multiple kidney stones. The code for the proposed model is publicly available in https://github.com/vasanthi457/Kidney-Stone-Detection-using-RTDETR.git.
The RT-DETR model utilizes an innovative hybrid encoder that processes multi-scale features through Attention-based Intra-Scale Feature Interaction (AIFI) and a Cross-Scale Feature-fusion Module (CSFM). Furthermore, an Intersection over Union (IoU)-aware query selection mechanism is incorporated to improve the accuracy of detecting heterogeneous kidney stones. The model's performance was evaluated using a dataset of CT images annotated with kidney stone locations.
The RT-DETR proposed model achieved a precision (P) of 74.3 %, a recall (R) of 91 %, a mean Average Precision (mAP) of 73.3 %, an F1-Score of 73 %, an accuracy of 82.65 %, and a detection time of 1.043 s in detecting small kidney stones. These results surpass those of existing models like Optimized Fusion (OF), Kronecker Convolution (KC), Exemplar Darknet19 (ED19), Deep Learning (DL), Ensemble Learning (EL), YOLOv5, and YOLOv8, which had lower metric values. Moreover, the statistical tests confirmed that the RT-DETR model exhibits a significant difference in performance compared to other models in kidney stone prediction.
The RT-DETR model represents a significant advancement in kidney stone detection, offering both efficiency and accuracy. By effectively managing multi-scale features and enhancing detection with IoU-aware queries, the RT-DETR model improves diagnostic capabilities and treatment strategies for patients with kidney stones. This innovation has the potential to streamline clinical workflows and enhance patient outcomes through more precise and timely diagnosis.
肾结石检测(KSD)是医学成像的一个关键方面,需要通过计算机断层扫描(CT)进行精确识别和分类。准确识别特定类型的肾结石对于有效治疗至关重要。尽管当代模型在分类任务中表现出一定能力,但当单个CT切片中存在多个结石时,它们往往面临挑战,并且预测时间通常较慢。本研究旨在通过引入RT-DETR模型来应对这些挑战,该模型是一种专为高效准确检测多个肾结石而设计的新方法。所提出模型的代码可在https://github.com/vasanthi457/Kidney-Stone-Detection-using-RTDETR.git上公开获取。
RT-DETR模型采用了一种创新的混合编码器,通过基于注意力的尺度内特征交互(AIFI)和跨尺度特征融合模块(CSFM)来处理多尺度特征。此外,还引入了一种基于交并比(IoU)的查询选择机制,以提高检测异质性肾结石的准确性。使用标注有肾结石位置的CT图像数据集对该模型的性能进行评估。
所提出的RT-DETR模型在检测小肾结石时,精度(P)达到74.3%,召回率(R)为91%,平均精度均值(mAP)为73.3%,F1分数为73%,准确率为82.65%,检测时间为1.043秒。这些结果超过了优化融合(OF)、克罗内克卷积(KC)、样本Darknet19(ED19)、深度学习(DL)、集成学习(EL)、YOLOv5和YOLOv8等现有模型,这些模型的指标值较低。此外,统计测试证实,RT-DETR模型在肾结石预测方面的性能与其他模型存在显著差异。
RT-DETR模型代表了肾结石检测方面的重大进展,兼具效率和准确性。通过有效管理多尺度特征并利用基于IoU的查询增强检测,RT-DETR模型提高了肾结石患者的诊断能力和治疗策略。这项创新有可能通过更精确和及时的诊断简化临床工作流程并改善患者预后。