基于 RetinaNet、SSD 和 YOLO v3 的实时药丸识别比较。

Comparison of RetinaNet, SSD, and YOLO v3 for real-time pill identification.

机构信息

Department of Pharmacy, The Third Affiliated Hospital of Southern Medical University, Guangzhou, 510000, China.

出版信息

BMC Med Inform Decis Mak. 2021 Nov 22;21(1):324. doi: 10.1186/s12911-021-01691-8.

DOI:10.1186/s12911-021-01691-8

PMID:34809632

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8609721/

Abstract

BACKGROUND

The correct identification of pills is very important to ensure the safe administration of drugs to patients. Here, we use three current mainstream object detection models, namely RetinaNet, Single Shot Multi-Box Detector (SSD), and You Only Look Once v3(YOLO v3), to identify pills and compare the associated performance.

METHODS

In this paper, we introduce the basic principles of three object detection models. We trained each algorithm on a pill image dataset and analyzed the performance of the three models to determine the best pill recognition model. The models were then used to detect difficult samples and we compared the results.

RESULTS

The mean average precision (MAP) of RetinaNet reached 82.89%, but the frames per second (FPS) is only one third of YOLO v3, which makes it difficult to achieve real-time performance. SSD does not perform as well on the indicators of MAP and FPS. Although the MAP of YOLO v3 is slightly lower than the others (80.69%), it has a significant advantage in terms of detection speed. YOLO v3 also performed better when tasked with hard sample detection, and therefore the model is more suitable for deployment in hospital equipment.

CONCLUSION

Our study reveals that object detection can be applied for real-time pill identification in a hospital pharmacy, and YOLO v3 exhibits an advantage in detection speed while maintaining a satisfactory MAP.

摘要

背景

正确识别药丸对于确保患者安全用药非常重要。在这里，我们使用了三种当前主流的目标检测模型，即 RetinaNet、Single Shot Multi-Box Detector（SSD）和 You Only Look Once v3（YOLO v3）来识别药丸，并比较相关性能。

方法

在本文中，我们介绍了三种目标检测模型的基本原理。我们在药丸图像数据集上训练了每个算法，并分析了三种模型的性能，以确定最佳的药丸识别模型。然后，我们使用这些模型来检测困难样本，并比较结果。

结果

RetinaNet 的平均准确率（MAP）达到了 82.89%，但每秒帧数（FPS）仅为 YOLO v3 的三分之一，这使得它难以实现实时性能。SSD 在 MAP 和 FPS 等指标上的表现也不是很好。虽然 YOLO v3 的 MAP 略低于其他模型（80.69%），但它在检测速度方面具有显著优势。YOLO v3 在进行困难样本检测时表现也更好，因此该模型更适合部署在医院设备中。

结论

我们的研究表明，目标检测可用于医院药房的实时药丸识别，而 YOLO v3 在保持满意的 MAP 的同时，在检测速度方面具有优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/865c/8609721/b5c9f5cc3f23/12911_2021_1691_Fig1_HTML.jpg

相似文献

Comparison of RetinaNet, SSD, and YOLO v3 for real-time pill identification.

BMC Med Inform Decis Mak. 2021 Nov 22;21(1):324. doi: 10.1186/s12911-021-01691-8.

Agricultural Greenhouses Detection in High-Resolution Satellite Images Based on Convolutional Neural Networks: Comparison of Faster R-CNN, YOLO v3 and SSD.

Sensors (Basel). 2020 Aug 31;20(17):4938. doi: 10.3390/s20174938.

Improved YOLO-V3 with DenseNet for Multi-Scale Remote Sensing Target Detection.

Sensors (Basel). 2020 Jul 31;20(15):4276. doi: 10.3390/s20154276.

Platelet Detection Based on Improved YOLO_v3.

Cyborg Bionic Syst. 2022 Sep 14;2022:9780569. doi: 10.34133/2022/9780569. eCollection 2022.

Comparative Evaluation of Convolutional Neural Network Object Detection Algorithms for Vehicle Detection.

J Imaging. 2024 Jul 5;10(7):162. doi: 10.3390/jimaging10070162.

Real-Time Pattern-Recognition of GPR Images with YOLO v3 Implemented by Tensorflow.

Sensors (Basel). 2020 Nov 12;20(22):6476. doi: 10.3390/s20226476.

Improved SSD network for fast concealed object detection and recognition in passive terahertz security images.

Sci Rep. 2022 Jul 15;12(1):12082. doi: 10.1038/s41598-022-16208-0.

Detection and identification of tea leaf diseases based on AX-RetinaNet.

Sci Rep. 2022 Feb 9;12(1):2183. doi: 10.1038/s41598-022-06181-z.

Efficient Detection Method of Pig-Posture Behavior Based on Multiple Attention Mechanism.

Comput Intell Neurosci. 2022 Jul 16;2022:1759542. doi: 10.1155/2022/1759542. eCollection 2022.

Automatic Target Detection from Satellite Imagery Using Machine Learning.

Sensors (Basel). 2022 Feb 2;22(3):1147. doi: 10.3390/s22031147.

引用本文的文献

CNN-Based Automatic Tablet Classification Using a Vibration-Controlled Bowl Feeder with Spiral Torque Optimization.

Sensors (Basel). 2025 Jul 8;25(14):4248. doi: 10.3390/s25144248.

Deep Learning-Based Precision Cropping of Eye Regions in Strabismus Photographs: Algorithm Development and Validation Study for Workflow Optimization.

J Med Internet Res. 2025 Jul 17;27:e74402. doi: 10.2196/74402.

OS-DETR: End-to-end brain tumor detection framework based on orthogonal channel shuffle networks.

PLoS One. 2025 May 13;20(5):e0320757. doi: 10.1371/journal.pone.0320757. eCollection 2025.

HYFF-CB: Hybrid Feature Fusion Visual Model for Cargo Boxes.

Sensors (Basel). 2025 Mar 17;25(6):1865. doi: 10.3390/s25061865.

Application of dual branch and bidirectional feedback feature extraction networks for real time accurate positioning of stents.

Sci Rep. 2025 Mar 28;15(1):10682. doi: 10.1038/s41598-025-86304-4.

Disease detection on exterior surfaces of buildings using deep learning in China.

Sci Rep. 2025 Mar 12;15(1):8564. doi: 10.1038/s41598-025-92112-7.

Using deep learning model integration to build a smart railway traffic safety monitoring system.

Sci Rep. 2025 Feb 4;15(1):4224. doi: 10.1038/s41598-025-88830-7.

Automated Image Clarity Detection for the Improvement of Colposcopy Imaging with Multiple Devices.

Biomed Signal Process Control. 2025 Feb;100(Pt B). doi: 10.1016/j.bspc.2024.106948. Epub 2024 Sep 27.

Development of an automated artificial intelligence-based system for urogenital schistosomiasis diagnosis using digital image analysis techniques and a robotized microscope.

PLoS Negl Trop Dis. 2024 Nov 5;18(11):e0012614. doi: 10.1371/journal.pntd.0012614. eCollection 2024 Nov.

Intraoperative detection of parathyroid glands using artificial intelligence: optimizing medical image training with data augmentation methods.

Surg Endosc. 2024 Oct;38(10):5732-5745. doi: 10.1007/s00464-024-11115-z. Epub 2024 Aug 13.

本文引用的文献

Deep Learning-Assisted Three-Dimensional Fluorescence Difference Spectroscopy for Identification and Semiquantification of Illicit Drugs in Biofluids.

Anal Chem. 2019 Aug 6;91(15):9343-9347. doi: 10.1021/acs.analchem.9b01315. Epub 2019 Jun 13.

Drug identification by the patient: Perception of patients, physicians and pharmacists.

Therapie. 2019 Dec;74(6):591-598. doi: 10.1016/j.therap.2019.03.003. Epub 2019 Apr 2.

Robotic dispensing improves patient safety, inventory management, and staff satisfaction in an outpatient hospital pharmacy.

J Eval Clin Pract. 2019 Feb;25(1):28-35. doi: 10.1111/jep.13014. Epub 2018 Aug 22.

The National Library of Medicine Pill Image Recognition Challenge: An Initial Report.

IEEE Appl Imag Pattern Recognit Workshop. 2016 Oct;2016. doi: 10.1109/AIPR.2016.8010584. Epub 2017 Aug 17.

Development of fine-grained pill identification algorithm using deep convolutional network.

J Biomed Inform. 2017 Oct;74:130-136. doi: 10.1016/j.jbi.2017.09.005. Epub 2017 Sep 15.

Predicting Urban Medical Services Demand in China: An Improved Grey Markov Chain Model by Taylor Approximation.

Int J Environ Res Public Health. 2017 Aug 6;14(8):883. doi: 10.3390/ijerph14080883.

Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data.

Mol Pharm. 2016 Jul 5;13(7):2524-30. doi: 10.1021/acs.molpharmaceut.6b00248. Epub 2016 Jun 8.

Medication Safety Systems and the Important Role of Pharmacists.

Drugs Aging. 2016 Mar;33(3):213-21. doi: 10.1007/s40266-016-0358-1.

Transformation of potential medical demand in China: A system dynamics simulation model.

J Biomed Inform. 2015 Oct;57:399-414. doi: 10.1016/j.jbi.2015.08.015. Epub 2015 Aug 19.

Implement the RFID position based system of automatic tablets packaging machine for patient safety.

J Med Syst. 2012 Dec;36(6):3463-71. doi: 10.1007/s10916-011-9799-6. Epub 2011 Nov 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于 RetinaNet、SSD 和 YOLO v3 的实时药丸识别比较。

Comparison of RetinaNet, SSD, and YOLO v3 for real-time pill identification.

机构信息

Department of Pharmacy, The Third Affiliated Hospital of Southern Medical University, Guangzhou, 510000, China.