• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 DBNet-CRNN 的药盒文本识别。

Pill Box Text Identification Using DBNet-CRNN.

机构信息

School of Computer Science, Yangtze University, Jingzhou 434025, China.

出版信息

Int J Environ Res Public Health. 2023 Feb 22;20(5):3881. doi: 10.3390/ijerph20053881.

DOI:10.3390/ijerph20053881
PMID:36900892
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10002137/
Abstract

The recognition process of natural scenes is complicated at present, and images themselves may be complex owing to the special features of natural scenes. In this study, we use the detection and recognition of pill box text as an application scenario and design a deep-learning-based text detection algorithm for such natural scenes. We propose an end-to-end graphical text detection and recognition model and implement a detection system based on the B/S research application for pill box recognition, which uses DBNet as the text detection framework and a convolutional recurrent neural network (CRNN) as the text recognition framework. No prior image preprocessing is required in the detection and recognition processes. The recognition result from the back-end is returned to the front-end display. Compared with traditional methods, this recognition process reduces the complexity of preprocessing prior to image detection and improves the simplicity of the model application. Experiments on the detection and recognition of 100 pill boxes demonstrate that the proposed method achieves better accuracy in text localization and recognition results than the previous CTPN + CRNN method. The proposed method is significantly more accurate and easier to use than the traditional approach in terms of both training and recognition processes.

摘要

目前自然场景的识别过程较为复杂,并且由于自然场景的特殊性质,图像本身可能较为复杂。在本研究中,我们将药盒文本的检测和识别作为应用场景,设计了一种基于深度学习的自然场景文本检测算法。我们提出了一种端到端的图形文本检测和识别模型,并为药盒识别实现了基于 B/S 研究应用的检测系统,该系统使用 DBNet 作为文本检测框架和卷积循环神经网络 (CRNN) 作为文本识别框架。在检测和识别过程中无需进行图像预处理。后端的识别结果将返回到前端显示。与传统方法相比,这种识别过程降低了图像检测前的预处理复杂性,并提高了模型应用的简单性。对 100 个药盒的检测和识别实验表明,所提出的方法在文本定位和识别结果方面比之前的 CTPN+CRNN 方法具有更好的准确性。在训练和识别过程方面,所提出的方法比传统方法更加准确和易于使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/27296eae75fd/ijerph-20-03881-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/b799e683ed16/ijerph-20-03881-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/6577bda2766f/ijerph-20-03881-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/9265107d356f/ijerph-20-03881-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/7ae87bdd0c00/ijerph-20-03881-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/90ea154ce9f7/ijerph-20-03881-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/0f09f38d05c2/ijerph-20-03881-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/47a6ed466932/ijerph-20-03881-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/89b4ba98b27f/ijerph-20-03881-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/cd1c7bdcbd30/ijerph-20-03881-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/c87d9ff4693c/ijerph-20-03881-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/d4ee2fe1988f/ijerph-20-03881-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/2e0a91e483d9/ijerph-20-03881-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/d3ac90e7d8d3/ijerph-20-03881-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/27296eae75fd/ijerph-20-03881-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/b799e683ed16/ijerph-20-03881-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/6577bda2766f/ijerph-20-03881-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/9265107d356f/ijerph-20-03881-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/7ae87bdd0c00/ijerph-20-03881-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/90ea154ce9f7/ijerph-20-03881-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/0f09f38d05c2/ijerph-20-03881-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/47a6ed466932/ijerph-20-03881-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/89b4ba98b27f/ijerph-20-03881-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/cd1c7bdcbd30/ijerph-20-03881-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/c87d9ff4693c/ijerph-20-03881-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/d4ee2fe1988f/ijerph-20-03881-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/2e0a91e483d9/ijerph-20-03881-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/d3ac90e7d8d3/ijerph-20-03881-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd8/10002137/27296eae75fd/ijerph-20-03881-g014.jpg

相似文献

1
Pill Box Text Identification Using DBNet-CRNN.基于 DBNet-CRNN 的药盒文本识别。
Int J Environ Res Public Health. 2023 Feb 22;20(5):3881. doi: 10.3390/ijerph20053881.
2
Text detection and recognition based on a lensless imaging system.基于无透镜成像系统的文本检测与识别。
Appl Opt. 2022 May 10;61(14):4177-4186. doi: 10.1364/AO.456158.
3
A Sequential Handwriting Recognition Model Based on a Dynamically Configurable CRNN.基于动态可配置 CRNN 的顺序手写识别模型。
Sensors (Basel). 2021 Nov 2;21(21):7306. doi: 10.3390/s21217306.
4
Text Recognition Model Based on Multi-Scale Fusion CRNN.基于多尺度融合CRNN的文本识别模型
Sensors (Basel). 2023 Aug 8;23(16):7034. doi: 10.3390/s23167034.
5
Towards End-to-End Text Spotting in Natural Scenes.面向自然场景的端到端文本检测。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7266-7281. doi: 10.1109/TPAMI.2021.3095916. Epub 2022 Sep 14.
6
Convolutional Recurrent Neural Network-Based Event Detection in Tunnels Using Multiple Microphones.基于卷积递归神经网络的多麦克风隧道事件检测
Sensors (Basel). 2019 Jun 14;19(12):2695. doi: 10.3390/s19122695.
7
Scene Text Detection Based on Two-Branch Feature Extraction.基于双分支特征提取的场景文本检测。
Sensors (Basel). 2022 Aug 20;22(16):6262. doi: 10.3390/s22166262.
8
Research and Application of Ancient Chinese Pattern Restoration Based on Deep Convolutional Neural Network.基于深度卷积神经网络的中国古图案恢复研究与应用。
Comput Intell Neurosci. 2021 Dec 10;2021:2691346. doi: 10.1155/2021/2691346. eCollection 2021.
9
EEG-based emotion charting for Parkinson's disease patients using Convolutional Recurrent Neural Networks and cross dataset learning.基于 EEG 的帕金森病患者情绪图表分析,使用卷积循环神经网络和跨数据集学习。
Comput Biol Med. 2022 May;144:105327. doi: 10.1016/j.compbiomed.2022.105327. Epub 2022 Mar 11.
10
ECG-based multi-class arrhythmia detection using spatio-temporal attention-based convolutional recurrent neural network.基于心电图的多类心律失常检测:使用基于时空注意力的卷积循环神经网络
Artif Intell Med. 2020 Jun;106:101856. doi: 10.1016/j.artmed.2020.101856. Epub 2020 May 11.

本文引用的文献

1
A Robust Deep-Learning-Based Detector for Real-Time Tomato Plant Diseases and Pests Recognition.基于深度学习的实时番茄病虫害识别稳健探测器。
Sensors (Basel). 2017 Sep 4;17(9):2022. doi: 10.3390/s17092022.
2
An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition.基于图像的序列识别的端到端可训练神经网络及其在场景文本识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 2017 Nov;39(11):2298-2304. doi: 10.1109/TPAMI.2016.2646371. Epub 2016 Dec 29.
3
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.
更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
4
A fast learning algorithm for deep belief nets.一种用于深度信念网络的快速学习算法。
Neural Comput. 2006 Jul;18(7):1527-54. doi: 10.1162/neco.2006.18.7.1527.
5
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.