• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DLI-IT:一种通过图像和文本嵌入进行药物标签识别的深度学习方法。

DLI-IT: a deep learning approach to drug label identification through image and text embedding.

机构信息

FDA/National Center for Toxicological Research, 3900 NCTR Rd, Jefferson, AR, 72079, USA.

University of Arkansas at Little Rock, 2801 S. University Ave, Little Rock, AR, 72204, USA.

出版信息

BMC Med Inform Decis Mak. 2020 Apr 15;20(1):68. doi: 10.1186/s12911-020-1078-3.

DOI:10.1186/s12911-020-1078-3
PMID:32293428
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7158001/
Abstract

BACKGROUND

Drug label, or packaging insert play a significant role in all the operations from production through drug distribution channels to the end consumer. Image of the label also called Display Panel or label could be used to identify illegal, illicit, unapproved and potentially dangerous drugs. Due to the time-consuming process and high labor cost of investigation, an artificial intelligence-based deep learning model is necessary for fast and accurate identification of the drugs.

METHODS

In addition to image-based identification technology, we take advantages of rich text information on the pharmaceutical package insert of drug label images. In this study, we developed the Drug Label Identification through Image and Text embedding model (DLI-IT) to model text-based patterns of historical data for detection of suspicious drugs. In DLI-IT, we first trained a Connectionist Text Proposal Network (CTPN) to crop the raw image into sub-images based on the text. The texts from the cropped sub-images are recognized independently through the Tesseract OCR Engine and combined as one document for each raw image. Finally, we applied universal sentence embedding to transform these documents into vectors and find the most similar reference images to the test image through the cosine similarity.

RESULTS

We trained the DLI-IT model on 1749 opioid and 2365 non-opioid drug label images. The model was then tested on 300 external opioid drug label images, the result demonstrated our model achieves up-to 88% of the precision in drug label identification, which outperforms previous image-based or text-based identification method by up-to 35% improvement.

CONCLUSION

To conclude, by combining Image and Text embedding analysis under deep learning framework, our DLI-IT approach achieved a competitive performance in advancing drug label identification.

摘要

背景

药品标签或包装插页在从生产到药品分销渠道再到最终消费者的所有环节中都发挥着重要作用。标签的图像也称为展示面板或标签,可用于识别非法、违禁、未经批准和潜在危险的药品。由于调查过程耗时且劳动成本高,因此需要基于人工智能的深度学习模型来快速准确地识别药品。

方法

除了基于图像的识别技术外,我们还利用药品标签图像的包装插页上丰富的文本信息。在这项研究中,我们开发了通过图像和文本嵌入模型识别药品标签(DLI-IT),以对历史数据的基于文本的模式进行建模,从而检测可疑药品。在 DLI-IT 中,我们首先训练一个连接文本提议网络(CTPN),根据文本将原始图像裁剪成子图像。通过 Tesseract OCR 引擎识别裁剪后的子图像中的文本,并将其组合成每个原始图像的一个文档。最后,我们应用通用句子嵌入将这些文档转换为向量,并通过余弦相似度找到与测试图像最相似的参考图像。

结果

我们在 1749 张阿片类药物和 2365 张非阿片类药物标签图像上训练了 DLI-IT 模型。然后,我们在 300 张外部阿片类药物标签图像上对该模型进行了测试,结果表明,我们的模型在药物标签识别方面的准确率高达 88%,比以前的基于图像或基于文本的识别方法提高了高达 35%。

结论

总之,通过在深度学习框架下结合图像和文本嵌入分析,我们的 DLI-IT 方法在推进药物标签识别方面取得了有竞争力的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/8f32e7565b46/12911_2020_1078_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/b278f4c8e457/12911_2020_1078_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/218f0c08306d/12911_2020_1078_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/f3fade8bae3f/12911_2020_1078_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/8f32e7565b46/12911_2020_1078_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/b278f4c8e457/12911_2020_1078_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/218f0c08306d/12911_2020_1078_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/f3fade8bae3f/12911_2020_1078_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564d/7158001/8f32e7565b46/12911_2020_1078_Fig4_HTML.jpg

相似文献

1
DLI-IT: a deep learning approach to drug label identification through image and text embedding.DLI-IT:一种通过图像和文本嵌入进行药物标签识别的深度学习方法。
BMC Med Inform Decis Mak. 2020 Apr 15;20(1):68. doi: 10.1186/s12911-020-1078-3.
2
An Accurate Deep Learning-Based System for Automatic Pill Identification: Model Development and Validation.基于深度学习的精准药丸自动识别系统:模型开发与验证。
J Med Internet Res. 2023 Jan 13;25:e41043. doi: 10.2196/41043.
3
Deep Relation Embedding for Cross-Modal Retrieval.深度关系嵌入的跨模态检索。
IEEE Trans Image Process. 2021;30:617-627. doi: 10.1109/TIP.2020.3038354. Epub 2020 Dec 1.
4
Pill Box Text Identification Using DBNet-CRNN.基于 DBNet-CRNN 的药盒文本识别。
Int J Environ Res Public Health. 2023 Feb 22;20(5):3881. doi: 10.3390/ijerph20053881.
5
Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images.连笔文本:用于自然场景图像中乌尔都语文本端到端识别的综合数据集。
Data Brief. 2020 May 21;31:105749. doi: 10.1016/j.dib.2020.105749. eCollection 2020 Aug.
6
TIPS: A Framework for Text Summarising with Illustrative Pictures.提示:一个带有示例图片的文本摘要框架。
Entropy (Basel). 2021 Nov 30;23(12):1614. doi: 10.3390/e23121614.
7
Adverse Drug Event Detection from Electronic Health Records Using Hierarchical Recurrent Neural Networks with Dual-Level Embedding.基于具有双层嵌入的层次递归神经网络从电子健康记录中检测药物不良反应。
Drug Saf. 2019 Jan;42(1):113-122. doi: 10.1007/s40264-018-0765-9.
8
Transfer Learning for Toxoplasma gondii Recognition.用于弓形虫识别的迁移学习
mSystems. 2020 Jan 28;5(1):e00445-19. doi: 10.1128/mSystems.00445-19.
9
Text detection and recognition based on a lensless imaging system.基于无透镜成像系统的文本检测与识别。
Appl Opt. 2022 May 10;61(14):4177-4186. doi: 10.1364/AO.456158.
10
A hybrid approach of Weighted Fine-Tuned BERT extraction with deep Siamese Bi - LSTM model for semantic text similarity identification.一种结合加权微调BERT提取与深度孪生双向长短期记忆网络(Siamese Bi-LSTM)模型的混合方法,用于语义文本相似性识别。
Multimed Tools Appl. 2022;81(5):6131-6157. doi: 10.1007/s11042-021-11771-6. Epub 2022 Jan 6.

引用本文的文献

1
Application of Generative Artificial Intelligence Models for Accurate Prescription Label Identification and Information Retrieval for the Elderly in Northern East of Thailand.生成式人工智能模型在泰国东北部老年人准确处方标签识别和信息检索中的应用
J Imaging. 2025 Jan 6;11(1):11. doi: 10.3390/jimaging11010011.
2
A Novel Two-Stage Induced Deep Learning System for Classifying Similar Drugs with Diverse Packaging.一种用于分类具有不同包装的相似药物的新型两阶段诱导深度学习系统。
Sensors (Basel). 2023 Aug 19;23(16):7275. doi: 10.3390/s23167275.

本文引用的文献

1
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
2
A parallel approximate string matching under Levenshtein distance on graphics processing units using warp-shuffle operations.一种在图形处理单元上使用 warp 混洗操作的基于莱文斯坦距离的并行近似字符串匹配。
PLoS One. 2017 Oct 10;12(10):e0186251. doi: 10.1371/journal.pone.0186251. eCollection 2017.
3
Framewise phoneme classification with bidirectional LSTM and other neural network architectures.
使用双向长短期记忆网络和其他神经网络架构进行逐帧音素分类。
Neural Netw. 2005 Jun-Jul;18(5-6):602-10. doi: 10.1016/j.neunet.2005.06.042.
4
ASHP guidelines on hospital drug distribution and control (with references).美国卫生系统药师协会医院药品分发与控制指南(附参考文献)
Am J Hosp Pharm. 1980 Aug;37(8):1097-103.