• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChemReco:利用深度学习对手绘碳氢氧结构进行自动识别

ChemReco: automated recognition of hand-drawn carbon-hydrogen-oxygen structures using deep learning.

作者信息

Ouyang Hengjie, Liu Wei, Tao Jiajun, Luo Yanghong, Zhang Wanjia, Zhou Jiayu, Geng Shuqi, Zhang Chengpeng

机构信息

School of Informatics, Hunan University of Chinese Medicine, Changsha, 410208, Hunan, People's Republic of China.

出版信息

Sci Rep. 2024 Jul 25;14(1):17126. doi: 10.1038/s41598-024-67496-7.

DOI:10.1038/s41598-024-67496-7
PMID:39054356
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11272916/
Abstract

Chemical molecular structures are a direct and convenient means of expressing chemical knowledge, playing a vital role in academic communication. In chemistry, hand drawing is a common task for students and researchers. If we can convert hand-drawn chemical molecular structures into machine-readable formats, like SMILES encoding, computers can efficiently process and analyze these structures, significantly enhancing the efficiency of chemical research. Furthermore, with the progress of educational technology, automated grading is gaining popularity. When machines automatically recognize chemical molecular structures and assess the correctness of the drawings, it offers great convenience to teachers. We created ChemReco, a tool designed to identify chemical molecular structures involving three atoms: C, H, and O, providing convenience for chemical researchers. Currently, there are limited studies on hand-drawn chemical molecular structures. Therefore, the primary focus of this paper is constructing datasets. We propose a synthetic image method to rapidly generate images resembling hand-drawn chemical molecular structures, enhancing dataset acquisition efficiency. Regarding model selection, the hand-drawn chemical molecule structural recognition model developed in this article achieves a final recognition accuracy of 96.90%. This model employs the encoder-decoder architecture of EfficientNet + Transformer, demonstrating superior performance compared to other encoder-decoder combinations.

摘要

化学分子结构是表达化学知识的一种直接且便捷的方式,在学术交流中起着至关重要的作用。在化学领域,手绘是学生和研究人员的一项常见任务。如果我们能够将手绘的化学分子结构转换为机器可读的格式,如SMILES编码,计算机就能有效地处理和分析这些结构,显著提高化学研究的效率。此外,随着教育技术的进步,自动评分越来越受欢迎。当机器自动识别化学分子结构并评估绘图的正确性时,这给教师带来了极大的便利。我们创建了ChemReco,这是一个旨在识别包含C、H和O三种原子的化学分子结构的工具,为化学研究人员提供了便利。目前,关于手绘化学分子结构的研究有限。因此,本文的主要重点是构建数据集。我们提出了一种合成图像方法,以快速生成类似于手绘化学分子结构的图像,提高数据集获取效率。在模型选择方面,本文开发的手绘化学分子结构识别模型最终识别准确率达到了96.90%。该模型采用了EfficientNet + Transformer的编码器-解码器架构,与其他编码器-解码器组合相比表现出卓越的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/436f25fd13e2/41598_2024_67496_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/349d24d5cce1/41598_2024_67496_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/9ece9d66b80a/41598_2024_67496_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/dd503f3b1299/41598_2024_67496_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/3e3e4e4aac32/41598_2024_67496_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/39797fa76177/41598_2024_67496_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/08929bc12e8f/41598_2024_67496_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/9edc2b5d0654/41598_2024_67496_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/f3fd8cc97ad7/41598_2024_67496_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/2807f7484748/41598_2024_67496_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/18c0386e58b3/41598_2024_67496_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/ab314bbe683a/41598_2024_67496_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/6269b60e793a/41598_2024_67496_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/436f25fd13e2/41598_2024_67496_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/349d24d5cce1/41598_2024_67496_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/9ece9d66b80a/41598_2024_67496_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/dd503f3b1299/41598_2024_67496_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/3e3e4e4aac32/41598_2024_67496_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/39797fa76177/41598_2024_67496_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/08929bc12e8f/41598_2024_67496_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/9edc2b5d0654/41598_2024_67496_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/f3fd8cc97ad7/41598_2024_67496_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/2807f7484748/41598_2024_67496_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/18c0386e58b3/41598_2024_67496_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/ab314bbe683a/41598_2024_67496_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/6269b60e793a/41598_2024_67496_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6e/11272916/436f25fd13e2/41598_2024_67496_Fig13_HTML.jpg

相似文献

1
ChemReco: automated recognition of hand-drawn carbon-hydrogen-oxygen structures using deep learning.ChemReco:利用深度学习对手绘碳氢氧结构进行自动识别
Sci Rep. 2024 Jul 25;14(1):17126. doi: 10.1038/s41598-024-67496-7.
2
ChemPix: automated recognition of hand-drawn hydrocarbon structures using deep learning.ChemPix:利用深度学习对手绘烃类结构进行自动识别
Chem Sci. 2021 Jul 3;12(31):10622-10633. doi: 10.1039/d1sc02957f. eCollection 2021 Aug 11.
3
Advancements in hand-drawn chemical structure recognition through an enhanced DECIMER architecture.通过增强的DECIMER架构实现手绘化学结构识别的进展。
J Cheminform. 2024 Jul 5;16(1):78. doi: 10.1186/s13321-024-00872-7.
4
Image2InChI: Automated Molecular Optical Image Recognition.图像到国际化合物标识:自动分子光学图像识别
J Chem Inf Model. 2024 May 13;64(9):3640-3649. doi: 10.1021/acs.jcim.3c02082. Epub 2024 Feb 15.
5
Investigation of chemical structure recognition by encoder-decoder models in learning progress.在学习过程中通过编码器-解码器模型进行化学结构识别的研究。
J Cheminform. 2023 Apr 12;15(1):45. doi: 10.1186/s13321-023-00713-z.
6
Context awareness based Sketch-DeepNet architecture for hand-drawn sketches classification and recognition in AIoT.用于人工智能物联网中手绘草图分类与识别的基于上下文感知的Sketch-DeepNet架构
PeerJ Comput Sci. 2023 Apr 27;9:e1186. doi: 10.7717/peerj-cs.1186. eCollection 2023.
7
Transfer Learning: Making Retrosynthetic Predictions Based on a Small Chemical Reaction Dataset Scale to a New Level.迁移学习:基于小规模化学反应数据集的逆向合成预测扩展到新的水平。
Molecules. 2020 May 19;25(10):2357. doi: 10.3390/molecules25102357.
8
ABC-Net: a divide-and-conquer based deep learning architecture for SMILES recognition from molecular images.ABC-Net:一种基于分而治之的深度学习架构,用于从分子图像中识别 SMILES。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac033.
9
DECIMER 1.0: deep learning for chemical image recognition using transformers.DECIMER 1.0:使用Transformer进行化学图像识别的深度学习
J Cheminform. 2021 Aug 17;13(1):61. doi: 10.1186/s13321-021-00538-8.
10
MICER: a pre-trained encoder-decoder architecture for molecular image captioning.MICER:一种用于分子图像字幕生成的预训练编解码器架构。
Bioinformatics. 2022 Sep 30;38(19):4562-4572. doi: 10.1093/bioinformatics/btac545.

本文引用的文献

1
Review of techniques and models used in optical chemical structure recognition in images and scanned documents.图像和扫描文档中光学化学结构识别所使用的技术与模型综述。
J Cheminform. 2022 Sep 9;14(1):61. doi: 10.1186/s13321-022-00642-3.
2
Img2Mol - accurate SMILES recognition from molecular graphical depictions.Img2Mol - 从分子图形描绘中准确识别SMILES
Chem Sci. 2021 Sep 29;12(42):14174-14181. doi: 10.1039/d1sc01839f. eCollection 2021 Nov 3.
3
ChemPix: automated recognition of hand-drawn hydrocarbon structures using deep learning.
ChemPix:利用深度学习对手绘烃类结构进行自动识别
Chem Sci. 2021 Jul 3;12(31):10622-10633. doi: 10.1039/d1sc02957f. eCollection 2021 Aug 11.
4
DECIMER 1.0: deep learning for chemical image recognition using transformers.DECIMER 1.0:使用Transformer进行化学图像识别的深度学习
J Cheminform. 2021 Aug 17;13(1):61. doi: 10.1186/s13321-021-00538-8.
5
DECIMER: towards deep learning for chemical image recognition.DECIMER:迈向用于化学图像识别的深度学习
J Cheminform. 2020 Oct 27;12(1):65. doi: 10.1186/s13321-020-00469-w.
6
Learning continuous and data-driven molecular descriptors by translating equivalent chemical representations.通过转换等效化学表示来学习连续且数据驱动的分子描述符。
Chem Sci. 2018 Nov 19;10(6):1692-1701. doi: 10.1039/c8sc04175j. eCollection 2019 Feb 14.
7
Molecular Structure Extraction from Documents Using Deep Learning.使用深度学习从文档中提取分子结构。
J Chem Inf Model. 2019 Mar 25;59(3):1017-1029. doi: 10.1021/acs.jcim.8b00669. Epub 2019 Feb 27.
8
CLiDE Pro: the latest generation of CLiDE, a tool for optical chemical structure recognition.CLiDE Pro:CLiDE的最新一代产品,一款用于光学化学结构识别的工具。
J Chem Inf Model. 2009 Apr;49(4):780-7. doi: 10.1021/ci800449t.
9
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.