• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CLiDE Pro:CLiDE的最新一代产品,一款用于光学化学结构识别的工具。

CLiDE Pro: the latest generation of CLiDE, a tool for optical chemical structure recognition.

作者信息

Valko Aniko T, Johnson A Peter

机构信息

Keymodule Ltd., Hobberley Lodge, Hobberley Lane, Leeds LS17 8JQ, United Kingdom.

出版信息

J Chem Inf Model. 2009 Apr;49(4):780-7. doi: 10.1021/ci800449t.

DOI:10.1021/ci800449t
PMID:19298076
Abstract

We present CLiDE Pro, the latest version of the output of the long-term CLiDE project for the development of tools for automatic extraction of chemical information from the literature. CLiDE Pro is concerned with the extraction of chemical structure and generic structure information from electronic images of chemical molecules available online as well as pages of scanned chemical documents. The information is extracted in three phases, first the image is segmented into text and graphical regions, then graphical regions are analyzed and where possible the connection tables are reconstructed, and finally any generic structures are interpreted by matching R-groups found in structure diagrams with the ones located in the text. The program has been tested on a large set of images of chemical structures originating from various sources. The results demonstrate good performance in the reconstruction of connection tables with few errors in the interpretation of the individual drawing features found in the structure diagrams. This full test set is presented for use in the validation of other similar systems.

摘要

我们展示了CLiDE Pro,这是长期CLiDE项目的最新成果,该项目旨在开发从文献中自动提取化学信息的工具。CLiDE Pro关注从在线提供的化学分子电子图像以及扫描的化学文档页面中提取化学结构和通用结构信息。信息提取分三个阶段进行,首先将图像分割为文本和图形区域,然后分析图形区域,并在可能的情况下重建连接表,最后通过将结构图中找到的R基团与文本中定位的R基团进行匹配来解释任何通用结构。该程序已在大量源自各种来源的化学结构图像上进行了测试。结果表明该程序在重建连接表方面表现良好,在解释结构图中发现的各个绘图特征时错误较少。这个完整的测试集可供用于验证其他类似系统。

相似文献

1
CLiDE Pro: the latest generation of CLiDE, a tool for optical chemical structure recognition.CLiDE Pro:CLiDE的最新一代产品,一款用于光学化学结构识别的工具。
J Chem Inf Model. 2009 Apr;49(4):780-7. doi: 10.1021/ci800449t.
2
KiPar, a tool for systematic information retrieval regarding parameters for kinetic modelling of yeast metabolic pathways.KiPar,一种用于系统检索酵母代谢途径动力学建模参数相关信息的工具。
Bioinformatics. 2009 Jun 1;25(11):1404-11. doi: 10.1093/bioinformatics/btp175. Epub 2009 Mar 31.
3
Encoding and decoding graphical chemical structures as two-dimensional (PDF417) barcodes.将图形化学结构编码和解码为二维(PDF417)条形码。
J Chem Inf Model. 2005 May-Jun;45(3):572-80. doi: 10.1021/ci049758i.
4
Automated extraction of chemical structure information from digital raster images.从数字光栅图像中自动提取化学结构信息。
Chem Cent J. 2009 Feb 5;3:4. doi: 10.1186/1752-153X-3-4.
5
[Construction of chemical information database based on optical structure recognition technique].基于光学结构识别技术的化学信息数据库构建
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):352-357.
6
[On information. An essay].[论信息。一篇短文]。
Tijdschr Diergeneeskd. 1986 Jan 1;111(1):3-8.
7
From structure diagrams to visual chemical patterns.从结构图表到可视化化学模式。
J Chem Inf Model. 2010 Sep 27;50(9):1529-35. doi: 10.1021/ci100209a.
8
Selective image similarity measure for bronchoscope tracking based on image registration.基于图像配准的支气管镜跟踪选择性图像相似性度量
Med Image Anal. 2009 Aug;13(4):621-33. doi: 10.1016/j.media.2009.06.001. Epub 2009 Jun 9.
9
Markov logic networks for optical chemical structure recognition.用于光学化学结构识别的马尔可夫逻辑网络。
J Chem Inf Model. 2014 Aug 25;54(8):2380-90. doi: 10.1021/ci5002197. Epub 2014 Aug 6.
10
Signature detection and matching for document image retrieval.用于文档图像检索的签名检测与匹配。
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2015-31. doi: 10.1109/TPAMI.2008.237.

引用本文的文献

1
MolNexTR: a generalized deep learning model for molecular image recognition.MolNexTR:一种用于分子图像识别的通用深度学习模型。
J Cheminform. 2024 Dec 18;16(1):141. doi: 10.1186/s13321-024-00926-w.
2
Automation and machine learning augmented by large language models in a catalysis study.在一项催化研究中,由大语言模型增强的自动化和机器学习。
Chem Sci. 2024 Jun 26;15(31):12200-12233. doi: 10.1039/d3sc07012c. eCollection 2024 Aug 7.
3
ChemReco: automated recognition of hand-drawn carbon-hydrogen-oxygen structures using deep learning.
ChemReco:利用深度学习对手绘碳氢氧结构进行自动识别
Sci Rep. 2024 Jul 25;14(1):17126. doi: 10.1038/s41598-024-67496-7.
4
MMSSC-Net: multi-stage sequence cognitive networks for drug molecule recognition.MMSSC-Net:用于药物分子识别的多阶段序列认知网络
RSC Adv. 2024 Jun 6;14(26):18182-18191. doi: 10.1039/d4ra02442g.
5
Automated molecular structure segmentation from documents using ChemSAM.使用ChemSAM从文档中自动进行分子结构分割。
J Cheminform. 2024 Mar 12;16(1):29. doi: 10.1186/s13321-024-00823-2.
6
ReactionDataExtractor 2.0: A Deep Learning Approach for Data Extraction from Chemical Reaction Schemes.反应数据提取器 2.0:一种从化学反应图中提取数据的深度学习方法。
J Chem Inf Model. 2023 Oct 9;63(19):6053-6067. doi: 10.1021/acs.jcim.3c00422. Epub 2023 Sep 20.
7
DECIMER.ai: an open platform for automated optical chemical structure identification, segmentation and recognition in scientific publications.DECIMER.ai:一个用于科学出版物中光学化学结构自动识别、分割和识别的开放平台。
Nat Commun. 2023 Aug 19;14(1):5045. doi: 10.1038/s41467-023-40782-0.
8
MolMiner: You Only Look Once for Chemical Structure Recognition.MolMiner:只看一次的化学结构识别。
J Chem Inf Model. 2022 Nov 28;62(22):5321-5328. doi: 10.1021/acs.jcim.2c00733. Epub 2022 Sep 15.
9
Review of techniques and models used in optical chemical structure recognition in images and scanned documents.图像和扫描文档中光学化学结构识别所使用的技术与模型综述。
J Cheminform. 2022 Sep 9;14(1):61. doi: 10.1186/s13321-022-00642-3.
10
SwinOCSR: end-to-end optical chemical structure recognition using a Swin Transformer.SwinOCSR:使用Swin Transformer进行端到端光学化学结构识别
J Cheminform. 2022 Jul 1;14(1):41. doi: 10.1186/s13321-022-00624-5.