• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过从原子力显微镜图像中提取分子指纹进行分子鉴定。

Molecular identification via molecular fingerprint extraction from atomic force microscopy images.

作者信息

González Lastre Manuel, Pou Pablo, Wiche Miguel, Ebeling Daniel, Schirmeisen Andre, Pérez Rubén

机构信息

Departamento de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049, Madrid, Spain.

Condensed Matter Physics Center (IFIMAC), Universidad Autónoma de Madrid, E-28049, Madrid, Spain.

出版信息

J Cheminform. 2024 Nov 25;16(1):130. doi: 10.1186/s13321-024-00921-1.

DOI:10.1186/s13321-024-00921-1
PMID:39587659
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11587762/
Abstract

Non-Contact Atomic Force Microscopy with CO-functionalized metal tips (referred to as HR-AFM) provides access to the internal structure of individual molecules adsorbed on a surface with totally unprecedented resolution. Previous works have shown that deep learning (DL) models can retrieve the chemical and structural information encoded in a 3D stack of constant-height HR-AFM images, leading to molecular identification. In this work, we overcome their limitations by using a well-established description of the molecular structure in terms of topological fingerprints, the 1024-bit Extended Connectivity Chemical Fingerprints of radius 2 (ECFP4), that were developed for substructure and similarity searching. ECFPs provide local structural information of the molecule, each bit correlating with a particular substructure within the molecule. Our DL model is able to extract this optimized structural descriptor from the 3D HR-AFM stacks and use it, through virtual screening, to identify molecules from their predicted ECFP4 with a retrieval accuracy on theoretical images of 95.4%. Furthermore, this approach, unlike previous DL models, assigns a confidence score, the Tanimoto similarity, to each of the candidate molecules, thus providing information on the reliability of the identification. By construction, the number of times a certain substructure is present in the molecule is lost during the hashing process, necessary to make them useful for machine learning applications. We show that it is possible to complement the fingerprint-based virtual screening with global information provided by another DL model that predicts from the same HR-AFM stacks the chemical formula, boosting the identification accuracy up to a 97.6%. Finally, we perform a limited test with experimental images, obtaining promising results towards the application of this pipeline under real conditions.Scientific contributionPrevious works on molecular identification from AFM images used chemical descriptors that were intuitive for humans but sub-optimal for neural networks. We propose a novel method to extract the ECFP4 from AFM images and identify the molecule via a virtual screening, beating previous state-of-the-art models.

摘要

采用一氧化碳功能化金属探针的非接触式原子力显微镜(简称为高分辨率原子力显微镜,HR-AFM)能够以前所未有的分辨率揭示吸附在表面的单个分子的内部结构。此前的研究表明,深度学习(DL)模型可以从三维等高HR-AFM图像堆栈中检索编码的化学和结构信息,从而实现分子识别。在本研究中,我们通过使用一种成熟的基于拓扑指纹(即半径为2的1024位扩展连接性化学指纹,ECFP4)的分子结构描述方法克服了它们的局限性,该方法是为子结构和相似性搜索而开发的。ECFP提供了分子的局部结构信息,每一位都与分子内的特定子结构相关。我们的DL模型能够从三维HR-AFM堆栈中提取这种优化的结构描述符,并通过虚拟筛选,利用预测的ECFP4识别分子,在理论图像上的检索准确率达到95.4%。此外,与之前的DL模型不同,这种方法为每个候选分子赋予一个置信度分数,即塔尼莫托相似度,从而提供识别可靠性的信息。在构建过程中,分子中某个特定子结构出现的次数在哈希过程中丢失了,而哈希过程对于使其适用于机器学习应用是必要的。我们表明,可以用另一个从相同HR-AFM堆栈预测化学式的DL模型提供的全局信息来补充基于指纹的虚拟筛选,从而将识别准确率提高到97.6%。最后,我们对实验图像进行了有限的测试,在实际条件下应用该流程取得了有前景的结果。

科学贡献

此前从AFM图像进行分子识别的研究使用的化学描述符对人类来说直观,但对神经网络来说并非最优。我们提出了一种从AFM图像中提取ECFP4并通过虚拟筛选识别分子的新方法,超越了之前的最先进模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/a27fbe23bf47/13321_2024_921_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/c61150c46ae0/13321_2024_921_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/778471461272/13321_2024_921_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/a265257e5c51/13321_2024_921_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/74bc6b7959ce/13321_2024_921_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/6165e7c8c9cd/13321_2024_921_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/335693b45dc9/13321_2024_921_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/8587b2c52f4c/13321_2024_921_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/d7e9ce006be4/13321_2024_921_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/a27fbe23bf47/13321_2024_921_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/c61150c46ae0/13321_2024_921_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/778471461272/13321_2024_921_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/a265257e5c51/13321_2024_921_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/74bc6b7959ce/13321_2024_921_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/6165e7c8c9cd/13321_2024_921_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/335693b45dc9/13321_2024_921_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/8587b2c52f4c/13321_2024_921_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/d7e9ce006be4/13321_2024_921_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/056c/11587762/a27fbe23bf47/13321_2024_921_Fig9_HTML.jpg

相似文献

1
Molecular identification via molecular fingerprint extraction from atomic force microscopy images.通过从原子力显微镜图像中提取分子指纹进行分子鉴定。
J Cheminform. 2024 Nov 25;16(1):130. doi: 10.1186/s13321-024-00921-1.
2
Molecular Identification from AFM Images Using the IUPAC Nomenclature and Attribute Multimodal Recurrent Neural Networks.基于 IUPAC 命名法和属性多模态递归神经网络的原子力显微镜图像分子识别。
ACS Appl Mater Interfaces. 2023 May 10;15(18):22692-22704. doi: 10.1021/acsami.3c01550. Epub 2023 May 1.
3
QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy.QUAM-AFM:一个通过原子力显微镜进行分子鉴定的免费数据库。
J Chem Inf Model. 2022 Mar 14;62(5):1214-1223. doi: 10.1021/acs.jcim.1c01323. Epub 2022 Mar 2.
4
Sort & Slice: a simple and superior alternative to hash-based folding for extended-connectivity fingerprints.排序与切片:一种用于扩展连接性指纹的、比基于哈希的折叠更简单且更优的替代方法。
J Cheminform. 2024 Dec 3;16(1):135. doi: 10.1186/s13321-024-00932-y.
5
A Deep Learning Approach for Molecular Classification Based on AFM Images.一种基于原子力显微镜图像的分子分类深度学习方法。
Nanomaterials (Basel). 2021 Jun 24;11(7):1658. doi: 10.3390/nano11071658.
6
Reconstruction of low-resolution molecular structures from simulated atomic force microscopy images.从原子力显微镜模拟图像中重建低分辨率分子结构。
Biochim Biophys Acta Gen Subj. 2020 Feb;1864(2):129420. doi: 10.1016/j.bbagen.2019.129420. Epub 2019 Aug 28.
7
Automated structure discovery in atomic force microscopy.原子力显微镜中的自动结构发现
Sci Adv. 2020 Feb 26;6(9):eaay6913. doi: 10.1126/sciadv.aay6913. eCollection 2020 Feb.
8
Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks.基于深度置信网络堆叠的改进深度学习分子相似性搜索方法。
Molecules. 2020 Dec 29;26(1):128. doi: 10.3390/molecules26010128.
9
High-resolution imaging of C60 molecules using tuning-fork-based non-contact atomic force microscopy.利用基于音叉的非接触原子力显微镜对 C60 分子进行高分辨率成像。
J Phys Condens Matter. 2012 Feb 29;24(8):084005. doi: 10.1088/0953-8984/24/8/084005. Epub 2012 Feb 7.
10
Comparative Analysis of Chemical Descriptors by Machine Learning Reveals Atomistic Insights into Solute-Lipid Interactions.基于机器学习的化学描述符对比分析揭示了溶质-脂质相互作用的原子水平见解。
Mol Pharm. 2024 Jul 1;21(7):3343-3355. doi: 10.1021/acs.molpharmaceut.4c00080. Epub 2024 May 23.

引用本文的文献

1
Reconstructing Pristine Molecular Orbitals from Scanning Tunneling Microscope Images via Artificial Intelligence Approaches.通过人工智能方法从扫描隧道显微镜图像重建原始分子轨道
JACS Au. 2025 Jun 16;5(7):3163-3170. doi: 10.1021/jacsau.5c00310. eCollection 2025 Jul 28.