• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自动编码器深度学习的分子相似性搜索特征降维。

Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning.

机构信息

School of Computing, Universiti Teknologi Malaysia, Johor Bahru 81310, Malaysia.

DAAI Research Group, Department of Computing and Data Science, School of Computing and Digital Technology, Birmingham City University, Birmingham B4 7XG, UK.

出版信息

Biomolecules. 2022 Mar 27;12(4):508. doi: 10.3390/biom12040508.

DOI:10.3390/biom12040508
PMID:35454097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9029813/
Abstract

The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate the similarity of molecules towards a target query. However, these descriptors include redundant and irrelevant features that might impact the performance of similarity searching methods. Thus, this study proposed a new approach for identifying the important features of molecules in chemical datasets based on the representation of the molecular features using Autoencoder (AE), with the aim of removing irrelevant and redundant features. The proposed approach experimented using the MDL Data Drug Report standard dataset (MDDR). Based on experimental findings, the proposed approach performed better than several existing benchmark similarity methods such as Tanimoto Similarity Method (TAN), Adapted Similarity Measure of Text Processing (ASMTP), and Quantum-Based Similarity Method (SQB). The results demonstrated that the performance achieved by the proposed approach has proven to be superior, particularly with the use of structurally heterogeneous datasets, where it yielded improved results compared to other previously used methods with the similar goal of improving molecular similarity searching.

摘要

分子相似性的概念在合理药物设计中被广泛应用,在分子数据库中检查结构相似的分子,以检索功能相似的分子。最常用的传统相似性方法使用二维(2D)指纹来评估分子对目标查询的相似性。然而,这些描述符包含冗余和不相关的特征,可能会影响相似性搜索方法的性能。因此,本研究提出了一种基于自动编码器(AE)表示分子特征的新方法,用于识别化学数据集分子的重要特征,旨在去除不相关和冗余的特征。该方法在 MDL Data Drug Report 标准数据集(MDDR)上进行了实验。基于实验结果,与 Tanimoto 相似性方法(TAN)、文本处理自适应相似性度量(ASMTP)和基于量子的相似性方法(SQB)等几种现有的基准相似性方法相比,所提出的方法表现更好。结果表明,所提出的方法的性能表现优越,特别是在使用结构异构数据集时,与其他具有相似目标的先前使用的方法相比,它取得了更好的结果,即改善分子相似性搜索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/6ea33622784d/biomolecules-12-00508-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/1bf0b5f97a8d/biomolecules-12-00508-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/9d8e346afdb4/biomolecules-12-00508-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/67ffea62a13e/biomolecules-12-00508-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/8d03b21bdb8a/biomolecules-12-00508-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/d52acc8c6a2d/biomolecules-12-00508-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/c5464ef57738/biomolecules-12-00508-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/6ea33622784d/biomolecules-12-00508-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/1bf0b5f97a8d/biomolecules-12-00508-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/9d8e346afdb4/biomolecules-12-00508-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/67ffea62a13e/biomolecules-12-00508-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/8d03b21bdb8a/biomolecules-12-00508-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/d52acc8c6a2d/biomolecules-12-00508-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/c5464ef57738/biomolecules-12-00508-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f82/9029813/6ea33622784d/biomolecules-12-00508-g007.jpg

相似文献

1
Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning.基于自动编码器深度学习的分子相似性搜索特征降维。
Biomolecules. 2022 Mar 27;12(4):508. doi: 10.3390/biom12040508.
2
Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks.基于深度置信网络堆叠的改进深度学习分子相似性搜索方法。
Molecules. 2020 Dec 29;26(1):128. doi: 10.3390/molecules26010128.
3
Similarity searching of chemical databases using atom environment descriptors (MOLPRINT 2D): evaluation of performance.使用原子环境描述符(MOLPRINT 2D)对化学数据库进行相似性搜索:性能评估
J Chem Inf Comput Sci. 2004 Sep-Oct;44(5):1708-18. doi: 10.1021/ci0498719.
4
A Quantum-Based Similarity Method in Virtual Screening.虚拟筛选中基于量子的相似性方法。
Molecules. 2015 Oct 2;20(10):18107-27. doi: 10.3390/molecules201018107.
5
Adapting Document Similarity Measures for Ligand-Based Virtual Screening.调整基于配体的虚拟筛选的文档相似性度量方法。
Molecules. 2016 Apr 13;21(4):476. doi: 10.3390/molecules21040476.
6
A new graph-based molecular descriptor using the canonical representation of the molecule.一种使用分子的规范表示法的基于图的新分子描述符。
ScientificWorldJournal. 2014;2014:286974. doi: 10.1155/2014/286974. Epub 2014 Jul 22.
7
Similarity-Based Virtual Screen Using Enhanced Siamese Multi-Layer Perceptron.基于相似性的虚拟筛选:使用增强型暹罗多层感知机。
Molecules. 2021 Nov 3;26(21):6669. doi: 10.3390/molecules26216669.
8
Performance of similarity measures in 2D fragment-based similarity searching: comparison of structural descriptors and similarity coefficients.二维基于片段的相似性搜索中相似性度量的性能:结构描述符和相似性系数的比较
J Chem Inf Comput Sci. 2002 Nov-Dec;42(6):1407-14. doi: 10.1021/ci025531g.
9
A Computational-Based Method for Predicting Drug-Target Interactions by Using Stacked Autoencoder Deep Neural Network.一种基于堆叠自动编码器深度神经网络预测药物-靶点相互作用的计算方法。
J Comput Biol. 2018 Mar;25(3):361-373. doi: 10.1089/cmb.2017.0135. Epub 2017 Sep 11.
10
Predicting drug-target interaction network using deep learning model.利用深度学习模型预测药物-靶标相互作用网络。
Comput Biol Chem. 2019 Jun;80:90-101. doi: 10.1016/j.compbiolchem.2019.03.016. Epub 2019 Mar 25.

引用本文的文献

1
Utilizing Low-Dimensional Molecular Embeddings for Rapid Chemical Similarity Search.利用低维分子嵌入进行快速化学相似性搜索。
Adv Inf Retr. 2024 Mar;14609:34-49. doi: 10.1007/978-3-031-56060-6_3. Epub 2024 Mar 16.
2
In Silico Drug Design and Discovery: Big Data for Small Molecule Design.计算机辅助药物设计与发现:小分子设计的大数据
Biomolecules. 2022 Dec 26;13(1):44. doi: 10.3390/biom13010044.
3
Hybrid-Enhanced Siamese Similarity Models in Ligand-Based Virtual Screen.基于配体的虚拟筛选中的混合增强型暹罗相似度模型。

本文引用的文献

1
Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks.基于深度置信网络堆叠的改进深度学习分子相似性搜索方法。
Molecules. 2020 Dec 29;26(1):128. doi: 10.3390/molecules26010128.
2
VAE-Sim: A Novel Molecular Similarity Measure Based on a Variational Autoencoder.VAE-Sim:一种基于变分自动编码器的新型分子相似性度量方法。
Molecules. 2020 Jul 29;25(15):3446. doi: 10.3390/molecules25153446.
3
EncoderMap: Dimensionality Reduction and Generation of Molecule Conformations.编码器映射:分子构象的降维和生成。
Biomolecules. 2022 Nov 20;12(11):1719. doi: 10.3390/biom12111719.
4
Convolutional Neural Network Model Based on 2D Fingerprint for Bioactivity Prediction.基于二维指纹的卷积神经网络模型用于生物活性预测。
Int J Mol Sci. 2022 Oct 30;23(21):13230. doi: 10.3390/ijms232113230.
5
Deep autoencoder-powered pattern identification of sleep disturbance using multi-site cross-sectional survey data.利用多站点横断面调查数据,通过深度自动编码器进行睡眠障碍模式识别。
Front Med (Lausanne). 2022 Jul 29;9:950327. doi: 10.3389/fmed.2022.950327. eCollection 2022.
J Chem Theory Comput. 2019 Feb 12;15(2):1209-1215. doi: 10.1021/acs.jctc.8b00975. Epub 2019 Jan 25.
4
Computer-Assisted Retrosynthesis Based on Molecular Similarity.基于分子相似性的计算机辅助逆合成分析
ACS Cent Sci. 2017 Dec 27;3(12):1237-1245. doi: 10.1021/acscentsci.7b00355. Epub 2017 Nov 16.
5
Application of Generative Autoencoder in De Novo Molecular Design.生成式自动编码器在从头分子设计中的应用。
Mol Inform. 2018 Jan;37(1-2). doi: 10.1002/minf.201700123. Epub 2017 Dec 13.
6
Interaction prediction in structure-based virtual screening using deep learning.基于深度学习的结构虚拟筛选中的相互作用预测。
Comput Biol Med. 2018 Sep 1;100:253-258. doi: 10.1016/j.compbiomed.2017.09.007. Epub 2017 Sep 14.
7
Recent Advances in Scaffold Hopping.骨架跃迁的最新进展
J Med Chem. 2017 Feb 23;60(4):1238-1246. doi: 10.1021/acs.jmedchem.6b01437. Epub 2016 Dec 21.
8
Molecular Similarity Concepts for Informatics Applications.用于信息学应用的分子相似性概念
Methods Mol Biol. 2017;1526:231-245. doi: 10.1007/978-1-4939-6613-4_13.
9
Adapting Document Similarity Measures for Ligand-Based Virtual Screening.调整基于配体的虚拟筛选的文档相似性度量方法。
Molecules. 2016 Apr 13;21(4):476. doi: 10.3390/molecules21040476.
10
A Quantum-Based Similarity Method in Virtual Screening.虚拟筛选中基于量子的相似性方法。
Molecules. 2015 Oct 2;20(10):18107-27. doi: 10.3390/molecules201018107.