• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用神经网络生成的领域特定指纹增强基于配体的虚拟筛选。

Using Domain-Specific Fingerprints Generated Through Neural Networks to Enhance Ligand-Based Virtual Screening.

机构信息

Institute of Pharmaceutical and Medicinal Chemistry, Westfälische Wilhelms-Universität Münster, Corrensstraße 48, Münster 48149, Germany.

Center for Multiscale Theory and Computation, Westfälische Wilhelms-Universität Münster, Corrensstraße 48, Münster 48149, Germany.

出版信息

J Chem Inf Model. 2021 Feb 22;61(2):664-675. doi: 10.1021/acs.jcim.0c01208. Epub 2021 Jan 26.

DOI:10.1021/acs.jcim.0c01208
PMID:33497572
Abstract

Similarity-based virtual screening is a fundamental tool in the early drug discovery process and relies heavily on molecular fingerprints. We propose a novel strategy of generating domain-specific fingerprints by training neural networks on target-specific bioactivity datasets and using the activation as a new molecular representation. The neural network is expected to combine information of already known bioactive compounds with unique information of the molecular structure and by doing so enrich the fingerprint. We evaluate this strategy on a large kinase-specific bioactivity dataset. A comparison of five neural network architectures and their fingerprints to the well-established extended-connectivity fingerprint (ECFP) and an autoencoder shows that our neural fingerprint produces better results in the similarity search. Most importantly, the neural fingerprint performs well even when specific targets are not included during training. Surprisingly, while Graph Neural Networks (GNNs) are thought to offer an advantageous alternative, the best performing neural fingerprints were based on traditional fully connected layers using the ECFP4 as the input. The neural fingerprint is freely available at: https://github.com/kochgroup/kinase_nnfp.

摘要

基于相似度的虚拟筛选是药物发现早期过程中的一个基本工具,它严重依赖于分子指纹。我们提出了一种新的策略,通过在特定于目标的生物活性数据集上训练神经网络,并使用激活作为新的分子表示来生成特定于域的指纹。预计神经网络将结合已经已知的生物活性化合物的信息与分子结构的独特信息,并通过这样来丰富指纹。我们在一个大型激酶特异性生物活性数据集上评估了这种策略。五种神经网络架构及其指纹与成熟的扩展连接指纹 (ECFP) 和自动编码器的比较表明,我们的神经网络指纹在相似性搜索中产生了更好的结果。最重要的是,即使在训练过程中不包括特定的目标,神经网络指纹也能很好地发挥作用。令人惊讶的是,尽管图神经网络 (GNN) 被认为是一种有利的替代方案,但表现最好的神经网络指纹是基于传统的全连接层,使用 ECFP4 作为输入。神经网络指纹可在以下网址免费获取:https://github.com/kochgroup/kinase_nnfp。

相似文献

1
Using Domain-Specific Fingerprints Generated Through Neural Networks to Enhance Ligand-Based Virtual Screening.利用神经网络生成的领域特定指纹增强基于配体的虚拟筛选。
J Chem Inf Model. 2021 Feb 22;61(2):664-675. doi: 10.1021/acs.jcim.0c01208. Epub 2021 Jan 26.
2
Neural networks prediction of the protein-ligand binding affinity with circular fingerprints.基于循环指纹的蛋白质配体结合亲和力的神经网络预测。
Technol Health Care. 2023;31(S1):487-495. doi: 10.3233/THC-236042.
3
Prioritizing Virtual Screening with Interpretable Interaction Fingerprints.基于可解释相互作用指纹的虚拟筛选优先级排序。
J Chem Inf Model. 2022 Sep 26;62(18):4300-4318. doi: 10.1021/acs.jcim.2c00695. Epub 2022 Sep 14.
4
TF3P: Three-Dimensional Force Fields Fingerprint Learned by Deep Capsular Network.TF3P:基于深度胶囊网络学习的三维力场指纹。
J Chem Inf Model. 2020 Jun 22;60(6):2754-2765. doi: 10.1021/acs.jcim.0c00005. Epub 2020 May 28.
5
EMBER-Embedding Multiple Molecular Fingerprints for Virtual Screening.EMBER-嵌入多种分子指纹进行虚拟筛选。
Int J Mol Sci. 2022 Feb 15;23(4):2156. doi: 10.3390/ijms23042156.
6
Employing Molecular Conformations for Ligand-Based Virtual Screening with Equivariant Graph Neural Network and Deep Multiple Instance Learning.利用基于分子构象的等价图神经网络和深度多重实例学习进行配体虚拟筛选。
Molecules. 2023 Aug 9;28(16):5982. doi: 10.3390/molecules28165982.
7
A probabilistic molecular fingerprint for big data settings.一种适用于大数据环境的概率分子指纹。
J Cheminform. 2018 Dec 18;10(1):66. doi: 10.1186/s13321-018-0321-8.
8
Improved Scaffold Hopping in Ligand-Based Virtual Screening Using Neural Representation Learning.基于神经网络表示学习的配体虚拟筛选中支架跳跃的改进。
J Chem Inf Model. 2020 Oct 26;60(10):4629-4639. doi: 10.1021/acs.jcim.0c00622. Epub 2020 Aug 19.
9
Natural product scores and fingerprints extracted from artificial neural networks.从人工神经网络中提取的天然产物得分和指纹图谱。
Comput Struct Biotechnol J. 2021 Jul 30;19:4593-4602. doi: 10.1016/j.csbj.2021.07.032. eCollection 2021.
10
OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs.OLB-AC:通过深度图学习和活性悬崖优化配体生物活性。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae365.

引用本文的文献

1
DrugGen enhances drug discovery with large language models and reinforcement learning.DrugGen利用大语言模型和强化学习提升药物研发。
Sci Rep. 2025 Apr 18;15(1):13445. doi: 10.1038/s41598-025-98629-1.
2
Comparing Explanations of Molecular Machine Learning Models Generated with Different Methods for the Calculation of Shapley Values.比较使用不同方法计算Shapley值生成的分子机器学习模型的解释
Mol Inform. 2025 Mar;44(3):e202500067. doi: 10.1002/minf.202500067.
3
Efficient and Explainable Virtual Screening of Molecules through Fingerprint-Generating Networks Integrated with Artificial Neural Networks.
通过与人工神经网络集成的指纹生成网络对分子进行高效且可解释的虚拟筛选。
ACS Omega. 2025 Jan 28;10(5):4896-4911. doi: 10.1021/acsomega.4c10289. eCollection 2025 Feb 11.
4
Sort & Slice: a simple and superior alternative to hash-based folding for extended-connectivity fingerprints.排序与切片:一种用于扩展连接性指纹的、比基于哈希的折叠更简单且更优的替代方法。
J Cheminform. 2024 Dec 3;16(1):135. doi: 10.1186/s13321-024-00932-y.
5
DiPPI: A Curated Data Set for Drug-like Molecules in Protein-Protein Interfaces.DiPPI:蛋白质-蛋白质界面中类药分子的精选数据集。
J Chem Inf Model. 2024 Jul 8;64(13):5041-5051. doi: 10.1021/acs.jcim.3c01905. Epub 2024 Jun 22.
6
Interpreting Neural Network Models for Toxicity Prediction by Extracting Learned Chemical Features.通过提取学习到的化学特征来解释神经网络模型在毒性预测中的作用。
J Chem Inf Model. 2024 May 13;64(9):3670-3688. doi: 10.1021/acs.jcim.4c00127. Epub 2024 Apr 30.
7
"DompeKeys": a set of novel substructure-based descriptors for efficient chemical space mapping, development and structural interpretation of machine learning models, and indexing of large databases.“多姆佩键”:一组基于新颖子结构的描述符,用于高效的化学空间映射、机器学习模型的开发与结构解释以及大型数据库的索引编制。
J Cheminform. 2024 Feb 23;16(1):21. doi: 10.1186/s13321-024-00813-4.
8
Can Pretrained Models Really Learn Better Molecular Representations for AI-Aided Drug Discovery?预训练模型真的能为 AI 辅助药物发现学习更好的分子表示吗?
J Chem Inf Model. 2024 Apr 8;64(7):2921-2930. doi: 10.1021/acs.jcim.3c01707. Epub 2023 Dec 25.
9
Machine-Learning-Based Data Analysis Method for Cell-Based Selection of DNA-Encoded Libraries.基于机器学习的用于基于细胞筛选DNA编码文库的数据分析方法
ACS Omega. 2023 May 15;8(21):19057-19071. doi: 10.1021/acsomega.3c02152. eCollection 2023 May 30.
10
Exploring QSAR models for activity-cliff prediction.探索用于活性悬崖预测的定量构效关系模型。
J Cheminform. 2023 Apr 17;15(1):47. doi: 10.1186/s13321-023-00708-w.