• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有快速传播的可推广、快速且准确的深度定量构效关系模型。

Generalizable, fast, and accurate DeepQSPR with fastprop.

作者信息

Burns Jackson W, Green William H

机构信息

Massachusetts Institute of Technology, Cambridge, MA, USA.

出版信息

J Cheminform. 2025 May 13;17(1):73. doi: 10.1186/s13321-025-01013-4.

DOI:10.1186/s13321-025-01013-4
PMID:40361252
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12076996/
Abstract

Quantitative Structure-Property Relationship studies (QSPR), often referred to interchangeably as QSAR, seek to establish a mapping between molecular structure and an arbitrary target property. Historically this was done on a target-by-target basis with new descriptors being devised to specifically map to a given target. Today software packages exist that calculate thousands of these descriptors, enabling general modeling typically with classical and machine learning methods. Also present today are learned representation methods in which deep learning models generate a target-specific representation during training. The former requires less training data and offers improved speed and interpretability while the latter offers excellent generality, while the intersection of the two remains under-explored. This paper introduces fastprop, a software package and general Deep-QSPR framework that combines a cogent set of molecular descriptors with deep learning to achieve state-of-the-art performance on datasets ranging from tens to tens of thousands of molecules. fastprop provides both a user-friendly Command Line Interface and highly interoperable set of Python modules for the training and deployment of feedforward neural networks for property prediction. This approach yields improvements in speed and interpretability over existing methods while statistically equaling or exceeding their performance across most of the tested benchmarks. fastprop is designed with Research Software Engineering best practices and is free and open source, hosted at github.com/jacksonburns/fastprop.

摘要

定量结构-性质关系研究(QSPR,通常与QSAR互换使用)旨在建立分子结构与任意目标性质之间的映射关系。从历史上看,这是在逐个目标的基础上进行的,需要设计新的描述符来专门映射到给定目标。如今,存在一些软件包,可以计算数千个这样的描述符,从而通常能够使用经典方法和机器学习方法进行通用建模。如今还出现了学习表示方法,其中深度学习模型在训练期间生成特定于目标的表示。前者需要较少的训练数据,并提供更高的速度和可解释性,而后者具有出色的通用性,而两者的交叉点仍有待深入探索。本文介绍了fastprop,这是一个软件包和通用的深度QSPR框架,它将一组有说服力的分子描述符与深度学习相结合,在从数十到数万个分子的数据集上实现了一流的性能。fastprop为前馈神经网络的训练和部署提供了用户友好的命令行界面和高度可互操作的Python模块集,用于属性预测。这种方法在速度和可解释性方面比现有方法有所改进,同时在大多数测试基准上在统计上等于或超过它们的性能。fastprop是按照研究软件工程的最佳实践设计的,并且是免费和开源的,托管在github.com/jacksonburns/fastprop上。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22e8/12076996/659da3bd1caa/13321_2025_1013_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22e8/12076996/659da3bd1caa/13321_2025_1013_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22e8/12076996/659da3bd1caa/13321_2025_1013_Fig1_HTML.jpg

相似文献

1
Generalizable, fast, and accurate DeepQSPR with fastprop.具有快速传播的可推广、快速且准确的深度定量构效关系模型。
J Cheminform. 2025 May 13;17(1):73. doi: 10.1186/s13321-025-01013-4.
2
Molecular Descriptors, Structure Generation, and Inverse QSAR/QSPR Based on SELFIES.基于SELFIES的分子描述符、结构生成及逆定量构效关系/定量构性关系
ACS Omega. 2023 Jun 5;8(24):21781-21786. doi: 10.1021/acsomega.3c01332. eCollection 2023 Jun 20.
3
DGL-LifeSci: An Open-Source Toolkit for Deep Learning on Graphs in Life Science.DGL-LifeSci:用于生命科学领域图深度学习的开源工具包。
ACS Omega. 2021 Oct 5;6(41):27233-27238. doi: 10.1021/acsomega.1c04017. eCollection 2021 Oct 19.
4
Visual analytics in cheminformatics: user-supervised descriptor selection for QSAR methods.化学信息学中的可视化分析:用于定量构效关系方法的用户监督描述符选择
J Cheminform. 2015 Aug 19;7:39. doi: 10.1186/s13321-015-0092-4. eCollection 2015.
5
Open-source QSAR models for pKa prediction using multiple machine learning approaches.使用多种机器学习方法进行pKa预测的开源定量构效关系模型
J Cheminform. 2019 Sep 18;11(1):60. doi: 10.1186/s13321-019-0384-1.
6
ChampKit: A framework for rapid evaluation of deep neural networks for patch-based histopathology classification.ChampKit:一种基于补丁的组织病理学分类的深度神经网络快速评估框架。
Comput Methods Programs Biomed. 2023 Sep;239:107631. doi: 10.1016/j.cmpb.2023.107631. Epub 2023 May 30.
7
Quantitative structure-property relationships for the calculation of the soil adsorption coefficient using machine learning algorithms with calculated chemical properties from open-source software.使用机器学习算法和开源软件计算的化学性质计算土壤吸附系数的定量结构-性质关系。
Environ Res. 2021 May;196:110363. doi: 10.1016/j.envres.2020.110363. Epub 2020 Oct 22.
8
QSPRpred: a Flexible Open-Source Quantitative Structure-Property Relationship Modelling Tool.QSPRpred:一个灵活的开源定量结构-性质关系建模工具。
J Cheminform. 2024 Nov 14;16(1):128. doi: 10.1186/s13321-024-00908-y.
9
PyL3dMD: Python LAMMPS 3D molecular descriptors package.PyL3dMD:Python LAMMPS 3D分子描述符软件包。
J Cheminform. 2023 Jul 28;15(1):69. doi: 10.1186/s13321-023-00737-5.
10
Importance of Engineered and Learned Molecular Representations in Predicting Organic Reactivity, Selectivity, and Chemical Properties.在预测有机反应性、选择性和化学性质方面,工程化和学习的分子表示的重要性。
Acc Chem Res. 2021 Feb 16;54(4):827-836. doi: 10.1021/acs.accounts.0c00745. Epub 2021 Feb 3.

引用本文的文献

1
Data-driven organic solubility prediction at the limit of aleatoric uncertainty.在偶然不确定性极限下的数据驱动有机溶解度预测。
Nat Commun. 2025 Aug 19;16(1):7497. doi: 10.1038/s41467-025-62717-7.
2
Advanced machine learning for innovative drug discovery.用于创新药物发现的先进机器学习技术。
J Cheminform. 2025 Aug 8;17(1):122. doi: 10.1186/s13321-025-01061-w.

本文引用的文献

1
Be aware of overfitting by hyperparameter optimization!通过超参数优化注意过拟合!
J Cheminform. 2024 Dec 9;16(1):139. doi: 10.1186/s13321-024-00934-w.
2
Molecular property prediction based on graph structure learning.基于图结构学习的分子性质预测。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae304.
3
Deep learning for low-data drug discovery: Hurdles and opportunities.用于低数据量药物发现的深度学习:障碍与机遇。
Curr Opin Struct Biol. 2024 Jun;86:102818. doi: 10.1016/j.sbi.2024.102818. Epub 2024 Apr 25.
4
Molecular hypergraph neural networks.分子超图神经网络
J Chem Phys. 2024 Apr 14;160(14). doi: 10.1063/5.0193557.
5
Molecular Contrastive Pretraining with Collaborative Featurizations.分子对比预训练与协作特征化。
J Chem Inf Model. 2024 Feb 26;64(4):1112-1122. doi: 10.1021/acs.jcim.3c01468. Epub 2024 Feb 5.
6
Chemprop: A Machine Learning Package for Chemical Property Prediction.Chemprop:一个用于化学性质预测的机器学习工具包。
J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.
7
From intuition to AI: evolution of small molecule representations in drug discovery.从直觉到人工智能:药物发现中小分子表示的演变。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad422.
8
QuantumScents: Quantum-Mechanical Properties for 3.5k Olfactory Molecules.QuantumScents:3500 种气味分子的量子力学特性。
J Chem Inf Model. 2023 Dec 11;63(23):7330-7337. doi: 10.1021/acs.jcim.3c01338. Epub 2023 Nov 21.
9
Novel molecular hybrid geometric-harmonic-Zagreb degree based descriptors and their efficacy in QSPR studies of polycyclic aromatic hydrocarbons.新型分子混合几何调和扎格布度数描述符及其在多环芳烃 QSPR 研究中的效能。
SAR QSAR Environ Res. 2023 Jul-Sep;34(7):569-589. doi: 10.1080/1062936X.2023.2239149. Epub 2023 Aug 4.
10
DeepAR: a novel deep learning-based hybrid framework for the interpretable prediction of androgen receptor antagonists.DeepAR:一种基于深度学习的新型混合框架,用于雄激素受体拮抗剂的可解释预测。
J Cheminform. 2023 May 6;15(1):50. doi: 10.1186/s13321-023-00721-z.