• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

QSPRpred:一个灵活的开源定量结构-性质关系建模工具。

QSPRpred: a Flexible Open-Source Quantitative Structure-Property Relationship Modelling Tool.

作者信息

van den Maagdenberg Helle W, Šícho Martin, Araripe David Alencar, Luukkonen Sohvi, Schoenmaker Linde, Jespers Michiel, Béquignon Olivier J M, González Marina Gorostiola, van den Broek Remco L, Bernatavicius Andrius, van Hasselt J G Coen, van der Graaf Piet H, van Westen Gerard J P

机构信息

Computational Drug Discovery, Leiden Academic Centre for Drug Research, Leiden University, Einsteinweg 55, Leiden, 2333 CC, The Netherlands.

CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Department of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, Technická 5, Prague, A-4040, Czech Republic.

出版信息

J Cheminform. 2024 Nov 14;16(1):128. doi: 10.1186/s13321-024-00908-y.

DOI:10.1186/s13321-024-00908-y
PMID:39543652
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11566221/
Abstract

Building reliable and robust quantitative structure-property relationship (QSPR) models is a challenging task. First, the experimental data needs to be obtained, analyzed and curated. Second, the number of available methods is continuously growing and evaluating different algorithms and methodologies can be arduous. Finally, the last hurdle that researchers face is to ensure the reproducibility of their models and facilitate their transferability into practice. In this work, we introduce QSPRpred, a toolkit for analysis of bioactivity data sets and QSPR modelling, which attempts to address the aforementioned challenges. QSPRpred's modular Python API enables users to intuitively describe different parts of a modelling workflow using a plethora of pre-implemented components, but also integrates customized implementations in a "plug-and-play" manner. QSPRpred data sets and models are directly serializable, which means they can be readily reproduced and put into operation after training as the models are saved with all required data pre-processing steps to make predictions on new compounds directly from SMILES strings. The general-purpose character of QSPRpred is also demonstrated by inclusion of support for multi-task and proteochemometric modelling. The package is extensively documented and comes with a large collection of tutorials to help new users. In this paper, we describe all of QSPRpred's functionalities and also conduct a small benchmarking case study to illustrate how different components can be leveraged to compare a diverse set of models. QSPRpred is fully open-source and available at https://github.com/CDDLeiden/QSPRpred .Scientific ContributionQSPRpred aims to provide a complex, but comprehensive Python API to conduct all tasks encountered in QSPR modelling from data preparation and analysis to model creation and model deployment. In contrast to similar packages, QSPRpred offers a wider and more exhaustive range of capabilities and integrations with many popular packages that also go beyond QSPR modelling. A significant contribution of QSPRpred is also in its automated and highly standardized serialization scheme, which significantly improves reproducibility and transferability of models.

摘要

构建可靠且稳健的定量结构-性质关系(QSPR)模型是一项具有挑战性的任务。首先,需要获取、分析和整理实验数据。其次,可用方法的数量在不断增加,评估不同的算法和方法可能很艰巨。最后,研究人员面临的最后一个障碍是确保其模型的可重复性,并促进其向实际应用的可转移性。在这项工作中,我们引入了QSPRpred,这是一个用于生物活性数据集分析和QSPR建模的工具包,旨在应对上述挑战。QSPRpred的模块化Python API使用户能够使用大量预先实现的组件直观地描述建模工作流程的不同部分,同时还以“即插即用”的方式集成定制实现。QSPRpred数据集和模型是直接可序列化的,这意味着在训练后它们可以很容易地被重现并投入使用,因为模型在保存时包含了所有所需的数据预处理步骤,以便直接从SMILES字符串对新化合物进行预测。对多任务和蛋白质化学计量学建模的支持也证明了QSPRpred的通用性。该软件包有大量文档,并附带大量教程以帮助新用户。在本文中,我们描述了QSPRpred的所有功能,并进行了一个小型基准案例研究,以说明如何利用不同组件来比较各种模型。QSPRpred是完全开源的,可在https://github.com/CDDLeiden/QSPRpred获取。科学贡献QSPRpred旨在提供一个复杂但全面的Python API,以执行QSPR建模中遇到的所有任务,从数据准备和分析到模型创建和模型部署。与类似的软件包相比,QSPRpred提供了更广泛、更详尽的功能范围,并与许多流行软件包进行了集成,这些集成也超出了QSPR建模的范围。QSPRpred的一个重要贡献还在于其自动化且高度标准化的序列化方案,这显著提高了模型的可重复性和可转移性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/770290792fa0/13321_2024_908_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/f783aba2a262/13321_2024_908_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/02f1c3ac3dcd/13321_2024_908_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/45c626dae5d6/13321_2024_908_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/770290792fa0/13321_2024_908_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/f783aba2a262/13321_2024_908_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/02f1c3ac3dcd/13321_2024_908_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/45c626dae5d6/13321_2024_908_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9440/11566221/770290792fa0/13321_2024_908_Fig4_HTML.jpg

相似文献

1
QSPRpred: a Flexible Open-Source Quantitative Structure-Property Relationship Modelling Tool.QSPRpred:一个灵活的开源定量结构-性质关系建模工具。
J Cheminform. 2024 Nov 14;16(1):128. doi: 10.1186/s13321-024-00908-y.
2
NeuroPycon: An open-source python toolbox for fast multi-modal and reproducible brain connectivity pipelines.NeuroPycon:一个开源的 Python 工具包,用于快速进行多模态和可重复的脑连接管道。
Neuroimage. 2020 Oct 1;219:117020. doi: 10.1016/j.neuroimage.2020.117020. Epub 2020 Jun 6.
3
RRegrs: an R package for computer-aided model selection with multiple regression models.RRegrs:一个用于通过多元回归模型进行计算机辅助模型选择的R软件包。
J Cheminform. 2015 Sep 15;7:46. doi: 10.1186/s13321-015-0094-2. eCollection 2015.
4
Chemically Aware Model Builder (camb): an R package for property and bioactivity modelling of small molecules.化学感知模型构建器(camb):一个用于小分子性质和生物活性建模的R包。
J Cheminform. 2015 Aug 28;7:45. doi: 10.1186/s13321-015-0086-2. eCollection 2015.
5
QSAR-Co-X: an open source toolkit for multitarget QSAR modelling.QSAR-Co-X:用于多靶点定量构效关系建模的开源工具包。
J Cheminform. 2021 Apr 15;13(1):29. doi: 10.1186/s13321-021-00508-0.
6
MolCompass: multi-tool for the navigation in chemical space and visual validation of QSAR/QSPR models.MolCompass:用于化学空间导航和QSAR/QSPR模型可视化验证的多功能工具。
J Cheminform. 2024 Aug 12;16(1):98. doi: 10.1186/s13321-024-00888-z.
7
PyRaDiSe: A Python package for DICOM-RT-based auto-segmentation pipeline construction and DICOM-RT data conversion.PyRaDiSe:一个用于基于DICOM-RT的自动分割管道构建和DICOM-RT数据转换的Python软件包。
Comput Methods Programs Biomed. 2023 Apr;231:107374. doi: 10.1016/j.cmpb.2023.107374. Epub 2023 Jan 28.
8
Introduction to the BioChemical Library (BCL): An Application-Based Open-Source Toolkit for Integrated Cheminformatics and Machine Learning in Computer-Aided Drug Discovery.生物化学库(BCL)简介:一种基于应用的开源工具包,用于计算机辅助药物发现中的综合化学信息学和机器学习。
Front Pharmacol. 2022 Feb 21;13:833099. doi: 10.3389/fphar.2022.833099. eCollection 2022.
9
Gnocis: An integrated system for interactive and reproducible analysis and modelling of cis-regulatory elements in Python 3.Gnocis:一个用于在 Python 3 中交互式和可重复分析及建模顺式调控元件的集成系统。
PLoS One. 2022 Sep 9;17(9):e0274338. doi: 10.1371/journal.pone.0274338. eCollection 2022.
10
PyL3dMD: Python LAMMPS 3D molecular descriptors package.PyL3dMD:Python LAMMPS 3D分子描述符软件包。
J Cheminform. 2023 Jul 28;15(1):69. doi: 10.1186/s13321-023-00737-5.

引用本文的文献

1
Prediction model for chemical explosion consequences via multimodal feature fusion.基于多模态特征融合的化学爆炸后果预测模型
J Cheminform. 2025 Aug 5;17(1):118. doi: 10.1186/s13321-025-01060-x.
2
Toward Assay-Aware Bioactivity Model(er)s: Getting a Grip on Biological Context.迈向可感知分析的生物活性模型:把握生物学背景。
J Chem Inf Model. 2025 Jul 14;65(13):7013-7023. doi: 10.1021/acs.jcim.5c00603. Epub 2025 Jun 30.
3
Integrating Pharmacokinetics and Quantitative Systems Pharmacology Approaches in Generative Drug Design.

本文引用的文献

1
Data-driven quantum chemical property prediction leveraging 3D conformations with Uni-Mol.利用Uni-Mol的3D构象进行数据驱动的量子化学性质预测。
Nat Commun. 2024 Aug 19;15(1):7104. doi: 10.1038/s41467-024-51321-w.
2
Application of Mechanistic Multiparameter Optimization and Large-Scale to Pharmacokinetics Correlations to Small-Molecule Therapeutic Projects.机制多参数优化和大规模筛选在小分子治疗项目药代动力学相关性中的应用。
Mol Pharm. 2024 Sep 2;21(9):4312-4323. doi: 10.1021/acs.molpharmaceut.4c00256. Epub 2024 Aug 12.
3
QSARtuna: An Automated QSAR Modeling Platform for Molecular Property Prediction in Drug Design.
在生成式药物设计中整合药代动力学和定量系统药理学方法。
J Chem Inf Model. 2025 May 26;65(10):4783-4796. doi: 10.1021/acs.jcim.5c00107. Epub 2025 May 9.
4
MHNfs: Prompting In-Context Bioactivity Predictions for Low-Data Drug Discovery.MHNfs:为低数据药物发现提供上下文生物活性预测
J Chem Inf Model. 2025 May 12;65(9):4243-4250. doi: 10.1021/acs.jcim.4c02373. Epub 2025 Apr 30.
QSARtuna:一个用于药物设计中分子性质预测的自动化 QSAR 建模平台。
J Chem Inf Model. 2024 Jul 22;64(14):5365-5374. doi: 10.1021/acs.jcim.4c00457. Epub 2024 Jul 1.
4
Mind your prevalence!留意你的患病率!
J Cheminform. 2024 Apr 15;16(1):43. doi: 10.1186/s13321-024-00837-w.
5
Combining IC or Values from Different Sources Is a Source of Significant Noise.合并来自不同来源的IC或值是显著噪声的一个来源。
J Chem Inf Model. 2024 Mar 11;64(5):1560-1567. doi: 10.1021/acs.jcim.4c00049. Epub 2024 Feb 23.
6
Real-World Molecular Out-Of-Distribution: Specification and Investigation.真实世界的分子离群值:规范与研究。
J Chem Inf Model. 2024 Feb 12;64(3):697-711. doi: 10.1021/acs.jcim.3c01774. Epub 2024 Feb 1.
7
Prediction of Human Clearance Using In Silico Models with Reduced Bias.使用具有较低偏差的计算模型预测人体清除率。
Mol Pharm. 2024 Mar 4;21(3):1192-1203. doi: 10.1021/acs.molpharmaceut.3c00812. Epub 2024 Jan 29.
8
Chemprop: A Machine Learning Package for Chemical Property Prediction.Chemprop:一个用于化学性质预测的机器学习工具包。
J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.
9
Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR.将 QSAR 建模与深度学习整合到药物发现中:深 QSAR 的出现。
Nat Rev Drug Discov. 2024 Feb;23(2):141-155. doi: 10.1038/s41573-023-00832-0. Epub 2023 Dec 8.
10
Rationalizing general limitations in assessing and comparing methods for compound potency prediction.合理化评估和比较化合物效力预测方法的一般局限性。
Sci Rep. 2023 Oct 19;13(1):17816. doi: 10.1038/s41598-023-45086-3.