• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PySmash:用于代表性子结构生成和应用的 Python 包和独立可执行程序。

PySmash: Python package and individual executable program for representative substructure generation and application.

机构信息

Department of Pharmacy, Xiangya Hospital, Central South University and the Xiangya School of Pharmaceutical Sciences, Central South University, Sichuan, China.

Xiangya School of Pharmaceutical Sciences, Central South University, Hunan, China.

出版信息

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab017.

DOI:10.1093/bib/bbab017
PMID:33709154
Abstract

BACKGROUND

Substructure screening is widely applied to evaluate the molecular potency and ADMET properties of compounds in drug discovery pipelines, and it can also be used to interpret QSAR models for the design of new compounds with desirable physicochemical and biological properties. With the continuous accumulation of more experimental data, data-driven computational systems which can derive representative substructures from large chemical libraries attract more attention. Therefore, the development of an integrated and convenient tool to generate and implement representative substructures is urgently needed.

RESULTS

In this study, PySmash, a user-friendly and powerful tool to generate different types of representative substructures, was developed. The current version of PySmash provides both a Python package and an individual executable program, which achieves ease of operation and pipeline integration. Three types of substructure generation algorithms, including circular, path-based and functional group-based algorithms, are provided. Users can conveniently customize their own requirements for substructure size, accuracy and coverage, statistical significance and parallel computation during execution. Besides, PySmash provides the function for external data screening.

CONCLUSION

PySmash, a user-friendly and integrated tool for the automatic generation and implementation of representative substructures, is presented. Three screening examples, including toxicophore derivation, privileged motif detection and the integration of substructures with machine learning (ML) models, are provided to illustrate the utility of PySmash in safety profile evaluation, therapeutic activity exploration and molecular optimization, respectively. Its executable program and Python package are available at https://github.com/kotori-y/pySmash.

摘要

背景

子结构筛选被广泛应用于评估药物发现管道中化合物的分子效力和 ADMET 性质,也可用于解释 QSAR 模型,以设计具有理想物理化学和生物性质的新化合物。随着越来越多的实验数据的不断积累,能够从大型化学库中提取代表性子结构的数据驱动计算系统受到了更多的关注。因此,迫切需要开发一种集成和方便的工具来生成和实施代表性子结构。

结果

本研究开发了一种用户友好且功能强大的生成不同类型代表性子结构的工具 PySmash。当前版本的 PySmash 提供了 Python 包和独立的可执行程序,实现了操作简便和流水线集成。提供了三种子结构生成算法,包括循环、基于路径和基于官能团的算法。用户可以在执行过程中方便地自定义对子结构大小、准确性和覆盖率、统计显著性和并行计算的要求。此外,PySmash 还提供了外部数据筛选功能。

结论

本研究提出了一种用户友好且集成的代表性子结构自动生成和实施工具 PySmash。提供了三个筛选示例,包括毒性原子推导、特权基检测以及子结构与机器学习(ML)模型的集成,分别说明了 PySmash 在安全性评估、治疗活性探索和分子优化方面的应用。其可执行程序和 Python 包可在 https://github.com/kotori-y/pySmash 上获得。

相似文献

1
PySmash: Python package and individual executable program for representative substructure generation and application.PySmash:用于代表性子结构生成和应用的 Python 包和独立可执行程序。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab017.
2
Scopy: an integrated negative design python library for desirable HTS/VS database design.Scopy:一个集成的负设计 Python 库,用于设计理想的高通量筛选/虚拟筛选数据库。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa194.
3
NeuroPycon: An open-source python toolbox for fast multi-modal and reproducible brain connectivity pipelines.NeuroPycon:一个开源的 Python 工具包,用于快速进行多模态和可重复的脑连接管道。
Neuroimage. 2020 Oct 1;219:117020. doi: 10.1016/j.neuroimage.2020.117020. Epub 2020 Jun 6.
4
ADMET-AI: a machine learning ADMET platform for evaluation of large-scale chemical libraries.ADMET-AI:用于评估大规模化学文库的机器学习 ADMET 平台。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae416.
5
Bambu and its applications in the discovery of active molecules against melanoma.竹及其在发现抗黑色素瘤活性分子中的应用。
J Mol Graph Model. 2023 Nov;124:108564. doi: 10.1016/j.jmgm.2023.108564. Epub 2023 Jul 11.
6
ChemFLuo: a web-server for structure analysis and identification of fluorescent compounds.ChemFLuo:一个用于荧光化合物结构分析和鉴定的网络服务器。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa282.
7
ChemoPy: freely available python package for computational biology and chemoinformatics.ChemoPy:可用于计算生物学和化学信息学的免费 Python 包。
Bioinformatics. 2013 Apr 15;29(8):1092-4. doi: 10.1093/bioinformatics/btt105. Epub 2013 Mar 14.
8
PyBoolNet: a python package for the generation, analysis and visualization of boolean networks.PyBoolNet:一个用于布尔网络生成、分析和可视化的Python软件包。
Bioinformatics. 2017 Mar 1;33(5):770-772. doi: 10.1093/bioinformatics/btw682.
9
The DynaSig-ML Python package: automated learning of biomolecular dynamics-function relationships.DynaSig-ML Python 包:生物分子动力学-功能关系的自动化学习。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad180.
10
PhyInformR: phylogenetic experimental design and phylogenomic data exploration in R.PhyInformR:R语言中的系统发育实验设计与系统发育基因组数据探索
BMC Evol Biol. 2016 Dec 1;16(1):262. doi: 10.1186/s12862-016-0837-3.

引用本文的文献

1
ChemFH: an integrated tool for screening frequent false positives in chemical biology and drug discovery.ChemFH:一种用于筛选化学生物学和药物发现中常见假阳性的综合工具。
Nucleic Acids Res. 2024 Jul 5;52(W1):W439-W449. doi: 10.1093/nar/gkae424.
2
ChemMORT: an automatic ADMET optimization platform using deep learning and multi-objective particle swarm optimization.ChemMORT:一个使用深度学习和多目标粒子群优化算法的自动 ADMET 优化平台。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae008.
3
Prediction and Screening Model for Products Based on Fusion Regression and XGBoost Classification.基于融合回归和 XGBoost 分类的产品预测和筛选模型。
Comput Intell Neurosci. 2022 Jul 31;2022:4987639. doi: 10.1155/2022/4987639. eCollection 2022.