• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NP-Scout:用于小分子天然产物相似性定量和可视化的机器学习方法。

NP-Scout: Machine Learning Approach for the Quantification and Visualization of the Natural Product-Likeness of Small Molecules.

机构信息

Center for Bioinformatics (ZBH), Department of Informatics, Faculty of Mathematics, Informatics and Natural Sciences, Universität Hamburg, 20146 Hamburg, Germany.

Department of Chemistry, University of Bergen, 5007 Bergen, Norway.

出版信息

Biomolecules. 2019 Jan 24;9(2):43. doi: 10.3390/biom9020043.

DOI:10.3390/biom9020043
PMID:30682850
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6406893/
Abstract

Natural products (NPs) remain the most prolific resource for the development of smallmolecule drugs. Here we report a new machine learning approach that allows the identification of natural products with high accuracy. The method also generates similarity maps, which highlight atoms that contribute significantly to the classification of small molecules as a natural product or synthetic molecule. The method can hence be utilized to (i) identify natural products in large molecular libraries, (ii) quantify the natural product-likeness of small molecules, and (iii) visualize atoms in small molecules that are characteristic of natural products or synthetic molecules. The models are based on random forest classifiers trained on data sets consisting of more than 265,000 to 322,000 natural products and synthetic molecules. Two-dimensional molecular descriptors, MACCS keys and Morgan2 fingerprints were explored. On an independent test set the models reached areas under the receiver operating characteristic curve (AUC) of 0.997 and Matthews correlation coefficients (MCCs) of 0.954 and higher. The method was further tested on data from the Dictionary of Natural Products, ChEMBL and other resources. The best-performing models are accessible as a free web service at http://npscout.zbh.uni-hamburg.de/npscout.

摘要

天然产物(NPs)仍然是小分子药物开发的最丰富资源。在这里,我们报告了一种新的机器学习方法,可实现高精度地识别天然产物。该方法还生成相似度图,突出对将小分子分类为天然产物或合成分子有重大贡献的原子。因此,该方法可用于(i)在大型分子文库中识别天然产物,(ii)量化小分子的天然产物相似性,以及(iii)可视化小分子中具有天然产物或合成分子特征的原子。这些模型是基于随机森林分类器构建的,训练数据集中包含超过 265000 至 322000 个天然产物和合成分子。我们探索了二维分子描述符、MACCS 键和 Morgan2 指纹。在独立测试集上,这些模型的接收者操作特征曲线(ROC)下面积(AUC)达到 0.997,马修斯相关系数(MCC)达到 0.954 及以上。该方法还在天然产物词典、ChEMBL 和其他资源的数据上进行了测试。性能最佳的模型可在免费的网络服务 http://npscout.zbh.uni-hamburg.de/npscout 上获得。

相似文献

1
NP-Scout: Machine Learning Approach for the Quantification and Visualization of the Natural Product-Likeness of Small Molecules.NP-Scout:用于小分子天然产物相似性定量和可视化的机器学习方法。
Biomolecules. 2019 Jan 24;9(2):43. doi: 10.3390/biom9020043.
2
Hit Dexter 2.0: Machine-Learning Models for the Prediction of Frequent Hitters.命中德克斯特 2.0:用于预测高频命中者的机器学习模型。
J Chem Inf Model. 2019 Mar 25;59(3):1030-1043. doi: 10.1021/acs.jcim.8b00677. Epub 2019 Jan 25.
3
Predictive classifier models built from natural products with antimalarial bioactivity using machine learning approach.基于具有抗疟生物活性的天然产物,采用机器学习方法构建的预测分类器模型。
PLoS One. 2018 Sep 28;13(9):e0204644. doi: 10.1371/journal.pone.0204644. eCollection 2018.
4
Natural product-likeness score and its application for prioritization of compound libraries.天然产物相似性评分及其在化合物库优先级排序中的应用。
J Chem Inf Model. 2008 Jan;48(1):68-74. doi: 10.1021/ci700286x. Epub 2007 Nov 23.
5
Natural product-likeness score revisited: an open-source, open-data implementation.重新审视天然产物相似度评分:一个开源、开放数据的实现。
BMC Bioinformatics. 2012 May 20;13:106. doi: 10.1186/1471-2105-13-106.
6
STarFish: A Stacked Ensemble Target Fishing Approach and its Application to Natural Products.STarFish:一种堆叠集成目标捕捞方法及其在天然产物中的应用。
J Chem Inf Model. 2019 Nov 25;59(11):4906-4920. doi: 10.1021/acs.jcim.9b00489. Epub 2019 Oct 24.
7
Temporal bone radiology report classification using open source machine learning and natural langue processing libraries.使用开源机器学习和自然语言处理库对颞骨放射学报告进行分类
BMC Med Inform Decis Mak. 2016 Jun 6;16:65. doi: 10.1186/s12911-016-0306-3.
8
Computational models for the classification of mPGES-1 inhibitors with fingerprint descriptors.基于指纹描述符的 mPGES-1 抑制剂分类的计算模型。
Mol Divers. 2017 Aug;21(3):661-675. doi: 10.1007/s11030-017-9743-x. Epub 2017 May 8.
9
HDAC3i-Finder: A Machine Learning-based Computational Tool to Screen for HDAC3 Inhibitors.HDAC3i-Finder:一种基于机器学习的计算工具,用于筛选 HDAC3 抑制剂。
Mol Inform. 2021 Mar;40(3):e2000105. doi: 10.1002/minf.202000105. Epub 2020 Nov 23.
10
Construction of a 3D-shaped, natural product like fragment library by fragmentation and diversification of natural products.通过天然产物的碎片化和多样化构建三维形状的类天然产物片段库。
Bioorg Med Chem. 2017 Feb 1;25(3):921-925. doi: 10.1016/j.bmc.2016.12.005. Epub 2016 Dec 8.

引用本文的文献

1
Integrating data augmentation and BERT-based deep learning for predicting alpha-glucosidase inhibitors derived from Black Cohosh.整合数据增强和基于BERT的深度学习以预测源自黑升麻的α-葡萄糖苷酶抑制剂。
Sci Rep. 2025 Aug 27;15(1):31536. doi: 10.1038/s41598-025-14699-1.
2
Unraveling Nature's Pharmacy: Transforming Medicinal Plants into Modern Therapeutic Agents.揭开自然药房的奥秘:将药用植物转化为现代治疗药物。
Pharmaceutics. 2025 Jun 7;17(6):754. doi: 10.3390/pharmaceutics17060754.
3
Fragment Libraries from Large and Novel Synthetic Compounds and Natural Products: A Comparative Chemoinformatic Analysis.

本文引用的文献

1
Characterization of the Chemical Space of Known and Readily Obtainable Natural Products.已知和易得天然产物的化学空间特征描述。
J Chem Inf Model. 2018 Aug 27;58(8):1518-1532. doi: 10.1021/acs.jcim.8b00302. Epub 2018 Aug 1.
2
Computer-Assisted Discovery of Retinoid X Receptor Modulating Natural Products and Isofunctional Mimetics.计算机辅助发现视黄醇 X 受体调节剂天然产物和同功能模拟物。
J Med Chem. 2018 Jun 28;61(12):5442-5447. doi: 10.1021/acs.jmedchem.8b00494. Epub 2018 Jun 14.
3
Statistical Investigation into the Structural Complementarity of Natural Products and Synthetic Compounds.
来自大型新型合成化合物和天然产物的片段库:比较化学信息学分析
ACS Omega. 2025 Apr 16;10(16):16921-16937. doi: 10.1021/acsomega.5c01420. eCollection 2025 Apr 29.
4
Predicting and Explaining Yields with Machine Learning for Carboxylated Azoles and Beyond.利用机器学习预测和解释羧基化唑类及其他物质的产率
J Chem Inf Model. 2025 Feb 24;65(4):1862-1872. doi: 10.1021/acs.jcim.4c02336. Epub 2025 Feb 7.
5
Artificial Intelligence in Natural Product Drug Discovery: Current Applications and Future Perspectives.天然产物药物发现中的人工智能:当前应用与未来展望。
J Med Chem. 2025 Feb 27;68(4):3948-3969. doi: 10.1021/acs.jmedchem.4c01257. Epub 2025 Feb 6.
6
Time-Dependent Comparison of the Structural Variations of Natural Products and Synthetic Compounds.时间依赖性比较天然产物和合成化合物的结构变化。
Int J Mol Sci. 2024 Oct 25;25(21):11475. doi: 10.3390/ijms252111475.
7
Advances, opportunities, and challenges in methods for interrogating the structure activity relationships of natural products.天然产物结构-活性关系研究方法的进展、机遇与挑战。
Nat Prod Rep. 2024 Oct 17;41(10):1543-1578. doi: 10.1039/d4np00009a.
8
A Multi-Label Learning Framework for Predicting Chemical Classes and Biological Activities of Natural Products from Biosynthetic Gene Clusters.一种用于从生物合成基因簇预测天然产物化学类别和生物活性的多标签学习框架。
J Chem Ecol. 2023 Dec;49(11-12):681-695. doi: 10.1007/s10886-023-01452-z. Epub 2023 Oct 2.
9
School of cheminformatics in Latin America.拉丁美洲化学信息学学院。
J Cheminform. 2023 Sep 19;15(1):82. doi: 10.1186/s13321-023-00758-0.
10
Artificial intelligence for natural product drug discovery.人工智能在天然产物药物发现中的应用。
Nat Rev Drug Discov. 2023 Nov;22(11):895-916. doi: 10.1038/s41573-023-00774-7. Epub 2023 Sep 11.
天然产物与合成化合物结构互补性的统计研究。
Angew Chem Int Ed Engl. 1999 Mar 1;38(5):643-647. doi: 10.1002/(SICI)1521-3773(19990301)38:5<643::AID-ANIE643>3.0.CO;2-G.
4
Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters.命中德克斯特:一个用于预测高频击球手的机器学习模型。
ChemMedChem. 2018 Mar 20;13(6):564-571. doi: 10.1002/cmdc.201700673. Epub 2018 Feb 1.
5
Data Resources for the Computer-Guided Discovery of Bioactive Natural Products.用于计算机辅助发现生物活性天然产物的数据资源。
J Chem Inf Model. 2017 Sep 25;57(9):2099-2111. doi: 10.1021/acs.jcim.7b00341. Epub 2017 Aug 30.
6
NuBBE: an updated database to uncover chemical and biological information from Brazilian biodiversity.NuBBE:一个从巴西生物多样性中挖掘化学和生物信息的更新数据库。
Sci Rep. 2017 Aug 3;7(1):7215. doi: 10.1038/s41598-017-07451-x.
7
NANPDB: A Resource for Natural Products from Northern African Sources.NANPDB:来自北非来源的天然产物资源。
J Nat Prod. 2017 Jul 28;80(7):2067-2076. doi: 10.1021/acs.jnatprod.7b00283. Epub 2017 Jun 22.
8
NPCARE: database of natural products and fractional extracts for cancer regulation.NPCARE:用于癌症调控的天然产物和分级提取物数据库。
J Cheminform. 2017 Jan 5;9:2. doi: 10.1186/s13321-016-0188-5. eCollection 2017.
9
De-orphaning the marine natural product (±)-marinopyrrole A by computational target prediction and biochemical validation.通过计算靶点预测和生化验证鉴定海洋天然产物(±)-marinepyrrole A的作用靶点
Chem Commun (Camb). 2017 Feb 14;53(14):2272-2274. doi: 10.1039/c6cc09693j.
10
Cheminformatics Based Machine Learning Models for AMA1-RON2 Abrogators for Inhibiting Plasmodium falciparum Erythrocyte Invasion.基于化学信息学的机器学习模型用于寻找AMA1-RON2阻断剂以抑制恶性疟原虫红细胞入侵
Mol Inform. 2015 Oct;34(10):655-64. doi: 10.1002/minf.201400139. Epub 2015 Jul 17.