• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用RDKit和NetworkX可视化化学空间网络。

Visualizing chemical space networks with RDKit and NetworkX.

作者信息

Scalfani Vincent F, Patel Vishank D, Fernandez Avery M

机构信息

University Libraries, Rodgers Library for Science and Engineering, The University of Alabama, Tuscaloosa, AL, 35487, USA.

出版信息

J Cheminform. 2022 Dec 28;14(1):87. doi: 10.1186/s13321-022-00664-x.

DOI:10.1186/s13321-022-00664-x
PMID:36578091
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9798653/
Abstract

This article demonstrates how to create Chemical Space Networks (CSNs) using a Python RDKit and NetworkX workflow. CSNs are a type of network visualization that depict compounds as nodes connected by edges, defined as a pairwise relationship such as a 2D fingerprint similarity value. A step by step approach is presented for creating two different CSNs in this manuscript, one based on RDKit 2D fingerprint Tanimoto similarity values, and another based on maximum common substructure similarity values. Several different CSN visualization features are included in the tutorial including methods to represent nodes with color based on bioactivity attribute value, edges with different line styles based on similarity value, as well as replacing the circle nodes with 2D structure depictions. Finally, some common network property and analysis calculations are presented including the clustering coefficient, degree assortativity, and modularity. All code is provided in the form of Jupyter Notebooks and is available on GitHub with a permissive BSD-3 open-source license: https://github.com/vfscalfani/CSN_tutorial.

摘要

本文演示了如何使用Python的RDKit和NetworkX工作流程创建化学空间网络(CSN)。CSN是一种网络可视化类型,将化合物描绘为通过边连接的节点,边定义为一种成对关系,如二维指纹相似性值。本文提出了一种逐步方法来创建两种不同的CSN,一种基于RDKit二维指纹的Tanimoto相似性值,另一种基于最大公共子结构相似性值。教程中包含了几种不同的CSN可视化功能,包括根据生物活性属性值用颜色表示节点、根据相似性值用不同线型表示边,以及用二维结构描绘替换圆形节点。最后,介绍了一些常见的网络属性和分析计算,包括聚类系数、度相关性和模块性。所有代码均以Jupyter Notebook的形式提供,并在GitHub上以宽松的BSD-3开源许可证发布:https://github.com/vfscalfani/CSN_tutorial 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/fc1deb6e0c02/13321_2022_664_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/071394d4c58f/13321_2022_664_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/e01864eb80aa/13321_2022_664_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/6702833d7092/13321_2022_664_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/b69c96eee544/13321_2022_664_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/bbc4bcb332f8/13321_2022_664_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/fc1deb6e0c02/13321_2022_664_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/071394d4c58f/13321_2022_664_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/e01864eb80aa/13321_2022_664_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/6702833d7092/13321_2022_664_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/b69c96eee544/13321_2022_664_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/bbc4bcb332f8/13321_2022_664_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9d8/9798653/fc1deb6e0c02/13321_2022_664_Fig6_HTML.jpg

相似文献

1
Visualizing chemical space networks with RDKit and NetworkX.使用RDKit和NetworkX可视化化学空间网络。
J Cheminform. 2022 Dec 28;14(1):87. doi: 10.1186/s13321-022-00664-x.
2
Design of chemical space networks using a Tanimoto similarity variant based upon maximum common substructures.基于最大公共子结构的Tanimoto相似性变体设计化学空间网络。
J Comput Aided Mol Des. 2015 Oct;29(10):937-50. doi: 10.1007/s10822-015-9872-1. Epub 2015 Sep 29.
3
Design of chemical space networks on the basis of Tversky similarity.基于特沃斯基相似性的化学空间网络设计。
J Comput Aided Mol Des. 2016 Jan;30(1):1-12. doi: 10.1007/s10822-015-9891-y. Epub 2015 Dec 22.
4
Tracing compound pathways using chemical space networks.使用化学空间网络追踪复合途径。
Medchemcomm. 2016 Dec 23;8(2):376-384. doi: 10.1039/c6md00628k. eCollection 2017 Feb 1.
5
Comparison of bioactive chemical space networks generated using substructure- and fingerprint-based measures of molecular similarity.使用基于子结构和指纹的分子相似性度量生成的生物活性化学空间网络的比较。
J Comput Aided Mol Des. 2015 Jul;29(7):595-608. doi: 10.1007/s10822-015-9852-5. Epub 2015 Jun 7.
6
Design and characterization of chemical space networks for different compound data sets.针对不同化合物数据集的化学空间网络的设计与表征
J Comput Aided Mol Des. 2015 Feb;29(2):113-25. doi: 10.1007/s10822-014-9821-4. Epub 2014 Dec 3.
7
Design of chemical space networks incorporating compound distance relationships.纳入化合物距离关系的化学空间网络设计。
F1000Res. 2016 Nov 4;5. doi: 10.12688/f1000research.10021.2. eCollection 2016.
8
ScaffoldGraph: an open-source library for the generation and analysis of molecular scaffold networks and scaffold trees.支架图:用于生成和分析分子支架网络和支架树的开源库。
Bioinformatics. 2020 Jun 1;36(12):3930-3931. doi: 10.1093/bioinformatics/btaa219.
9
Interactive network visualization in Jupyter notebooks: visJS2jupyter.交互式网络可视化在 Jupyter 笔记本:visJS2jupyter。
Bioinformatics. 2018 Jan 1;34(1):126-128. doi: 10.1093/bioinformatics/btx581.
10
ccbmlib - a Python package for modeling Tanimoto similarity value distributions.ccbmlib - 一个用于对谷本相似度值分布进行建模的Python包。
F1000Res. 2020 Feb 10;9. doi: 10.12688/f1000research.22292.2. eCollection 2020.

引用本文的文献

1
Scaffold and SAR studies on c-MET inhibitors using machine learning approaches.使用机器学习方法对c-MET抑制剂进行支架和构效关系研究。
J Pharm Anal. 2025 Jun;15(6):101303. doi: 10.1016/j.jpha.2025.101303. Epub 2025 Apr 10.
2
Molecular Mediated Angiogenesis and Vasculogenesis Networks.分子介导的血管生成和血管发生网络
Int J Mol Sci. 2025 Jun 30;26(13):6316. doi: 10.3390/ijms26136316.
3
Predicting the Experimental Emission Spectra of Fluorescent Organic Semiconductors by Ensemble Machine Learning Analysis.通过集成机器学习分析预测荧光有机半导体的实验发射光谱

本文引用的文献

1
Diversity and Chemical Library Networks of Large Data Sets.大数据集的多样性和化学文库网络。
J Chem Inf Model. 2022 May 9;62(9):2186-2201. doi: 10.1021/acs.jcim.1c01013. Epub 2021 Nov 1.
2
An open source chemical structure curation pipeline using RDKit.一个使用RDKit的开源化学结构编目流程。
J Cheminform. 2020 Sep 1;12(1):51. doi: 10.1186/s13321-020-00456-1.
3
rdScaffoldNetwork: The Scaffold Network Implementation in RDKit.rdScaffoldNetwork:RDKit 中的支架网络实现。
J Fluoresc. 2025 Jun 13. doi: 10.1007/s10895-025-04395-8.
4
Machine Learning-Driven Consensus Modeling for Activity Ranking and Chemical Landscape Analysis of HIV-1 Inhibitors.机器学习驱动的HIV-1抑制剂活性排名及化学景观分析的共识建模
Pharmaceuticals (Basel). 2025 May 13;18(5):714. doi: 10.3390/ph18050714.
5
First report on analysis of chemical space, scaffold diversity, critical structural features of HDAC11 inhibitors.关于组蛋白去乙酰化酶11(HDAC11)抑制剂的化学空间分析、骨架多样性及关键结构特征的首次报告
Mol Divers. 2025 May 17. doi: 10.1007/s11030-025-11217-3.
6
Machine learning-based bioactivity prediction of porphyrin derivatives: molecular descriptors, clustering, and model evaluation.基于机器学习的卟啉衍生物生物活性预测:分子描述符、聚类和模型评估。
Photochem Photobiol Sci. 2025 May 15. doi: 10.1007/s43630-025-00733-8.
7
Labels as a feature: Network homophily for systematically annotating human GPCR drug-target interactions.作为一种特征的标签:用于系统注释人类GPCR药物-靶点相互作用的网络同质性
Nat Commun. 2025 May 3;16(1):4121. doi: 10.1038/s41467-025-59418-6.
8
PPARγ modulator predictor (PGMP_v1): chemical space exploration and computational insights for enhanced type 2 diabetes mellitus management.过氧化物酶体增殖物激活受体γ调节剂预测器(PGMP_v1):用于改善2型糖尿病管理的化学空间探索与计算洞察
Mol Divers. 2025 Feb 1. doi: 10.1007/s11030-025-11118-5.
9
cyclicpeptide: a Python package for cyclic peptide drug design.环肽:用于环肽药物设计的Python软件包。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae714.
10
A membrane permeability database for nonpeptidic macrocycles.一个非肽大环化合物的膜通透性数据库。
Sci Data. 2025 Jan 3;12(1):10. doi: 10.1038/s41597-024-04302-z.
J Chem Inf Model. 2020 Jul 27;60(7):3331-3335. doi: 10.1021/acs.jcim.0c00296. Epub 2020 Jul 7.
4
Drug Research Meets Network Science: Where Are We?药物研究与网络科学的结合:我们在哪里?
J Med Chem. 2020 Aug 27;63(16):8653-8666. doi: 10.1021/acs.jmedchem.9b01989. Epub 2020 May 8.
5
Tracing compound pathways using chemical space networks.使用化学空间网络追踪复合途径。
Medchemcomm. 2016 Dec 23;8(2):376-384. doi: 10.1039/c6md00628k. eCollection 2017 Feb 1.
6
Combining Similarity Searching and Network Analysis for the Identification of Active Compounds.结合相似性搜索与网络分析以鉴定活性化合物。
ACS Omega. 2018 Apr 30;3(4):3768-3777. doi: 10.1021/acsomega.8b00344. Epub 2018 Apr 3.
7
Exploring sets of molecules from patents and relationships to other active compounds in chemical space networks.探索来自专利的分子集合以及在化学空间网络中与其他活性化合物的关系。
J Comput Aided Mol Des. 2017 Sep;31(9):779-788. doi: 10.1007/s10822-017-0061-2. Epub 2017 Sep 4.
8
The ChEMBL database in 2017.2017年的ChEMBL数据库。
Nucleic Acids Res. 2017 Jan 4;45(D1):D945-D954. doi: 10.1093/nar/gkw1074. Epub 2016 Nov 28.
9
Chemical space visualization: transforming multidimensional chemical spaces into similarity-based molecular networks.化学空间可视化:将多维化学空间转化为基于相似性的分子网络。
Future Med Chem. 2016 Sep;8(14):1769-78. doi: 10.4155/fmc-2016-0023. Epub 2016 Aug 30.
10
Trust, but Verify II: A Practical Guide to Chemogenomics Data Curation.《信任,但需验证II:化学基因组学数据管理实用指南》
J Chem Inf Model. 2016 Jul 25;56(7):1243-52. doi: 10.1021/acs.jcim.6b00129. Epub 2016 Jun 22.