• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大数据集的多样性和化学文库网络。

Diversity and Chemical Library Networks of Large Data Sets.

机构信息

Department of Chemistry, University of Florida, Gainesville, Florida 32611, United States.

Department of Medicinal Chemistry, University of Florida, Gainesville, Florida 32610, United States.

出版信息

J Chem Inf Model. 2022 May 9;62(9):2186-2201. doi: 10.1021/acs.jcim.1c01013. Epub 2021 Nov 1.

DOI:10.1021/acs.jcim.1c01013
PMID:34723537
Abstract

The quantification of chemical diversity has many applications in drug discovery, organic chemistry, food, and natural product chemistry, to name a few. As the size of the chemical space is expanding rapidly, it is imperative to develop efficient methods to quantify the diversity of large and ultralarge chemical libraries and visualize their mutual relationships in chemical space. Herein, we show an application of our recently introduced extended similarity indices to measure the fingerprint-based diversity of 19 chemical libraries typically used in drug discovery and natural products research with over 18 million compounds. Based on this concept, we introduce the Chemical Library Networks (CLNs) as a general and efficient framework to represent visually the chemical space of large chemical libraries providing a global perspective of the relation between the libraries. For the 19 compound libraries explored in this work, it was found that the (extended) Tanimoto index offers the best description of extended similarity in combination with RDKit fingerprints. CLNs are general and can be explored with any structure representation and similarity coefficient for large chemical libraries.

摘要

化学多样性的量化在药物发现、有机化学、食品和天然产物化学等领域有许多应用。随着化学空间的规模迅速扩大,开发有效方法来量化大型和超大型化学文库的多样性并在化学空间中可视化它们的相互关系势在必行。在此,我们展示了我们最近引入的扩展相似性指数在测量 19 个通常用于药物发现和天然产物研究的化学文库的基于指纹的多样性方面的应用,这些文库包含超过 1800 万个化合物。基于这一概念,我们引入了化学文库网络 (CLN),作为一个通用且高效的框架来直观地表示大型化学文库的化学空间,提供了库之间关系的全局视角。对于这项工作中探索的 19 个化合物库,发现(扩展)Tanimoto 指数与 RDKit 指纹相结合,提供了对扩展相似性的最佳描述。CLN 是通用的,可以与任何结构表示和相似系数一起用于大型化学文库。

相似文献

1
Diversity and Chemical Library Networks of Large Data Sets.大数据集的多样性和化学文库网络。
J Chem Inf Model. 2022 May 9;62(9):2186-2201. doi: 10.1021/acs.jcim.1c01013. Epub 2021 Nov 1.
2
DNA-encoded chemical libraries: advancing beyond conventional small-molecule libraries.DNA 编码化学文库:超越传统小分子文库。
Acc Chem Res. 2014 Apr 15;47(4):1247-55. doi: 10.1021/ar400284t. Epub 2014 Mar 28.
3
Analysing and Navigating Natural Products Space for Generating Small, Diverse, But Representative Chemical Libraries.分析和导航天然产物空间,以生成小而多样但具有代表性的化学文库。
Biotechnol J. 2018 Jan;13(1). doi: 10.1002/biot.201700503. Epub 2017 Dec 6.
4
Chemical Multiverse: An Expanded View of Chemical Space.化学多元宇宙:化学空间的扩展视角。
Mol Inform. 2022 Nov;41(11):e2200116. doi: 10.1002/minf.202200116. Epub 2022 Aug 23.
5
Fragment Library of Natural Products and Compound Databases for Drug Discovery.天然产物片段库和化合物数据库在药物发现中的应用。
Biomolecules. 2020 Nov 6;10(11):1518. doi: 10.3390/biom10111518.
6
Exploring activity landscapes with extended similarity: is Tanimoto enough?用扩展相似度探索活动景观:Tanimoto 足够吗?
Mol Inform. 2023 Jul;42(7):e2300056. doi: 10.1002/minf.202300056. Epub 2023 Jun 7.
7
A Fragment Library of Natural Products and its Comparative Chemoinformatic Characterization.天然产物片段库及其比较化学信息学特征分析。
Mol Inform. 2020 Nov;39(11):e2000050. doi: 10.1002/minf.202000050. Epub 2020 Apr 29.
8
Charting, navigating, and populating natural product chemical space for drug discovery.为药物发现绘制、导航和填充天然产物化学空间。
J Med Chem. 2012 Jul 12;55(13):5989-6001. doi: 10.1021/jm300288g. Epub 2012 May 11.
9
Fragment-based screening with natural products for novel anti-parasitic disease drug discovery.基于天然产物的片段筛选在新型抗寄生虫病药物发现中的应用。
Expert Opin Drug Discov. 2019 Dec;14(12):1283-1295. doi: 10.1080/17460441.2019.1653849. Epub 2019 Sep 12.
10
The Symbiotic Relationship Between Drug Discovery and Organic Chemistry.药物发现与有机化学的共生关系。
Chemistry. 2020 Jan 27;26(6):1196-1237. doi: 10.1002/chem.201903232. Epub 2019 Oct 30.

引用本文的文献

1
Extended Quality (eQual): Radial Threshold Clustering Based on -ary Similarity.扩展质量(eQual):基于 - 元相似度的径向阈值聚类
J Chem Inf Model. 2025 May 26;65(10):5062-5070. doi: 10.1021/acs.jcim.4c02341. Epub 2025 May 1.
2
Molecular similarity: Theory, applications, and perspectives.分子相似性:理论、应用与展望。
Artif Intell Chem. 2024 Dec;2(2). doi: 10.1016/j.aichem.2024.100077. Epub 2024 Aug 31.
3
Virtual screening: hope, hype, and the fine line in between.虚拟筛选:希望、炒作与二者之间的微妙界限。
Expert Opin Drug Discov. 2025 Feb;20(2):145-162. doi: 10.1080/17460441.2025.2458666. Epub 2025 Jan 27.
4
Coverage bias in small molecule machine learning.小分子机器学习中的覆盖偏差
Nat Commun. 2025 Jan 9;16(1):554. doi: 10.1038/s41467-024-55462-w.
5
Extended Quality (eQual): Radial threshold clustering based on n-ary similarity.扩展质量(eQual):基于n元相似度的径向阈值聚类
bioRxiv. 2024 Dec 5:2024.12.05.627001. doi: 10.1101/2024.12.05.627001.
6
Protein Retrieval via Integrative Molecular Ensembles (PRIME) through Extended Similarity Indices.通过扩展相似性指数的综合分子组合(PRIME)进行蛋白质提取。
J Chem Theory Comput. 2024 Jul 23;20(14):6303-6315. doi: 10.1021/acs.jctc.4c00362. Epub 2024 Jul 8.
7
iSIM: instant similarity.iSIM:即时相似度。
Digit Discov. 2024 May 7;3(6):1160-1171. doi: 10.1039/d4dd00041b. eCollection 2024 Jun 12.
8
Extended similarity methods for efficient data mining in imaging mass spectrometry.用于成像质谱中高效数据挖掘的扩展相似性方法。
Digit Discov. 2024 Mar 27;3(4):805-817. doi: 10.1039/d3dd00165b. eCollection 2024 Apr 17.
9
Making sense of chemical space network shows signs of criticality.理解化学空间网络显示出临界性的迹象。
Sci Rep. 2023 Dec 4;13(1):21335. doi: 10.1038/s41598-023-48107-3.
10
Molecular Property Diagnostic Suite Compound Library (MPDS-CL): a structure-based classification of the chemical space.分子性质诊断套件化合物库(MPDS-CL):化学空间的基于结构的分类
Mol Divers. 2024 Oct;28(5):3243-3259. doi: 10.1007/s11030-023-10752-1. Epub 2023 Oct 30.