• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

化学开发工具包(CDK)v2.0:原子类型标注、描绘、分子式及子结构搜索。

The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching.

作者信息

Willighagen Egon L, Mayfield John W, Alvarsson Jonathan, Berg Arvid, Carlsson Lars, Jeliazkova Nina, Kuhn Stefan, Pluskal Tomáš, Rojas-Chertó Miquel, Spjuth Ola, Torrance Gilleain, Evelo Chris T, Guha Rajarshi, Steinbeck Christoph

机构信息

Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, 6200 MD, Maastricht, The Netherlands.

NextMove Software Ltd, Cambridge, CB4 0EY, UK.

出版信息

J Cheminform. 2017 Jun 6;9(1):33. doi: 10.1186/s13321-017-0220-4.

DOI:10.1186/s13321-017-0220-4
PMID:29086040
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5461230/
Abstract

BACKGROUND

The Chemistry Development Kit (CDK) is a widely used open source cheminformatics toolkit, providing data structures to represent chemical concepts along with methods to manipulate such structures and perform computations on them. The library implements a wide variety of cheminformatics algorithms ranging from chemical structure canonicalization to molecular descriptor calculations and pharmacophore perception. It is used in drug discovery, metabolomics, and toxicology. Over the last 10 years, the code base has grown significantly, however, resulting in many complex interdependencies among components and poor performance of many algorithms.

RESULTS

We report improvements to the CDK v2.0 since the v1.2 release series, specifically addressing the increased functional complexity and poor performance. We first summarize the addition of new functionality, such atom typing and molecular formula handling, and improvement to existing functionality that has led to significantly better performance for substructure searching, molecular fingerprints, and rendering of molecules. Second, we outline how the CDK has evolved with respect to quality control and the approaches we have adopted to ensure stability, including a code review mechanism.

CONCLUSIONS

This paper highlights our continued efforts to provide a community driven, open source cheminformatics library, and shows that such collaborative projects can thrive over extended periods of time, resulting in a high-quality and performant library. By taking advantage of community support and contributions, we show that an open source cheminformatics project can act as a peer reviewed publishing platform for scientific computing software. Graphical abstract CDK 2.0 provides new features and improved performance.

摘要

背景

化学开发工具包(CDK)是一个广泛使用的开源化学信息学工具包,它提供表示化学概念的数据结构以及操作这些结构并对其进行计算的方法。该库实现了从化学结构规范化到分子描述符计算和药效团识别等各种各样的化学信息学算法。它被用于药物发现、代谢组学和毒理学领域。然而,在过去10年中,代码库显著增长,导致组件之间存在许多复杂的相互依赖关系,并且许多算法的性能不佳。

结果

我们报告了自v1.2发布系列以来CDK v2.0的改进情况,特别针对功能复杂性增加和性能不佳的问题。我们首先总结了新功能的添加,如原子类型和分子式处理,以及对现有功能的改进,这些改进显著提高了子结构搜索、分子指纹和分子渲染的性能。其次,我们概述了CDK在质量控制方面的发展以及我们为确保稳定性所采用的方法,包括代码审查机制。

结论

本文强调了我们持续努力提供一个由社区驱动的开源化学信息学库,并表明这样的合作项目可以长期蓬勃发展,从而产生一个高质量且高性能的库。通过利用社区支持和贡献,我们表明一个开源化学信息学项目可以充当科学计算软件的同行评审发布平台。图形摘要CDK 2.0提供了新功能并提高了性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/bb68d29ad05b/13321_2017_220_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/c3416ce61b04/13321_2017_220_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/ebff4f64616f/13321_2017_220_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/7ae6d219c5b0/13321_2017_220_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/061a6541dafa/13321_2017_220_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/3077d4b27807/13321_2017_220_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/c07b03ab1cf1/13321_2017_220_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/3ab09f8d6b32/13321_2017_220_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/3f389d4628cd/13321_2017_220_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/bb68d29ad05b/13321_2017_220_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/c3416ce61b04/13321_2017_220_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/ebff4f64616f/13321_2017_220_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/7ae6d219c5b0/13321_2017_220_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/061a6541dafa/13321_2017_220_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/3077d4b27807/13321_2017_220_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/c07b03ab1cf1/13321_2017_220_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/3ab09f8d6b32/13321_2017_220_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/3f389d4628cd/13321_2017_220_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465e/5461230/bb68d29ad05b/13321_2017_220_Fig8_HTML.jpg

相似文献

1
The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching.化学开发工具包(CDK)v2.0:原子类型标注、描绘、分子式及子结构搜索。
J Cheminform. 2017 Jun 6;9(1):33. doi: 10.1186/s13321-017-0220-4.
2
New developments on the cheminformatics open workflow environment CDK-Taverna.CDK-Taverna cheminformatics 开放工作流程环境的新进展。
J Cheminform. 2011 Dec 13;3:54. doi: 10.1186/1758-2946-3-54.
3
jCompoundMapper: An open source Java library and command-line tool for chemical fingerprints.jCompoundMapper:一个用于化学指纹的开源 Java 库和命令行工具。
J Cheminform. 2011 Jan 10;3(1):3. doi: 10.1186/1758-2946-3-3.
4
Ambit-SLN: an Open Source Software Library for Processing of Chemical Objects via SLN Linear Notation.Ambit-SLN:一个用于通过SLN线性表示法处理化学对象的开源软件库。
Mol Inform. 2021 Nov;40(11):e2100027. doi: 10.1002/minf.202100027. Epub 2021 Aug 3.
5
Erratum to: The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching.勘误:《化学开发工具包(CDK)v2.0:原子类型、描绘、分子式和子结构搜索》
J Cheminform. 2017 Sep 20;9(1):53. doi: 10.1186/s13321-017-0231-1.
6
CDK-Taverna: an open workflow environment for cheminformatics.CDK-Taverna:一个用于化学信息学的开放工作流环境。
BMC Bioinformatics. 2010 Mar 29;11:159. doi: 10.1186/1471-2105-11-159.
7
KNIME-CDK: Workflow-driven cheminformatics.KNIME-CDK:基于工作流的化学信息学。
BMC Bioinformatics. 2013 Aug 22;14:257. doi: 10.1186/1471-2105-14-257.
8
Applications of the InChI in cheminformatics with the CDK and Bioclipse.InChI 在 cheminformatics 中的应用,包括 CDK 和 Bioclipse。
J Cheminform. 2013 Mar 13;5(1):14. doi: 10.1186/1758-2946-5-14.
9
Open Babel: An open chemical toolbox.Open Babel:一个开放的化学工具箱。
J Cheminform. 2011 Oct 7;3:33. doi: 10.1186/1758-2946-3-33.
10
Efficient ring perception for the Chemistry Development Kit.化学开发工具包的高效环感知。
J Cheminform. 2014 Jan 30;6(1):3. doi: 10.1186/1758-2946-6-3.

引用本文的文献

1
An NMR Database for Organic and Organometallic Compounds.有机和有机金属化合物的核磁共振数据库。
Molecules. 2025 Aug 21;30(16):3442. doi: 10.3390/molecules30163442.
2
The Plasma Membrane may Serve as a Drug Depot to Drive the Extreme Potency of Fentanyl.质膜可能充当药物储存库以增强芬太尼的超强效力。
bioRxiv. 2025 Jul 26:2025.07.22.666146. doi: 10.1101/2025.07.22.666146.
3
Stacking Ensemble Neural Network for Chemical Safety Assessment: A Case Study of Thyroid Peroxidase and Natural Product Screening.用于化学安全评估的堆叠集成神经网络:以甲状腺过氧化物酶和天然产物筛选为例

本文引用的文献

1
BRENDA in 2017: new perspectives and new tools in BRENDA.2017年的BRENDA:BRENDA中的新视角与新工具。
Nucleic Acids Res. 2017 Jan 4;45(D1):D380-D388. doi: 10.1093/nar/gkw952. Epub 2016 Oct 19.
2
Modelling Chemical Reasoning to Predict and Invent Reactions.建立化学推理模型以预测和发明反应。
Chemistry. 2017 May 2;23(25):6118-6128. doi: 10.1002/chem.201604556. Epub 2017 Jan 4.
3
Large-scale ligand-based predictive modelling using support vector machines.使用支持向量机的基于配体的大规模预测建模。
ACS Omega. 2025 Jul 10;10(28):30450-30466. doi: 10.1021/acsomega.5c02188. eCollection 2025 Jul 22.
4
Analysis of metabolomics and transcriptomics data to assess interactions in microalgal co-culture of Skeletonema marinoi and Prymnesium parvum.分析代谢组学和转录组学数据以评估海洋骨条藻和微小原甲藻微藻共培养中的相互作用。
PLoS One. 2025 Jul 28;20(7):e0329115. doi: 10.1371/journal.pone.0329115. eCollection 2025.
5
Targeting Drug Resistance in Cancer: Dimethoxycurcumin as a Functional Antioxidant Targeting ABCC3.靶向癌症中的耐药性:二甲氧基姜黄素作为靶向ABCC3的功能性抗氧化剂
Antioxidants (Basel). 2025 May 16;14(5):599. doi: 10.3390/antiox14050599.
6
PrankWeb 4: a modular web server for protein-ligand binding site prediction and downstream analysis.PrankWeb 4:用于蛋白质-配体结合位点预测及下游分析的模块化网络服务器。
Nucleic Acids Res. 2025 Jul 7;53(W1):W466-W471. doi: 10.1093/nar/gkaf421.
7
Consensus Modeling Strategies for Predicting Transthyretin Binding Affinity from Tox24 Challenge Data.基于Tox24挑战数据预测转甲状腺素蛋白结合亲和力的共识建模策略
Chem Res Toxicol. 2025 May 15. doi: 10.1021/acs.chemrestox.5c00018.
8
Harnessing advanced computational approaches to design novel antimicrobial peptides against intracellular bacterial infections.利用先进的计算方法设计针对细胞内细菌感染的新型抗菌肽。
Bioact Mater. 2025 Apr 28;50:510-524. doi: 10.1016/j.bioactmat.2025.04.016. eCollection 2025 Aug.
9
ProteinsPlus: a publicly available resource for protein structure mining.蛋白质增强版:一个可公开获取的蛋白质结构挖掘资源。
Nucleic Acids Res. 2025 Jul 7;53(W1):W478-W484. doi: 10.1093/nar/gkaf377.
10
BRAFPred: A Novel Approach for Accurate Prediction of the B-Type Rapidly Accelerated Fibrosarcoma Inhibitor.BRAFPred:一种准确预测B型快速加速纤维肉瘤抑制剂的新方法。
ACS Omega. 2025 Mar 21;10(12):12170-12184. doi: 10.1021/acsomega.4c10367. eCollection 2025 Apr 1.
J Cheminform. 2016 Aug 10;8:39. doi: 10.1186/s13321-016-0151-5. eCollection 2016.
4
Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking.通过全球天然产物社会分子网络共享和社区管理质谱数据。
Nat Biotechnol. 2016 Aug 9;34(8):828-837. doi: 10.1038/nbt.3597.
5
ReactPRED: a tool to predict and analyze biochemical reactions.ReactPRED:一种预测和分析生化反应的工具。
Bioinformatics. 2016 Nov 15;32(22):3522-3524. doi: 10.1093/bioinformatics/btw491. Epub 2016 Aug 2.
6
Ambit-Tautomer: An Open Source Tool for Tautomer Generation.Ambit-互变异构体:一种用于生成互变异构体的开源工具。
Mol Inform. 2013 Jun;32(5-6):481-504. doi: 10.1002/minf.201200133. Epub 2013 Jun 3.
7
Rendering Molecular Sketches for Publication Quality Output.生成用于高质量出版输出的分子草图。
Mol Inform. 2013 Mar;32(3):291-301. doi: 10.1002/minf.201200171. Epub 2013 Mar 8.
8
Visual Analysis of Biological Activity Data with Scaffold Hunter.使用Scaffold Hunter对生物活性数据进行可视化分析。
Mol Inform. 2013 Dec;32(11-12):964-75. doi: 10.1002/minf.201300087. Epub 2013 Sep 9.
9
AMBIT-SMARTS: Efficient Searching of Chemical Structures and Fragments.AMBIT-SMARTS:高效的化学结构和片段搜索。
Mol Inform. 2011 Aug;30(8):707-20. doi: 10.1002/minf.201100028. Epub 2011 Aug 4.
10
Comparing structural fingerprints using a literature-based similarity benchmark.使用基于文献的相似性基准比较结构指纹。
J Cheminform. 2016 Jul 5;8:36. doi: 10.1186/s13321-016-0148-0. eCollection 2016.