• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种识别有机分子中官能团的算法。

An algorithm to identify functional groups in organic molecules.

作者信息

Ertl Peter

机构信息

Novartis Institutes for BioMedical Research, 4056, Basel, Switzerland.

出版信息

J Cheminform. 2017 Jun 7;9(1):36. doi: 10.1186/s13321-017-0225-z.

DOI:10.1186/s13321-017-0225-z
PMID:29086048
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5462667/
Abstract

BACKGROUND

The concept of functional groups forms a basis of organic chemistry, medicinal chemistry, toxicity assessment, spectroscopy and also chemical nomenclature. All current software systems to identify functional groups are based on a predefined list of substructures. We are not aware of any program that can identify all functional groups in a molecule automatically. The algorithm presented in this article is an attempt to solve this scientific challenge.

RESULTS

An algorithm to identify functional groups in a molecule based on iterative marching through its atoms is described. The procedure is illustrated by extracting functional groups from the bioactive portion of the ChEMBL database, resulting in identification of 3080 unique functional groups.

CONCLUSIONS

A new algorithm to identify all functional groups in organic molecules is presented. The algorithm is relatively simple and full details with examples are provided, therefore implementation in any cheminformatics toolkit should be relatively easy. The new method allows the analysis of functional groups in large chemical databases in a way that was not possible using previous approaches. Graphical abstract .

摘要

背景

官能团的概念构成了有机化学、药物化学、毒性评估、光谱学以及化学命名法的基础。当前所有用于识别官能团的软件系统均基于预定义的子结构列表。我们不知道有任何程序能够自动识别分子中的所有官能团。本文提出的算法旨在尝试解决这一科学挑战。

结果

描述了一种基于对分子中的原子进行迭代遍历以识别官能团的算法。通过从ChEMBL数据库的生物活性部分提取官能团对该过程进行了说明,从而识别出3080个独特的官能团。

结论

提出了一种用于识别有机分子中所有官能团的新算法。该算法相对简单,并提供了带有示例的完整细节,因此在任何化学信息学工具包中的实现应该相对容易。这种新方法能够以以前的方法无法实现的方式对大型化学数据库中的官能团进行分析。图形摘要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/914aa103a5cc/13321_2017_225_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/df948f74f726/13321_2017_225_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/ddf11d08701f/13321_2017_225_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/e94287d9b020/13321_2017_225_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/678acfc89696/13321_2017_225_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/914aa103a5cc/13321_2017_225_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/df948f74f726/13321_2017_225_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/ddf11d08701f/13321_2017_225_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/e94287d9b020/13321_2017_225_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/678acfc89696/13321_2017_225_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3c/5462667/914aa103a5cc/13321_2017_225_Fig4_HTML.jpg

相似文献

1
An algorithm to identify functional groups in organic molecules.一种识别有机分子中官能团的算法。
J Cheminform. 2017 Jun 7;9(1):36. doi: 10.1186/s13321-017-0225-z.
2
Get Your Atoms in Order--An Open-Source Implementation of a Novel and Robust Molecular Canonicalization Algorithm.整理原子顺序——一种新颖且强大的分子正则化算法的开源实现。
J Chem Inf Model. 2015 Oct 26;55(10):2111-20. doi: 10.1021/acs.jcim.5b00543. Epub 2015 Oct 15.
3
rdScaffoldNetwork: The Scaffold Network Implementation in RDKit.rdScaffoldNetwork:RDKit 中的支架网络实现。
J Chem Inf Model. 2020 Jul 27;60(7):3331-3335. doi: 10.1021/acs.jcim.0c00296. Epub 2020 Jul 7.
4
ErtlFunctionalGroupsFinder: automated rule-based functional group detection with the Chemistry Development Kit (CDK).厄特尔官能团查找器:使用化学开发工具包(CDK)进行基于规则的自动官能团检测。
J Cheminform. 2019 Jun 4;11(1):37. doi: 10.1186/s13321-019-0361-8.
5
ChEMBL-Likeness Score and Database GDBChEMBL.ChEMBL类药性评分与数据库GDBChEMBL
Front Chem. 2020 Feb 4;8:46. doi: 10.3389/fchem.2020.00046. eCollection 2020.
6
Automatic extraction of candidate nomenclature terms using the doublet method.使用双重方法自动提取候选命名术语。
BMC Med Inform Decis Mak. 2005 Oct 18;5:35. doi: 10.1186/1472-6947-5-35.
7
ChEMBL web services: streamlining access to drug discovery data and utilities.ChEMBL网络服务:简化对药物发现数据和实用工具的访问
Nucleic Acids Res. 2015 Jul 1;43(W1):W612-20. doi: 10.1093/nar/gkv352. Epub 2015 Apr 16.
8
The Molecule Cloud - compact visualization of large collections of molecules.分子云 - 大规模分子集合的紧凑可视化。
J Cheminform. 2012 Jul 6;4(1):12. doi: 10.1186/1758-2946-4-12.
9
Flexible heuristic algorithm for automatic molecule fragmentation: application to the UNIFAC group contribution model.用于自动分子碎片化的灵活启发式算法:在UNIFAC基团贡献模型中的应用
J Cheminform. 2019 Aug 20;11(1):57. doi: 10.1186/s13321-019-0382-3.
10
The development of an artificial organic networks toolkit for LabVIEW.用于LabVIEW的人工有机网络工具包的开发。
J Comput Chem. 2015 Mar 15;36(7):478-92. doi: 10.1002/jcc.23818. Epub 2015 Jan 6.

引用本文的文献

1
Dehydroxy-Perfluoro-tert-butylation of alcohols with PhSOC(CF) and insights into the structure of the [C(CF)] anion.醇与PhSOC(CF)的脱羟基全氟叔丁基化反应及对[C(CF)]阴离子结构的见解
Nat Commun. 2025 Aug 26;16(1):7969. doi: 10.1038/s41467-025-63180-0.
2
Aryl Acid-Alcohol Cross-Coupling: C(sp)-C(sp) Bond Formation from Nontraditional Precursors.芳基酸-醇交叉偶联反应:通过非传统前体形成C(sp)-C(sp)键
J Am Chem Soc. 2025 May 7;147(18):14905-14914. doi: 10.1021/jacs.4c15827. Epub 2025 Apr 23.
3
Benzene-1,3-disulfonyl fluoride and Benzene-1,3,5-trisulfonyl fluoride: Low-Cost, Stable, and Selective Reagents for SuFEx-Driven Deoxyazidation.

本文引用的文献

1
ClassyFire: automated chemical classification with a comprehensive, computable taxonomy.ClassyFire:基于全面、可计算分类法的自动化化学分类
J Cheminform. 2016 Nov 4;8:61. doi: 10.1186/s13321-016-0174-y. eCollection 2016.
2
Extended Functional Groups (EFG): An Efficient Set for Chemical Characterization and Structure-Activity Relationship Studies of Chemical Compounds.扩展官能团(EFG):用于化合物化学表征和构效关系研究的有效集合。
Molecules. 2015 Dec 23;21(1):E1. doi: 10.3390/molecules21010001.
3
ZINC 15--Ligand Discovery for Everyone.
苯-1,3-二磺酰氟和苯-1,3,5-三磺酰氟:用于SuFEx驱动的脱氧叠氮化反应的低成本、稳定且选择性的试剂。
Adv Synth Catal. 2024 Nov 5;366(21):4470-4477. doi: 10.1002/adsc.202400680. Epub 2024 Jul 23.
4
COCONUT 2.0: a comprehensive overhaul and curation of the collection of open natural products database.COCONUT 2.0:开放天然产物数据库集合的全面修订与管理
Nucleic Acids Res. 2025 Jan 6;53(D1):D634-D643. doi: 10.1093/nar/gkae1063.
5
Direct synthesis of dialkyl ketones from deoxygenative cross-coupling of carboxylic acids and alcohols.通过羧酸与醇的脱氧交叉偶联直接合成二烷基酮。
Chem Sci. 2024 Oct 8;15(44):18405-10. doi: 10.1039/d4sc05420b.
6
C-H Labeling with [F]Fluoride: An Emerging Methodology in Radiochemistry.用[F]氟化物进行碳-氢标记:放射化学中的一种新兴方法。
ACS Cent Sci. 2024 Aug 23;10(9):1674-1688. doi: 10.1021/acscentsci.4c00997. eCollection 2024 Sep 25.
7
Quantum Chemistry Dataset with Ground- and Excited-state Properties of 450 Kilo Molecules.包含45万个分子基态和激发态性质的量子化学数据集。
Sci Data. 2024 Aug 29;11(1):948. doi: 10.1038/s41597-024-03788-x.
8
Development of bedaquiline nanoemulsions intended for paediatric multidrug-resistant tuberculosis: excipient selection and preformulation studies.用于儿童耐多药结核病的贝达喹啉纳米乳剂的研发:辅料选择与处方前研究
Front Med Technol. 2024 Jun 10;6:1388113. doi: 10.3389/fmedt.2024.1388113. eCollection 2024.
9
Evaluation of functional group compatibility and development of reaction-accelerating additives in ammonium salt-accelerated hydrazinolysis of amides.酰胺铵盐加速肼解中官能团兼容性的评估及反应加速添加剂的开发
Front Chem. 2024 May 22;12:1378746. doi: 10.3389/fchem.2024.1378746. eCollection 2024.
10
Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices.反应洞察:使用键电子矩阵进行快速化学反应分析。
J Cheminform. 2024 Mar 29;16(1):37. doi: 10.1186/s13321-024-00834-z.
锌15——面向大众的配体发现平台。
J Chem Inf Model. 2015 Nov 23;55(11):2324-37. doi: 10.1021/acs.jcim.5b00559. Epub 2015 Nov 9.
4
New publicly available chemical query language, CSRML, to support chemotype representations for application to data mining and modeling.新的公开可用化学查询语言 CSRML,用于支持化学型表示,以应用于数据挖掘和建模。
J Chem Inf Model. 2015 Mar 23;55(3):510-28. doi: 10.1021/ci500667v. Epub 2015 Feb 19.
5
Automated compound classification using a chemical ontology.基于化学本体的化合物自动分类。
J Cheminform. 2012 Dec 29;4(1):40. doi: 10.1186/1758-2946-4-40.
6
Rules for identifying potentially reactive or promiscuous compounds.潜在反应性或混杂化合物的鉴定规则。
J Med Chem. 2012 Nov 26;55(22):9763-72. doi: 10.1021/jm301008n. Epub 2012 Oct 25.
7
The Molecule Cloud - compact visualization of large collections of molecules.分子云 - 大规模分子集合的紧凑可视化。
J Cheminform. 2012 Jul 6;4(1):12. doi: 10.1186/1758-2946-4-12.
8
ChEMBL: a large-scale bioactivity database for drug discovery.ChEMBL:用于药物发现的大型生物活性数据库。
Nucleic Acids Res. 2012 Jan;40(Database issue):D1100-7. doi: 10.1093/nar/gkr777. Epub 2011 Sep 23.
9
New substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays.新型亚结构筛选器,可用于从筛选库中去除泛分析干扰化合物(PAINS),并在生物测定中对其进行排除。
J Med Chem. 2010 Apr 8;53(7):2719-40. doi: 10.1021/jm901137j.
10
Quest for the rings. In silico exploration of ring universe to identify novel bioactive heteroaromatic scaffolds.寻找环状化合物。通过计算机模拟探索环状化合物领域以识别新型生物活性杂芳族骨架。
J Med Chem. 2006 Jul 27;49(15):4568-73. doi: 10.1021/jm060217p.