• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从微笑到单体:聚合物化学结构与生物结构之间的联系

Smiles2Monomers: a link between chemical and biological structures for polymers.

作者信息

Dufresne Yoann, Noé Laurent, Leclère Valérie, Pupin Maude

机构信息

Univ. Lille, CNRS, Centrale Lille, UMR 9189-CRIStAL-Centre de Recherche en Informatique Signal et Automatique de Lille, 59000 Lille, France ; Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40 avenue Halley, 59650 Villeneuve d'Ascq, France.

Univ. Lille, CNRS, Centrale Lille, UMR 9189-CRIStAL-Centre de Recherche en Informatique Signal et Automatique de Lille, 59000 Lille, France ; Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40 avenue Halley, 59650 Villeneuve d'Ascq, France ; Univ. Lille, INRA, ISA, Univ. Artois, Univ. Littoral Côte d'Opale, EA 7394 - ICV - Institut Charles Viollette, 59000 Lille, France.

出版信息

J Cheminform. 2015 Dec 29;7:62. doi: 10.1186/s13321-015-0111-5. eCollection 2015.

DOI:10.1186/s13321-015-0111-5
PMID:26715946
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4693424/
Abstract

BACKGROUND

The monomeric composition of polymers is powerful for structure comparison and synthetic biology, among others. Many databases give access to the atomic structure of compounds but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the tool Smiles2Monomers (s2m), to infer efficiently and accurately the monomeric structure of a polymer from its chemical structure.

RESULTS

Our strategy is divided into two steps: first, monomers are mapped on the atomic structure by an efficient subgraph-isomorphism algorithm ; second, the best tiling is computed so that non-overlapping monomers cover all the structure of the target polymer. The mapping is based on a Markovian index built by a dynamic programming algorithm. The index enables s2m to search quickly all the given monomers on a target polymer. After, a greedy algorithm combines the mapped monomers into a consistent monomeric structure. Finally, a local branch and cut algorithm refines the structure. We tested this method on two manually annotated databases of polymers and reconstructed the structures de novo with a sensitivity over 90 %. The average computation time per polymer is 2 s.

CONCLUSION

s2m automatically creates de novo monomeric annotations for polymers, efficiently in terms of time computation and sensitivity. s2m allowed us to detect annotation errors in the tested databases and to easily find the accurate structures. So, s2m could be integrated into the curation process of databases of small compounds to verify the current entries and accelerate the annotation of new polymers. The full method can be downloaded or accessed via a website for peptide-like polymers at http://bioinfo.lifl.fr/norine/smiles2monomers.jsp.Graphical abstract:.

摘要

背景

聚合物的单体组成对于结构比较和合成生物学等领域具有重要意义。许多数据库提供化合物的原子结构,但聚合物的单体结构往往缺失。我们设计了一种智能算法,并在工具Smiles2Monomers(s2m)中实现,以便从聚合物的化学结构高效且准确地推断其单体结构。

结果

我们的策略分为两个步骤:首先,通过高效的子图同构算法将单体映射到原子结构上;其次,计算最佳平铺方式,使不重叠的单体覆盖目标聚合物的所有结构。映射基于动态规划算法构建的马尔可夫指数。该指数使s2m能够在目标聚合物上快速搜索所有给定的单体。之后,贪心算法将映射的单体组合成一致的单体结构。最后,局部分支定界算法对结构进行优化。我们在两个手动注释的聚合物数据库上测试了该方法,从头重建结构的灵敏度超过90%。每个聚合物的平均计算时间为2秒。

结论

s2m能自动为聚合物从头创建单体注释,在计算时间和灵敏度方面都很高效。s2m使我们能够检测测试数据库中的注释错误,并轻松找到准确的结构。因此,s2m可集成到小分子化合物数据库的管理过程中,以验证当前条目并加速新聚合物的注释。完整方法可通过http://bioinfo.lifl.fr/norine/smiles2monomers.jsp网站下载或访问,该网站用于类肽聚合物。图形摘要:.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/0bb7c947eec6/13321_2015_111_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/1a25fc4a2f61/13321_2015_111_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/1ca1c26537d8/13321_2015_111_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/7f6cf06b39ef/13321_2015_111_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/5070b172d2c9/13321_2015_111_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/7e91834a0c86/13321_2015_111_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/ce629bc0e3de/13321_2015_111_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/52355b5c9d3b/13321_2015_111_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/0bb7c947eec6/13321_2015_111_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/1a25fc4a2f61/13321_2015_111_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/1ca1c26537d8/13321_2015_111_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/7f6cf06b39ef/13321_2015_111_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/5070b172d2c9/13321_2015_111_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/7e91834a0c86/13321_2015_111_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/ce629bc0e3de/13321_2015_111_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/52355b5c9d3b/13321_2015_111_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87f5/4693424/0bb7c947eec6/13321_2015_111_Fig7_HTML.jpg

相似文献

1
Smiles2Monomers: a link between chemical and biological structures for polymers.从微笑到单体:聚合物化学结构与生物结构之间的联系
J Cheminform. 2015 Dec 29;7:62. doi: 10.1186/s13321-015-0111-5. eCollection 2015.
2
rBAN: retro-biosynthetic analysis of nonribosomal peptides.rBAN:非核糖体肽的逆向生物合成分析
J Cheminform. 2019 Feb 8;11(1):13. doi: 10.1186/s13321-019-0335-x.
3
BigSMARTS: A Topologically Aware Query Language and Substructure Search Algorithm for Polymer Chemical Structures.BigSMARTS:一种用于聚合物化学结构的拓扑感知查询语言和子结构搜索算法
J Chem Inf Model. 2023 Nov 13;63(21):6555-6568. doi: 10.1021/acs.jcim.3c00978. Epub 2023 Oct 24.
4
Norine: update of the nonribosomal peptide resource.诺琳:非核糖体肽资源的更新。
Nucleic Acids Res. 2020 Jan 8;48(D1):D465-D469. doi: 10.1093/nar/gkz1000.
5
Structural pattern matching of nonribosomal peptides.非核糖体肽的结构模式匹配
BMC Struct Biol. 2009 Mar 18;9:15. doi: 10.1186/1472-6807-9-15.
6
Norine: A powerful resource for novel nonribosomal peptide discovery.诺琳:新型非核糖体肽发现的强大资源。
Synth Syst Biotechnol. 2016 Jun 1;1(2):89-94. doi: 10.1016/j.synbio.2015.11.001. eCollection 2016 Jun.
7
[A retrieval method of drug molecules based on graph collapsing].基于图折叠的药物分子检索方法
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):368-374.
8
Norine, the knowledgebase dedicated to non-ribosomal peptides, is now open to crowdsourcing.诺琳(Norine)是一个致力于非核糖体肽的知识库,现在它向众包开放。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1113-8. doi: 10.1093/nar/gkv1143. Epub 2015 Nov 2.
9
Searching for pharmacophoric patterns in databases of three-dimensional chemical structures.在三维化学结构数据库中搜索药效团模式。
J Mol Recognit. 1995 Sep-Oct;8(5):290-303. doi: 10.1002/jmr.300080503.
10
Efficient enumeration of stereoisomers of outerplanar chemical graphs using dynamic programming.使用动态规划高效枚举外轨化学图的立体异构体。
J Chem Inf Model. 2011 Nov 28;51(11):2788-807. doi: 10.1021/ci200084b. Epub 2011 Oct 25.

引用本文的文献

1
cyclicpeptide: a Python package for cyclic peptide drug design.环肽:用于环肽药物设计的Python软件包。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae714.
2
Norine: Bioinformatics Methods and Tools for the Characterization of Newly Discovered Nonribosomal Peptides.诺琳:用于鉴定新发现的非核糖体肽的生物信息学方法与工具。
Methods Mol Biol. 2023;2670:303-318. doi: 10.1007/978-1-0716-3214-7_16.
3
How to Completely Squeeze a Fungus-Advanced Genome Mining Tools for Novel Bioactive Substances.如何彻底挖掘真菌——用于新型生物活性物质的先进基因组挖掘工具

本文引用的文献

1
The chemical component dictionary: complete descriptions of constituent molecules in experimentally determined 3D macromolecules in the Protein Data Bank.化学成分词典:蛋白质数据库中通过实验确定的三维大分子中组成分子的完整描述。
Bioinformatics. 2015 Apr 15;31(8):1274-8. doi: 10.1093/bioinformatics/btu789. Epub 2014 Dec 2.
2
Small molecule annotation for the Protein Data Bank.蛋白质数据库的小分子注释。
Database (Oxford). 2014 Nov 25;2014:bau116. doi: 10.1093/database/bau116. Print 2014.
3
Evolutions in fragment-based drug design: the deconstruction-reconstruction approach.
Pharmaceutics. 2022 Aug 31;14(9):1837. doi: 10.3390/pharmaceutics14091837.
4
MassSpecBlocks: a web-based tool to create building blocks and sequences of nonribosomal peptides and polyketides for tandem mass spectra analysis.质谱分析模块:一种基于网络的工具,用于创建非核糖体肽和聚酮化合物的构建模块及序列,以进行串联质谱分析。
J Cheminform. 2021 Jul 7;13(1):51. doi: 10.1186/s13321-021-00530-2.
5
Norine: update of the nonribosomal peptide resource.诺琳:非核糖体肽资源的更新。
Nucleic Acids Res. 2020 Jan 8;48(D1):D465-D469. doi: 10.1093/nar/gkz1000.
6
rBAN: retro-biosynthetic analysis of nonribosomal peptides.rBAN:非核糖体肽的逆向生物合成分析
J Cheminform. 2019 Feb 8;11(1):13. doi: 10.1186/s13321-019-0335-x.
7
SPICES: a particle-based molecular structure line notation and support library for mesoscopic simulation.SPICES:一种用于介观模拟的基于粒子的分子结构线符号表示法及支持库。
J Cheminform. 2018 Aug 9;10(1):35. doi: 10.1186/s13321-018-0294-7.
8
Annotation of Peptide Structures Using SMILES and Other Chemical Codes-Practical Solutions.使用 SMILES 和其他化学代码注释肽结构-实用解决方案。
Molecules. 2017 Nov 27;22(12):2075. doi: 10.3390/molecules22122075.
9
Recent development of antiSMASH and other computational approaches to mine secondary metabolite biosynthetic gene clusters.抗 SMASH 及其他计算方法在挖掘次级代谢产物生物合成基因簇方面的最新进展。
Brief Bioinform. 2019 Jul 19;20(4):1103-1113. doi: 10.1093/bib/bbx146.
10
Norine: A powerful resource for novel nonribosomal peptide discovery.诺琳:新型非核糖体肽发现的强大资源。
Synth Syst Biotechnol. 2016 Jun 1;1(2):89-94. doi: 10.1016/j.synbio.2015.11.001. eCollection 2016 Jun.
基于片段的药物设计进展:解构-重构方法
Drug Discov Today. 2015 Jan;20(1):105-13. doi: 10.1016/j.drudis.2014.09.015. Epub 2014 Sep 27.
4
HubAlign: an accurate and efficient method for global alignment of protein-protein interaction networks.HubAlign:一种用于蛋白质-蛋白质相互作用网络全局比对的准确且高效的方法。
Bioinformatics. 2014 Sep 1;30(17):i438-44. doi: 10.1093/bioinformatics/btu450.
5
molBLOCKS: decomposing small molecule sets and uncovering enriched fragments.molBLOCKS:分解小分子集并揭示富集片段。
Bioinformatics. 2014 Jul 15;30(14):2081-3. doi: 10.1093/bioinformatics/btu173. Epub 2014 Mar 28.
6
Prediction of new bioactive molecules using a Bayesian belief network.基于贝叶斯信念网络的新型生物活性分子预测。
J Chem Inf Model. 2014 Jan 27;54(1):30-6. doi: 10.1021/ci4004909. Epub 2014 Jan 15.
7
QSAR modeling: where have you been? Where are you going to?定量构效关系模型:你从何处来?你将往何处去?
J Med Chem. 2014 Jun 26;57(12):4977-5010. doi: 10.1021/jm4004285. Epub 2014 Jan 6.
8
Improving the representation of peptide-like inhibitor and antibiotic molecules in the Protein Data Bank.提高肽类抑制剂和抗生素分子在蛋白质数据库中的表示形式。
Biopolymers. 2014 Jun;101(6):659-68. doi: 10.1002/bip.22434.
9
Open Babel: An open chemical toolbox.Open Babel:一个开放的化学工具箱。
J Cheminform. 2011 Oct 7;3:33. doi: 10.1186/1758-2946-3-33.
10
Build-up algorithm for atomic correspondence between chemical structures.化学结构间原子对应关系的构建算法。
J Chem Inf Model. 2011 Aug 22;51(8):1775-87. doi: 10.1021/ci2001023. Epub 2011 Jul 18.