• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多语法:数字聚合物表示与生成的语法

Polygrammar: Grammar for Digital Polymer Representation and Generation.

作者信息

Guo Minghao, Shou Wan, Makatura Liane, Erps Timothy, Foshey Michael, Matusik Wojciech

机构信息

Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.

CUHK Multimedia Lab, The Chinese University of Hong Kong, Sha Tin, Hong Kong.

出版信息

Adv Sci (Weinh). 2022 Aug;9(23):e2101864. doi: 10.1002/advs.202101864. Epub 2022 Jun 9.

DOI:10.1002/advs.202101864
PMID:35678650
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9376847/
Abstract

Polymers are widely studied materials with diverse properties and applications determined by molecular structures. It is essential to represent these structures clearly and explore the full space of achievable chemical designs. However, existing approaches cannot offer comprehensive design models for polymers because of their inherent scale and structural complexity. Here, a parametric, context-sensitive grammar designed specifically for polymers (PolyGrammar) is proposed. Using the symbolic hypergraph representation and 14 simple production rules, PolyGrammar can represent and generate all valid polyurethane structures. An algorithm is presented to translate any polyurethane structure from the popular Simplified Molecular-Input Line-entry System (SMILES) string format into the PolyGrammar representation. The representative power of PolyGrammar is tested by translating a dataset of over 600 polyurethane samples collected from the literature. Furthermore, it is shown that PolyGrammar can be easily extended to other copolymers and homopolymers. By offering a complete, explicit representation scheme and an explainable generative model with validity guarantees, PolyGrammar takes an essential step toward a more comprehensive and practical system for polymer discovery and exploration. As the first bridge between formal languages and chemistry, PolyGrammar also serves as a critical blueprint to inform the design of similar grammars for other chemistries, including organic and inorganic molecules.

摘要

聚合物是一类被广泛研究的材料,其具有由分子结构决定的多样性质和应用。清晰地表示这些结构并探索可实现的化学设计的完整空间至关重要。然而,由于其固有的尺度和结构复杂性,现有方法无法为聚合物提供全面的设计模型。在此,提出了一种专门为聚合物设计的参数化、上下文敏感语法(PolyGrammar)。使用符号超图表示和14条简单的产生式规则,PolyGrammar可以表示并生成所有有效的聚氨酯结构。提出了一种算法,用于将来自流行的简化分子输入线性输入系统(SMILES)字符串格式的任何聚氨酯结构转换为PolyGrammar表示。通过翻译从文献中收集的600多个聚氨酯样品的数据集来测试PolyGrammar的代表性能力。此外,还表明PolyGrammar可以轻松扩展到其他共聚物和均聚物。通过提供一个完整、明确的表示方案以及一个具有有效性保证的可解释生成模型,PolyGrammar朝着建立一个更全面、实用的聚合物发现和探索系统迈出了重要一步。作为形式语言与化学之间的第一座桥梁,PolyGrammar也为设计包括有机和无机分子在内的其他化学物质的类似语法提供了关键蓝图。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/b692035c096e/ADVS-9-2101864-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/b654d324ea3b/ADVS-9-2101864-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/081c4860cfc2/ADVS-9-2101864-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/91996182788f/ADVS-9-2101864-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/29f28c1fbe18/ADVS-9-2101864-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/2f55449603e5/ADVS-9-2101864-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/cbcd999b0407/ADVS-9-2101864-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/407b181d2cd6/ADVS-9-2101864-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/cf733fba1bce/ADVS-9-2101864-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/b692035c096e/ADVS-9-2101864-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/b654d324ea3b/ADVS-9-2101864-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/081c4860cfc2/ADVS-9-2101864-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/91996182788f/ADVS-9-2101864-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/29f28c1fbe18/ADVS-9-2101864-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/2f55449603e5/ADVS-9-2101864-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/cbcd999b0407/ADVS-9-2101864-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/407b181d2cd6/ADVS-9-2101864-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/cf733fba1bce/ADVS-9-2101864-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d34/9376847/b692035c096e/ADVS-9-2101864-g010.jpg

相似文献

1
Polygrammar: Grammar for Digital Polymer Representation and Generation.多语法:数字聚合物表示与生成的语法
Adv Sci (Weinh). 2022 Aug;9(23):e2101864. doi: 10.1002/advs.202101864. Epub 2022 Jun 9.
2
Planning Implications Related to Sterilization-Sensitive Science Investigations Associated with Mars Sample Return (MSR).与火星样本返回(MSR)相关的对灭菌敏感的科学研究的规划意义。
Astrobiology. 2022 Jun;22(S1):S112-S164. doi: 10.1089/AST.2021.0113. Epub 2022 May 19.
3
An invertible, invariant crystal representation for inverse design of solid-state materials using generative deep learning.一种用于固态材料逆设计的可逆、不变晶体表示,采用生成式深度学习。
Nat Commun. 2023 Nov 2;14(1):7027. doi: 10.1038/s41467-023-42870-7.
4
SELFIES and the future of molecular string representations.自拍与分子串表示法的未来。
Patterns (N Y). 2022 Oct 14;3(10):100588. doi: 10.1016/j.patter.2022.100588.
5
The language of RNA: a formal grammar that includes pseudoknots.RNA的语言:一种包含假结的形式语法。
Bioinformatics. 2000 Apr;16(4):334-40. doi: 10.1093/bioinformatics/16.4.334.
6
Transfer Learned Designer Polymers For Organic Solar Cells.用于有机太阳能电池的迁移学习设计聚合物。
J Chem Inf Model. 2021 Jan 25;61(1):134-142. doi: 10.1021/acs.jcim.0c01157. Epub 2021 Jan 7.
7
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
8
Fast and Space-Efficient Construction of AVL Grammars from the LZ77 Parsing.基于LZ77解析的AVL语法的快速且空间高效构建。
Lebniz Int Proc Inform. 2021;204. doi: 10.4230/LIPIcs.ESA.2021.56.
9
Randomized SMILES strings improve the quality of molecular generative models.随机化的SMILES字符串提高了分子生成模型的质量。
J Cheminform. 2019 Nov 21;11(1):71. doi: 10.1186/s13321-019-0393-0.
10
Exploration of Chemical Space Guided by PixelCNN for Fragment-Based De Novo Drug Discovery.基于PixelCNN的化学空间探索用于基于片段的从头药物发现
J Chem Inf Model. 2022 Dec 12;62(23):5988-6001. doi: 10.1021/acs.jcim.2c01345. Epub 2022 Dec 1.

引用本文的文献

1
Machine Learning in Polymer Research.聚合物研究中的机器学习
Adv Mater. 2025 Mar;37(11):e2413695. doi: 10.1002/adma.202413695. Epub 2025 Feb 9.
2
Inverse design of copolymers including stoichiometry and chain architecture.共聚物的逆设计,包括化学计量和链结构。
Chem Sci. 2024 Dec 17;16(3):1161-1178. doi: 10.1039/d4sc05900j. eCollection 2025 Jan 15.
3
Calculating Pairwise Similarity of Polymer Ensembles via Earth Mover's Distance.通过推土机距离计算聚合物集合的成对相似性。

本文引用的文献

1
Mol-CycleGAN: a generative model for molecular optimization.Mol-CycleGAN:一种用于分子优化的生成模型。
J Cheminform. 2020 Jan 8;12(1):2. doi: 10.1186/s13321-019-0404-1.
2
ChemOS: An orchestration software to democratize autonomous discovery.ChemOS:一个使自主发现民主化的编排软件。
PLoS One. 2020 Apr 16;15(4):e0229862. doi: 10.1371/journal.pone.0229862. eCollection 2020.
3
Smart multifunctional polyurethane microcapsules for the quick release of anticancer drugs in BGC 823 and HeLa tumor cells.用于在BGC 823和HeLa肿瘤细胞中快速释放抗癌药物的智能多功能聚氨酯微胶囊。
ACS Polym Au. 2024 Jan 10;4(1):66-76. doi: 10.1021/acspolymersau.3c00029. eCollection 2024 Feb 14.
4
Navigating the Expansive Landscapes of Soft Materials: A User Guide for High-Throughput Workflows.探索软材料的广阔领域:高通量工作流程用户指南
ACS Polym Au. 2023 Dec 5;3(6):406-427. doi: 10.1021/acspolymersau.3c00025. eCollection 2023 Dec 13.
5
Emerging Trends in Machine Learning: A Polymer Perspective.机器学习的新兴趋势:聚合物视角
ACS Polym Au. 2023 Jan 18;3(3):239-258. doi: 10.1021/acspolymersau.2c00053. eCollection 2023 Jun 14.
J Mater Chem B. 2017 Dec 28;5(48):9477-9481. doi: 10.1039/c7tb02570j. Epub 2017 Nov 30.
4
De novo generation of hit-like molecules from gene expression signatures using artificial intelligence.利用人工智能从基因表达特征生成类似命中的新分子。
Nat Commun. 2020 Jan 3;11(1):10. doi: 10.1038/s41467-019-13807-w.
5
BigSMILES: A Structurally-Based Line Notation for Describing Macromolecules.大简化分子线性输入规范(BigSMILES):一种用于描述大分子的基于结构的线性符号表示法。
ACS Cent Sci. 2019 Sep 25;5(9):1523-1531. doi: 10.1021/acscentsci.9b00476. Epub 2019 Sep 12.
6
Optimization of Molecules via Deep Reinforcement Learning.通过深度强化学习优化分子。
Sci Rep. 2019 Jul 24;9(1):10752. doi: 10.1038/s41598-019-47148-x.
7
Learning Retrosynthetic Planning through Simulated Experience.通过模拟经验学习逆合成规划。
ACS Cent Sci. 2019 Jun 26;5(6):970-981. doi: 10.1021/acscentsci.9b00055. Epub 2019 May 31.
8
Learning continuous and data-driven molecular descriptors by translating equivalent chemical representations.通过转换等效化学表示来学习连续且数据驱动的分子描述符。
Chem Sci. 2018 Nov 19;10(6):1692-1701. doi: 10.1039/c8sc04175j. eCollection 2019 Feb 14.
9
A graph-convolutional neural network model for the prediction of chemical reactivity.一种用于预测化学反应性的图卷积神经网络模型。
Chem Sci. 2018 Nov 26;10(2):370-377. doi: 10.1039/c8sc04228d. eCollection 2019 Jan 14.
10
Using Machine Learning To Predict Suitable Conditions for Organic Reactions.使用机器学习预测有机反应的合适条件。
ACS Cent Sci. 2018 Nov 28;4(11):1465-1476. doi: 10.1021/acscentsci.8b00357. Epub 2018 Nov 16.