Suppr超能文献

大简化分子线性输入规范(BigSMILES):一种用于描述大分子的基于结构的线性符号表示法。

BigSMILES: A Structurally-Based Line Notation for Describing Macromolecules.

作者信息

Lin Tzyy-Shyang, Coley Connor W, Mochigase Hidenobu, Beech Haley K, Wang Wencong, Wang Zi, Woods Eliot, Craig Stephen L, Johnson Jeremiah A, Kalow Julia A, Jensen Klavs F, Olsen Bradley D

机构信息

Department of Chemical Engineering and Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States.

Department of Chemistry, Duke University, Durham, North Carolina 27708, United States.

出版信息

ACS Cent Sci. 2019 Sep 25;5(9):1523-1531. doi: 10.1021/acscentsci.9b00476. Epub 2019 Sep 12.

Abstract

Having a compact yet robust structurally based identifier or representation system is a key enabling factor for efficient sharing and dissemination of research results within the chemistry community, and such systems lay down the essential foundations for future informatics and data-driven research. While substantial advances have been made for small molecules, the polymer community has struggled in coming up with an efficient representation system. This is because, unlike other disciplines in chemistry, the basic premise that each distinct chemical species corresponds to a well-defined chemical structure does not hold for polymers. Polymers are intrinsically stochastic molecules that are often ensembles with a distribution of chemical structures. This difficulty limits the applicability of all deterministic representations developed for small molecules. In this work, a new representation system that is capable of handling the stochastic nature of polymers is proposed. The new system is based on the popular "simplified molecular-input line-entry system" (SMILES), and it aims to provide representations that can be used as indexing identifiers for entries in polymer databases. As a pilot test, the entries of the standard data set of the glass transition temperature of linear polymers (Bicerano, 2002) were converted into the new BigSMILES language. Furthermore, it is hoped that the proposed system will provide a more effective language for communication within the polymer community and increase cohesion between the researchers within the community.

摘要

拥有一个紧凑而强大的基于结构的标识符或表示系统,是化学界高效共享和传播研究成果的关键推动因素,此类系统为未来的信息学和数据驱动研究奠定了重要基础。虽然小分子领域已取得重大进展,但聚合物领域在提出一个高效的表示系统方面却面临困难。这是因为,与化学中的其他学科不同,聚合物并不符合每个独特化学物种都对应一个明确化学结构的基本前提。聚合物本质上是随机分子,通常是具有化学结构分布的集合体。这一困难限制了为小分子开发的所有确定性表示方法的适用性。在这项工作中,提出了一种能够处理聚合物随机性质的新表示系统。新系统基于流行的“简化分子输入线性输入系统”(SMILES),旨在提供可作为聚合物数据库条目的索引标识符的表示方法。作为一项试点测试,线性聚合物玻璃化转变温度标准数据集(Bicerano,2002)的条目被转换为新的BigSMILES语言。此外,希望所提出的系统将为聚合物领域内的交流提供一种更有效的语言,并增强该领域内研究人员之间的凝聚力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2afc/6764162/17485bc23609/oc9b00476_0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验