Suppr超能文献

建立研发用化学物质登记制度。

Building an R&D chemical registration system.

机构信息

Philip Morris International R&D, Philip Morris Products S,A, Neuchâtel, Switzerland.

出版信息

J Cheminform. 2012 May 31;4(1):11. doi: 10.1186/1758-2946-4-11.

Abstract

Small molecule chemistry is of central importance to a number of R&D companies in diverse areas such as the pharmaceutical, nutraceutical, food flavoring, and cosmeceutical industries. In order to store and manage thousands of chemical compounds in such an environment, we have built a state-of-the-art master chemical database with unique structure identifiers. Here, we present the concept and methodology we used to build the system that we call the Unique Compound Database (UCD). In the UCD, each molecule is registered only once (uniqueness), structures with alternative representations are entered in a uniform way (normalization), and the chemical structure drawings are recognizable to chemists and to a cartridge. In brief, structural molecules are entered as neutral entities which can be associated with a salt. The salts are listed in a dictionary and bound to the molecule with the appropriate stoichiometric coefficient in an entity called "substance". The substances are associated with batches. Once a molecule is registered, some properties (e.g., ADMET prediction, IUPAC name, chemical properties) are calculated automatically. The UCD has both automated and manual data controls. Moreover, the UCD concept enables the management of user errors in the structure entry by reassigning or archiving the batches. It also allows updating of the records to include newly discovered properties of individual structures. As our research spans a wide variety of scientific fields, the database enables registration of mixtures of compounds, enantiomers, tautomers, and compounds with unknown stereochemistries.

摘要

小分子化学在医药、营养保健品、食品调味剂和化妆品等多个研发领域的公司中具有核心重要性。为了在这样的环境中存储和管理数千种化合物,我们构建了一个具有独特结构标识符的最先进的主化学数据库。在这里,我们介绍了构建我们称之为“独特化合物数据库”(UCD)的系统的概念和方法。在 UCD 中,每个分子仅注册一次(唯一性),以统一的方式输入具有替代表示形式的结构(归一化),并且化学结构绘图可以被化学家识别,也可以被墨盒识别。简而言之,结构分子以中性实体的形式输入,可以与盐相关联。盐列在字典中,并与具有适当化学计量系数的分子绑定在称为“物质”的实体中。物质与批次相关联。一旦分子被注册,一些属性(例如 ADMET 预测、IUPAC 名称、化学性质)将自动计算。UCD 具有自动和手动数据控制。此外,UCD 概念允许通过重新分配或存档批次来管理结构输入中的用户错误。它还允许更新记录,以包括个别结构新发现的属性。由于我们的研究涵盖了广泛的科学领域,因此该数据库允许注册化合物的混合物、对映异构体、互变异构体以及立体化学未知的化合物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2220/3430593/d8ddb6a30e86/1758-2946-4-11-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验