Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, 3012 Bern, Switzerland.
J Cheminform. 2014 May 22;6:27. doi: 10.1186/1758-2946-6-27. eCollection 2014.
The properties of fragrance molecules in the public databases SuperScent and Flavornet were analyzed to define a "fragrance-like" (FL) property range (Heavy Atom Count ≤ 21, only C, H, O, S, (O + S) ≤ 3, Hydrogen Bond Donor ≤ 1) and the corresponding chemical space including FL molecules from PubChem (NIH repository of molecules), ChEMBL (bioactive molecules), ZINC (drug-like molecules), and GDB-13 (all possible organic molecules up to 13 atoms of C, N, O, S, Cl). The FL subsets of these databases were classified by MQN (Molecular Quantum Numbers, a set of 42 integer value descriptors of molecular structure) and formatted for fast MQN-similarity searching and interactive exploration of color-coded principal component maps in form of the FL-mapplet and FL-browser applications freely available at http://www.gdb.unibe.ch. MQN-similarity is shown to efficiently recover 15 different fragrance molecule families from the different FL subsets, demonstrating the relevance of the MQN-based tool to explore the fragrance chemical space.
分析公共数据库 SuperScent 和 Flavornet 中香气分子的性质,定义了一个“类似香气”(FL)的性质范围(重原子数≤21,仅 C、H、O、S、(O+S)≤3,氢键供体≤1)和相应的化学空间,其中包括来自 PubChem(NIH 分子库)、ChEMBL(生物活性分子)、ZINC(类药物分子)和 GDB-13(最多可达 13 个 C、N、O、S、Cl 原子的所有可能有机分子)的 FL 分子。这些数据库的 FL 子集通过 MQN(分子量子数,分子结构的一组 42 个整数值描述符)进行分类,并以 FL-mapplet 和 FL-browser 应用程序的形式格式化,以便快速进行 MQN 相似性搜索和交互式探索彩色主成分映射,这些应用程序可在 http://www.gdb.unibe.ch 上免费获得。MQN 相似性有效地从不同的 FL 子集中恢复了 15 种不同的香料分子家族,证明了基于 MQN 的工具在探索香料化学空间方面的相关性。