Chen Jonathan, Swamidass S Joshua, Dou Yimeng, Bruand Jocelyne, Baldi Pierre
Institute for Genomics and Bioinformatics, University of California, Irvine, USA.
Bioinformatics. 2005 Nov 15;21(22):4133-9. doi: 10.1093/bioinformatics/bti683. Epub 2005 Sep 20.
The development of chemoinformatics has been hampered by the lack of large, publicly available, comprehensive repositories of molecules, in particular of small molecules. Small molecules play a fundamental role in organic chemistry and biology. They can be used as combinatorial building blocks for chemical synthesis, as molecular probes in chemical genomics and systems biology, and for the screening and discovery of new drugs and other useful compounds.
We describe ChemDB, a public database of small molecules available on the Web. ChemDB is built using the digital catalogs of over a hundred vendors and other public sources and is annotated with information derived from these sources as well as from computational methods, such as predicted solubility and three-dimensional structure. It supports multiple molecular formats and is periodically updated, automatically whenever possible. The current version of the database contains approximately 4.1 million commercially available compounds and 8.2 million counting isomers. The database includes a user-friendly graphical interface, chemical reactions capabilities, as well as unique search capabilities.
Database and datasets are available on http://cdb.ics.uci.edu.
化学信息学的发展受到缺乏大型、公开可用的综合性分子库(尤其是小分子库)的阻碍。小分子在有机化学和生物学中起着基础性作用。它们可用作化学合成的组合构建模块、化学基因组学和系统生物学中的分子探针,以及用于新药和其他有用化合物的筛选与发现。
我们描述了ChemDB,一个可在网络上获取的小分子公共数据库。ChemDB是利用一百多家供应商的数字目录及其他公共资源构建而成,并使用来自这些资源以及计算方法(如预测溶解度和三维结构)的信息进行注释。它支持多种分子格式,并会定期更新,尽可能实现自动更新。该数据库的当前版本包含约410万个可商购化合物和820万个计数异构体。该数据库包括一个用户友好的图形界面、化学反应功能以及独特的搜索功能。
数据库和数据集可在http://cdb.ics.uci.edu获取。