ReactionMap: an efficient atom-mapping algorithm for chemical reactions.

Affiliation

Institute for Genomics and Bioinformatics and School of Information and Computer Sciences, University of California, Irvine, California 92697, United States.

Publication information

J Chem Inf Model. 2013 Nov 25;53(11):2812-9. doi: 10.1021/ci400326p. Epub 2013 Nov 11.

Abstract

Large databases of chemical reactions provide new data-mining opportunities and challenges. Key challenges result from the imperfect quality of the data and the fact that many of these reactions are not properly balanced or atom-mapped. Here, we describe ReactionMap, an efficient atom-mapping algorithm. Our approach uses a combination of maximum common chemical subgraph search and minimization of an assignment cost function derived empirically from training data. We use a set of over 259,000 balanced atom-mapped reactions from the SPRESI commercial database to train the system, and we validate it on random sets of 1000 and 17,996 reactions sampled from this pool. These large test sets represent a broad range of chemical reaction types, and ReactionMap correctly maps about 99% of the atoms and about 96% of the reactions, with a mean time per mapping of 2 s. Most correctly mapped reactions are mapped with high confidence. Mapping accuracy compares favorably with ChemAxon's AutoMapper, versions 5 and 6.1, and the DREAM Web tool. These approaches correctly map 60.7%, 86.5%, and 90.3% of the reactions, respectively, on the same data set. A ReactionMap server is available on the ChemDB Web portal at http://cdb.ics.uci.edu.
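
The idea of mapping atoms by minimizing an assignment cost can be illustrated with a small sketch. This is not ReactionMap's implementation: the abstract does not give the cost function, which the authors derive empirically from training data, and the maximum common subgraph step is omitted entirely. The atom representation, the `pair_cost` function, the `MISMATCH` penalty, and the exhaustive search below are all assumptions chosen for illustration.

```python
from collections import Counter
from itertools import permutations

# Hypothetical toy representation: (element, tuple of neighbor elements).
MISMATCH = 1000  # assumed penalty for pairing atoms of different elements


def pair_cost(a, b):
    """Toy cost: large penalty if elements differ, else number of differing neighbors."""
    if a[0] != b[0]:
        return MISMATCH
    na, nb = Counter(a[1]), Counter(b[1])
    # Size of the multiset symmetric difference of the two neighbor lists.
    return sum(((na - nb) + (nb - na)).values())


def best_assignment(reactant_atoms, product_atoms):
    """Exhaustively search for the reactant-to-product assignment of minimum total cost."""
    n = len(reactant_atoms)
    assert n == len(product_atoms), "sketch assumes a balanced reaction"
    best_perm, best_cost = None, float("inf")
    for perm in permutations(range(n)):
        cost = sum(pair_cost(reactant_atoms[i], product_atoms[j])
                   for i, j in enumerate(perm))
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    return best_perm, best_cost


# Heavy atoms of CH3Br + OH- -> CH3OH + Br-, with products listed in a different order.
reactants = [("C", ("Br",)), ("Br", ("C",)), ("O", ())]
products = [("O", ("C",)), ("Br", ()), ("C", ("O",))]

mapping, total_cost = best_assignment(reactants, products)
# mapping[i] is the index of the product atom assigned to reactant atom i.
```

Exhaustive search over permutations is only feasible for a handful of atoms; reactions of realistic size would need a polynomial-time assignment solver and, as the abstract describes, a subgraph search to constrain the candidates.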
