Suppr超能文献

准确的生化反应原子映射计算。

Accurate atom-mapping computation for biochemical reactions.

机构信息

SRI International, 333 Ravenswood Ave., Menlo Park, California 94025, USA.

出版信息

J Chem Inf Model. 2012 Nov 26;52(11):2970-82. doi: 10.1021/ci3002217. Epub 2012 Oct 15.

Abstract

The complete atom mapping of a chemical reaction is a bijection of the reactant atoms to the product atoms that specifies the terminus of each reactant atom. Atom mapping of biochemical reactions is useful for many applications of systems biology, in particular for metabolic engineering where synthesizing new biochemical pathways has to take into account for the number of carbon atoms from a source compound that are conserved in the synthesis of a target compound. Rapid, accurate computation of the atom mapping(s) of a biochemical reaction remains elusive despite significant work on this topic. In particular, past researchers did not validate the accuracy of mapping algorithms. We introduce a new method for computing atom mappings called the minimum weighted edit-distance (MWED) metric. The metric is based on bond propensity to react and computes biochemically valid atom mappings for a large percentage of biochemical reactions. MWED models can be formulated efficiently as Mixed-Integer Linear Programs (MILPs). We have demonstrated this approach on 7501 reactions of the MetaCyc database for which 87% of the models could be solved in less than 10 s. For 2.1% of the reactions, we found multiple optimal atom mappings. We show that the error rate is 0.9% (22 reactions) by comparing these atom mappings to 2446 atom mappings of the manually curated Kyoto Encyclopedia of Genes and Genomes (KEGG) RPAIR database. To our knowledge, our computational atom-mapping approach is the most accurate and among the fastest published to date. The atom-mapping data will be available in the MetaCyc database later in 2012; the atom-mapping software will be available within the Pathway Tools software later in 2012.

摘要

化学反应的完整原子映射是反应物原子到产物原子的双射,它指定了每个反应物原子的终点。生化反应的原子映射对于系统生物学的许多应用都很有用,特别是在代谢工程中,合成新的生化途径必须考虑到源化合物的碳原子数量在目标化合物的合成中是守恒的。尽管在这个主题上已经做了大量的工作,但快速、准确地计算生化反应的原子映射仍然难以实现。特别是,过去的研究人员没有验证映射算法的准确性。我们引入了一种新的计算原子映射的方法,称为最小加权编辑距离(MWED)度量。该度量基于键的反应倾向,并为很大一部分生化反应计算出生物化学上有效的原子映射。MWED 模型可以有效地表示为混合整数线性规划(MILP)。我们已经在 MetaCyc 数据库的 7501 个反应上证明了这种方法,其中 87%的模型可以在不到 10 秒内解决。对于 2.1%的反应,我们找到了多个最佳原子映射。我们通过将这些原子映射与手动整理的京都基因与基因组百科全书(KEGG)RPAIR 数据库的 2446 个原子映射进行比较,发现错误率为 0.9%(22 个反应)。据我们所知,我们的计算原子映射方法是迄今为止最准确和最快的方法之一。原子映射数据将在 2012 年晚些时候在 MetaCyc 数据库中提供;原子映射软件将在 2012 年晚些时候在 Pathway Tools 软件中提供。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验