Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering, KAIST Institute for BioCentury, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea.
Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, Republic of Korea.
Biotechnol J. 2021 May;16(5):e2000605. doi: 10.1002/biot.202000605. Epub 2021 Jan 18.
Retrobiosynthesis allows the designing of novel biosynthetic pathways for the production of chemicals and materials through metabolic engineering, but generates a large number of reactions beyond the experimental feasibility. Thus, an effective method that can reduce a large number of the initially predicted enzymatic reactions has been needed. Here, we present Deep learning-based Reaction Feasibility Checker (DeepRFC) to classify the feasibility of a given enzymatic reaction with high performance and speed. DeepRFC is designed to receive Simplified Molecular-Input Line-Entry System (SMILES) strings of a reactant pair, which is defined as a substrate and a product of a reaction, as an input, and evaluates whether the input reaction is feasible. A deep neural network is selected for DeepRFC as it leads to better classification performance than five other representative machine learning methods examined. For validation, the performance of DeepRFC is compared with another in-house reaction feasibility checker that uses the concept of reaction similarity. Finally, the use of DeepRFC is demonstrated for the retrobiosynthesis-based design of novel one-carbon assimilation pathways. DeepRFC will allow retrobiosynthesis to be more practical for metabolic engineering applications by efficiently screening a large number of retrobiosynthesis-derived enzymatic reactions. DeepRFC is freely available at https://bitbucket.org/kaistsystemsbiology/deeprfc.
反生物合成允许通过代谢工程为化学品和材料的生产设计新的生物合成途径,但会产生超出实验可行性的大量反应。因此,需要一种能够有效减少最初预测的大量酶反应的方法。在这里,我们提出了基于深度学习的反应可行性检查器(DeepRFC),以高性能和快速的方式对给定的酶反应的可行性进行分类。DeepRFC 被设计为接收反应物对的简化分子输入行式系统 (SMILES) 字符串作为输入,反应物对定义为反应的底物和产物,并评估输入反应是否可行。选择深度神经网络作为 DeepRFC,因为它比检查的其他五种代表性机器学习方法具有更好的分类性能。为了验证,将 DeepRFC 的性能与另一个使用反应相似性概念的内部反应可行性检查器进行了比较。最后,演示了 DeepRFC 在基于反生物合成的新型一碳同化途径设计中的应用。DeepRFC 将通过有效筛选大量反生物合成衍生的酶反应,使反生物合成在代谢工程应用中更具实用性。DeepRFC 可在 https://bitbucket.org/kaistsystemsbiology/deeprfc 免费获得。