Goshisht Manoj Kumar
Department of Chemistry, Natural and Applied Sciences, University of Wisconsin-Green Bay, Green Bay, Wisconsin 54311-7001, United States.
ACS Omega. 2024 Feb 19;9(9):9921-9945. doi: 10.1021/acsomega.3c05913. eCollection 2024 Mar 5.
Machine learning (ML), particularly deep learning (DL), has made rapid and substantial progress in synthetic biology in recent years. Biotechnological applications of biosystems, including pathways, enzymes, and whole cells, are being probed frequently with time. The intricacy and interconnectedness of biosystems make it challenging to design them with the desired properties. ML and DL have a synergy with synthetic biology. Synthetic biology can be employed to produce large data sets for training models (for instance, by utilizing DNA synthesis), and ML/DL models can be employed to inform design (for example, by generating new parts or advising unrivaled experiments to perform). This potential has recently been brought to light by research at the intersection of engineering biology and ML/DL through achievements like the design of novel biological components, best experimental design, automated analysis of microscopy data, protein structure prediction, and biomolecular implementations of ANNs (Artificial Neural Networks). I have divided this review into three sections. In the first section, I describe predictive potential and basics of ML along with myriad applications in synthetic biology, especially in engineering cells, activity of proteins, and metabolic pathways. In the second section, I describe fundamental DL architectures and their applications in synthetic biology. Finally, I describe different challenges causing hurdles in the progress of ML/DL and synthetic biology along with their solutions.
近年来,机器学习(ML),尤其是深度学习(DL),在合成生物学领域取得了迅速而重大的进展。随着时间的推移,生物系统(包括代谢途径、酶和全细胞)的生物技术应用正受到频繁探索。生物系统的复杂性和相互关联性使得设计具有所需特性的生物系统具有挑战性。ML和DL与合成生物学具有协同作用。合成生物学可用于生成用于训练模型的大数据集(例如,通过利用DNA合成),而ML/DL模型可用于指导设计(例如,通过生成新部件或建议进行无与伦比的实验)。工程生物学与ML/DL交叉领域的研究最近通过诸如新型生物组件的设计、最佳实验设计、显微镜数据的自动分析、蛋白质结构预测以及人工神经网络(ANN)的生物分子实现等成果,揭示了这种潜力。我将本综述分为三个部分。在第一部分中,我描述了ML的预测潜力和基础知识,以及在合成生物学中的众多应用,特别是在工程细胞、蛋白质活性和代谢途径方面。在第二部分中,我描述了基本的DL架构及其在合成生物学中的应用。最后,我描述了阻碍ML/DL和合成生物学发展的不同挑战及其解决方案。