Saito Yutaka, Oikawa Misaki, Nakazawa Hikaru, Niide Teppei, Kameda Tomoshi, Tsuda Koji, Umetsu Mitsuo
Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST) , 2-4-7 Aomi, Koto-ku , Tokyo 135-0064 , Japan.
Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST) , 3-4-1 Okubo, Shinjuku-ku , Tokyo 169-8555 , Japan.
ACS Synth Biol. 2018 Sep 21;7(9):2014-2022. doi: 10.1021/acssynbio.8b00155. Epub 2018 Aug 20.
Molecular evolution based on mutagenesis is widely used in protein engineering. However, optimal proteins are often difficult to obtain due to a large sequence space. Here, we propose a novel approach that combines molecular evolution with machine learning. In this approach, we conduct two rounds of mutagenesis where an initial library of protein variants is used to train a machine-learning model to guide mutagenesis for the second-round library. This enables us to prepare a small library suited for screening experiments with high enrichment of functional proteins. We demonstrated a proof-of-concept of our approach by altering the reference green fluorescent protein (GFP) so that its fluorescence is changed into yellow. We successfully obtained a number of proteins showing yellow fluorescence, 12 of which had longer wavelengths than the reference yellow fluorescent protein (YFP). These results show the potential of our approach as a powerful method for directed evolution of fluorescent proteins.
基于诱变的分子进化在蛋白质工程中被广泛应用。然而,由于序列空间庞大,往往难以获得最优蛋白质。在此,我们提出一种将分子进化与机器学习相结合的新方法。在这种方法中,我们进行两轮诱变,利用蛋白质变体的初始文库训练机器学习模型,以指导第二轮文库的诱变。这使我们能够制备一个适合筛选实验的小型文库,其中功能性蛋白质高度富集。我们通过改变参考绿色荧光蛋白(GFP)使其荧光变为黄色,展示了我们方法的概念验证。我们成功获得了许多发出黄色荧光的蛋白质,其中12种蛋白质的波长比参考黄色荧光蛋白(YFP)更长。这些结果表明我们的方法作为荧光蛋白定向进化的有力方法具有潜力。