Department of Chemical Engineering, Osaka Metropolitan University, Sakai, Osaka, Japan.
Biotechnol J. 2024 Jan;19(1):e2300285. doi: 10.1002/biot.202300285. Epub 2023 Nov 27.
Simultaneous modification of the expression levels of many metabolic enzyme genes results in diverse expression ratios of these genes; however, the relationship between gene expression levels and chemical productivity remains unclear. However, clarification of this relationship is expected to improve the productivity of useful chemicals. Supervised machine learning is considered to be an effective means to clarify this relationship. In this study, to improve the productivity of carotenoids in yeast Saccharomyces cerevisiae, we aimed to build a machine-learning model that can predict the optimal gene expression level for carotenoid production. First, we obtained data on the expression levels of mevalonate pathway enzyme genes and carotenoid production. Then, based on these data, we built a machine-learning model to predict carotenoid productivity based on gene expression levels. The prediction accuracy of 0.6292 (coefficient of determination) was achieved using the test data. The maximum predicted carotenoid productivity was 4.3 times higher in the engineered strain than in the parental strain, suggesting that the expression levels of the mevalonate pathway enzyme genes tHMG1 and ERG8 have a particularly large impact on carotenoid productivity. This study could be one of the important achievements in addressing the uncertainty of genotype-phenotype correlations, which is one of the challenges facing metabolic engineering strategies.
同时改变许多代谢酶基因的表达水平会导致这些基因的表达比例不同;然而,基因表达水平与化学产率之间的关系仍不清楚。然而,澄清这种关系有望提高有用化学品的产量。监督机器学习被认为是澄清这种关系的有效手段。在这项研究中,为了提高酵母酿酒酵母中类胡萝卜素的产量,我们旨在建立一个可以预测类胡萝卜素生产最佳基因表达水平的机器学习模型。首先,我们获得了关于甲羟戊酸途径酶基因表达水平和类胡萝卜素产量的数据。然后,基于这些数据,我们建立了一个基于基因表达水平预测类胡萝卜素生产力的机器学习模型。使用测试数据达到了 0.6292 的预测准确性(决定系数)。与亲本菌株相比,工程菌株中类胡萝卜素的最大预测产量提高了 4.3 倍,这表明甲羟戊酸途径酶基因 tHMG1 和 ERG8 的表达水平对类胡萝卜素的生产力有特别大的影响。这项研究可能是解决代谢工程策略面临的基因型-表型相关性不确定性的重要成就之一。