Laboratory of Combinatorial Genetics and Synthetic Biology, School of Biomedical Sciences, The University of Hong Kong, Hong Kong, SAR, China.
Centre for Oncology and Immunology Limited, Hong Kong Science Park, Hong Kong, SAR, China.
Nat Commun. 2022 Apr 25;13(1):2219. doi: 10.1038/s41467-022-29874-5.
The genome-editing Cas9 protein uses multiple amino-acid residues to bind the target DNA. Considering only the residues in proximity to the target DNA as potential sites to optimise Cas9's activity, the number of combinatorial variants to screen through is too massive for a wet-lab experiment. Here we generate and cross-validate ten in silico and experimental datasets of multi-domain combinatorial mutagenesis libraries for Cas9 engineering, and demonstrate that a machine learning-coupled engineering approach reduces the experimental screening burden by as high as 95% while enriching top-performing variants by ∼7.5-fold in comparison to the null model. Using this approach and followed by structure-guided engineering, we identify the N888R/A889Q variant conferring increased editing activity on the protospacer adjacent motif-relaxed KKH variant of Cas9 nuclease from Staphylococcus aureus (KKH-SaCas9) and its derived base editor in human cells. Our work validates a readily applicable workflow to enable resource-efficient high-throughput engineering of genome editor's activity.
基因组编辑 Cas9 蛋白使用多个氨基酸残基来结合目标 DNA。如果只考虑与目标 DNA 邻近的残基作为优化 Cas9 活性的潜在位点,那么通过湿实验筛选的组合变体数量太大。在这里,我们生成并交叉验证了十个 Cas9 工程多结构域组合诱变文库的计算和实验数据集,并证明机器学习与工程方法相结合,可以将实验筛选负担减少高达 95%,同时与零模型相比,富集的最佳变体增加了约 7.5 倍。使用这种方法,并通过结构导向工程,我们确定了 N888R/A889Q 变体,该变体赋予金黄色葡萄球菌(KKH-SaCas9)Cas9 核酸酶的原间隔基序松弛 KKH 变体及其衍生的碱基编辑器在人类细胞中提高编辑活性。我们的工作验证了一种易于应用的工作流程,可实现基因组编辑活性的资源高效高通量工程。