• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习与组合诱变技术相结合,可实现高效资源利用的 CRISPR-Cas9 基因组编辑工具的工程改造。

Machine learning-coupled combinatorial mutagenesis enables resource-efficient engineering of CRISPR-Cas9 genome editor activities.

机构信息

Laboratory of Combinatorial Genetics and Synthetic Biology, School of Biomedical Sciences, The University of Hong Kong, Hong Kong, SAR, China.

Centre for Oncology and Immunology Limited, Hong Kong Science Park, Hong Kong, SAR, China.

出版信息

Nat Commun. 2022 Apr 25;13(1):2219. doi: 10.1038/s41467-022-29874-5.

DOI:10.1038/s41467-022-29874-5
PMID:35468907
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9039034/
Abstract

The genome-editing Cas9 protein uses multiple amino-acid residues to bind the target DNA. Considering only the residues in proximity to the target DNA as potential sites to optimise Cas9's activity, the number of combinatorial variants to screen through is too massive for a wet-lab experiment. Here we generate and cross-validate ten in silico and experimental datasets of multi-domain combinatorial mutagenesis libraries for Cas9 engineering, and demonstrate that a machine learning-coupled engineering approach reduces the experimental screening burden by as high as 95% while enriching top-performing variants by ∼7.5-fold in comparison to the null model. Using this approach and followed by structure-guided engineering, we identify the N888R/A889Q variant conferring increased editing activity on the protospacer adjacent motif-relaxed KKH variant of Cas9 nuclease from Staphylococcus aureus (KKH-SaCas9) and its derived base editor in human cells. Our work validates a readily applicable workflow to enable resource-efficient high-throughput engineering of genome editor's activity.

摘要

基因组编辑 Cas9 蛋白使用多个氨基酸残基来结合目标 DNA。如果只考虑与目标 DNA 邻近的残基作为优化 Cas9 活性的潜在位点,那么通过湿实验筛选的组合变体数量太大。在这里,我们生成并交叉验证了十个 Cas9 工程多结构域组合诱变文库的计算和实验数据集,并证明机器学习与工程方法相结合,可以将实验筛选负担减少高达 95%,同时与零模型相比,富集的最佳变体增加了约 7.5 倍。使用这种方法,并通过结构导向工程,我们确定了 N888R/A889Q 变体,该变体赋予金黄色葡萄球菌(KKH-SaCas9)Cas9 核酸酶的原间隔基序松弛 KKH 变体及其衍生的碱基编辑器在人类细胞中提高编辑活性。我们的工作验证了一种易于应用的工作流程,可实现基因组编辑活性的资源高效高通量工程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/094459b30c1a/41467_2022_29874_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/b0a33705d28d/41467_2022_29874_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/d45a2e566eeb/41467_2022_29874_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/04a5728f0975/41467_2022_29874_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/fff4d3b9e57c/41467_2022_29874_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/094459b30c1a/41467_2022_29874_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/b0a33705d28d/41467_2022_29874_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/d45a2e566eeb/41467_2022_29874_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/04a5728f0975/41467_2022_29874_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/fff4d3b9e57c/41467_2022_29874_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9fa/9039034/094459b30c1a/41467_2022_29874_Fig5_HTML.jpg

相似文献

1
Machine learning-coupled combinatorial mutagenesis enables resource-efficient engineering of CRISPR-Cas9 genome editor activities.机器学习与组合诱变技术相结合,可实现高效资源利用的 CRISPR-Cas9 基因组编辑工具的工程改造。
Nat Commun. 2022 Apr 25;13(1):2219. doi: 10.1038/s41467-022-29874-5.
2
High-fidelity KKH variant of Staphylococcus aureus Cas9 nucleases with improved base mismatch discrimination.具有提高碱基错配识别能力的高保真 KKH 变体金黄色葡萄球菌 Cas9 核酸酶。
Nucleic Acids Res. 2022 Feb 22;50(3):1650-1660. doi: 10.1093/nar/gkab1291.
3
Expanding CRISPR/Cas9 Genome Editing Capacity in Zebrafish Using SaCas9.利用SaCas9扩展斑马鱼中的CRISPR/Cas9基因组编辑能力
G3 (Bethesda). 2016 Aug 9;6(8):2517-21. doi: 10.1534/g3.116.031914.
4
Rationally engineered Cas9 nucleases with high genome-wide specificity.经理性设计的 Cas9 核酸酶具有高度的全基因组特异性。
Proc Natl Acad Sci U S A. 2019 Oct 15;116(42):20969-20976. doi: 10.1073/pnas.1906843116. Epub 2019 Sep 30.
5
Accurate top protein variant discovery via low-N pick-and-validate machine learning.通过低 N 挑选-验证机器学习实现准确的顶级蛋白质变异体发现。
Cell Syst. 2024 Feb 21;15(2):193-203.e6. doi: 10.1016/j.cels.2024.01.002. Epub 2024 Feb 9.
6
Genetic Manipulation of MRSA Using CRISPR/Cas9 Technology.使用CRISPR/Cas9技术对耐甲氧西林金黄色葡萄球菌进行基因操作。
Methods Mol Biol. 2020;2069:113-124. doi: 10.1007/978-1-4939-9849-4_9.
7
Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition.通过修饰PAM识别拓宽金黄色葡萄球菌CRISPR-Cas9的靶向范围
Nat Biotechnol. 2015 Dec;33(12):1293-1298. doi: 10.1038/nbt.3404. Epub 2015 Nov 2.
8
[Application of machine learning in the CRISPR/Cas9 system].[机器学习在CRISPR/Cas9系统中的应用]
Yi Chuan. 2018 Sep 20;40(9):704-723. doi: 10.16288/j.yczz.18-135.
9
Engineered CRISPR-Cas9 nucleases with altered PAM specificities.具有改变的PAM特异性的工程化CRISPR-Cas9核酸酶。
Nature. 2015 Jul 23;523(7561):481-5. doi: 10.1038/nature14592. Epub 2015 Jun 22.
10
Cas9, Cpf1 and C2c1/2/3-What's next?Cas9、Cpf1 和 C2c1/2/3——接下来是什么?
Bioengineered. 2017 May 4;8(3):265-273. doi: 10.1080/21655979.2017.1282018. Epub 2017 Jan 31.

引用本文的文献

1
Genome-wide CRISPR screens identify critical targets to enhance CAR-NK cell antitumor potency.全基因组CRISPR筛选确定增强CAR-NK细胞抗肿瘤效力的关键靶点。
Cancer Cell. 2025 Aug 18. doi: 10.1016/j.ccell.2025.07.021.
2
Recent advances in CRISPR-based single-nucleotide fidelity diagnostics.基于CRISPR的单核苷酸保真度诊断技术的最新进展。
Commun Med (Lond). 2025 Jul 1;5(1):252. doi: 10.1038/s43856-025-00933-4.
3
Custom CRISPR-Cas9 PAM variants via scalable engineering and machine learning.通过可扩展工程和机器学习实现定制化CRISPR-Cas9原间隔序列临近基序变体

本文引用的文献

1
High-fidelity KKH variant of Staphylococcus aureus Cas9 nucleases with improved base mismatch discrimination.具有提高碱基错配识别能力的高保真 KKH 变体金黄色葡萄球菌 Cas9 核酸酶。
Nucleic Acids Res. 2022 Feb 22;50(3):1650-1660. doi: 10.1093/nar/gkab1291.
2
Probing ion channel functional architecture and domain recombination compatibility by massively parallel domain insertion profiling.通过大规模并行结构域插入分析,探测离子通道功能结构和结构域重组兼容性。
Nat Commun. 2021 Dec 8;12(1):7114. doi: 10.1038/s41467-021-27342-0.
3
Informed training set design enables efficient machine learning-assisted directed protein evolution.
Nature. 2025 Apr 22. doi: 10.1038/s41586-025-09021-y.
4
Engineering a New Generation of Gene Editors: Integrating Synthetic Biology and AI Innovations.打造新一代基因编辑器:融合合成生物学与人工智能创新成果
ACS Synth Biol. 2025 Mar 21;14(3):636-647. doi: 10.1021/acssynbio.4c00686. Epub 2025 Feb 25.
5
Synergizing CRISPR-Cas9 with Advanced Artificial Intelligence and Machine Learning for Precision Drug Delivery: Technological Nexus and Regulatory Insights.将CRISPR-Cas9与先进的人工智能和机器学习相结合以实现精准药物递送:技术关联与监管见解。
Curr Gene Ther. 2025;25(4):467-496. doi: 10.2174/0115665232342293241120033251.
6
Recent developments and future directions in point-of-care next-generation CRISPR-based rapid diagnosis.即时护理下一代基于CRISPR的快速诊断的最新进展与未来方向
Clin Exp Med. 2025 Jan 9;25(1):33. doi: 10.1007/s10238-024-01540-8.
7
Quantifying Protein-Nucleic Acid Interactions for Engineering Useful CRISPR-Cas9 Genome-Editing Variants.量化蛋白-核酸相互作用,以工程化有用的 CRISPR-Cas9 基因组编辑变体。
Methods Mol Biol. 2025;2870:227-243. doi: 10.1007/978-1-0716-4213-9_12.
8
A conditional protein diffusion model generates artificial programmable endonuclease sequences with enhanced activity.一种条件性蛋白质扩散模型生成了具有增强活性的人工可编程核酸内切酶序列。
Cell Discov. 2024 Sep 10;10(1):95. doi: 10.1038/s41421-024-00728-2.
9
Machine intelligence accelerated design of conductive MXene aerogels with programmable properties.机器智能加速具有可编程特性的导电MXene气凝胶的设计。
Nat Commun. 2024 Jun 1;15(1):4685. doi: 10.1038/s41467-024-49011-8.
10
Multiplex CRISPR-Cas Genome Editing: Next-Generation Microbial Strain Engineering.多重 CRISPR-Cas 基因组编辑:下一代微生物菌株工程。
J Agric Food Chem. 2024 May 29;72(21):11871-11884. doi: 10.1021/acs.jafc.4c01650. Epub 2024 May 14.
知情训练集设计可实现高效的机器学习辅助定向蛋白质进化。
Cell Syst. 2021 Nov 17;12(11):1026-1045.e7. doi: 10.1016/j.cels.2021.07.008. Epub 2021 Aug 19.
4
Predicting mutant outcome by combining deep mutational scanning and machine learning.通过结合深度突变扫描和机器学习预测突变结果。
Proteins. 2022 Jan;90(1):45-57. doi: 10.1002/prot.26184. Epub 2021 Jul 31.
5
Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning.通过深度学习从抗体序列预测抗原特异性来优化治疗性抗体。
Nat Biomed Eng. 2021 Jun;5(6):600-612. doi: 10.1038/s41551-021-00699-9. Epub 2021 Apr 15.
6
Improved prime editors enable pathogenic allele correction and cancer modelling in adult mice.改良的先导编辑可实现成年小鼠中致病等位基因的修正和癌症建模。
Nat Commun. 2021 Apr 9;12(1):2121. doi: 10.1038/s41467-021-22295-w.
7
Low-N protein engineering with data-efficient deep learning.低蛋白工程与数据高效深度学习。
Nat Methods. 2021 Apr;18(4):389-396. doi: 10.1038/s41592-021-01100-y. Epub 2021 Apr 7.
8
Deep diversification of an AAV capsid protein by machine learning.机器学习深度多样化 AAV 衣壳蛋白。
Nat Biotechnol. 2021 Jun;39(6):691-696. doi: 10.1038/s41587-020-00793-4. Epub 2021 Feb 11.
9
Deep Learning in Protein Structural Modeling and Design.蛋白质结构建模与设计中的深度学习
Patterns (N Y). 2020 Nov 12;1(9):100142. doi: 10.1016/j.patter.2020.100142. eCollection 2020 Dec 11.
10
High-Throughput Protein Engineering by Massively Parallel Combinatorial Mutagenesis.高通量蛋白质工程通过大规模并行组合诱变。
Methods Mol Biol. 2021;2199:3-12. doi: 10.1007/978-1-0716-0892-0_1.