• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepCRISTL:用于在特定细胞环境中预测 CRISPR/Cas9 靶向编辑效率的深度迁移学习。

DeepCRISTL: deep transfer learning to predict CRISPR/Cas9 on-target editing efficiency in specific cellular contexts.

机构信息

School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel.

Department of Computer Science, Bar-Ilan University, Ramat Gan 5290002, Israel.

出版信息

Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae481.

DOI:10.1093/bioinformatics/btae481
PMID:39073893
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11319645/
Abstract

MOTIVATION

CRISPR/Cas9 technology has been revolutionizing the field of gene editing. Guide RNAs (gRNAs) enable Cas9 proteins to target specific genomic loci for editing. However, editing efficiency varies between gRNAs and so computational methods were developed to predict editing efficiency for any gRNA of interest. High-throughput datasets of Cas9 editing efficiencies were produced to train machine-learning models to predict editing efficiency. However, these high-throughput datasets have a low correlation with functional and endogenous datasets, which are too small to train accurate machine-learning models on.

RESULTS

We developed DeepCRISTL, a deep-learning model to predict the editing efficiency in a specific cellular context. DeepCRISTL takes advantage of high-throughput datasets to learn general patterns of gRNA editing efficiency and then fine-tunes the model on functional or endogenous data to fit a specific cellular context. We tested two state-of-the-art models trained on high-throughput datasets for editing efficiency prediction, our newly improved DeepHF and CRISPRon, combined with various transfer-learning approaches. The combination of CRISPRon and fine-tuning all model weights was the overall best performer. DeepCRISTL outperformed state-of-the-art methods in predicting editing efficiency in a specific cellular context on functional and endogenous datasets. Using saliency maps, we identified and compared the important features learned by DeepCRISTL across cellular contexts. We believe DeepCRISTL will improve prediction performance in many other CRISPR/Cas9 editing contexts by leveraging transfer learning to utilize both high-throughput datasets and smaller and more biologically relevant datasets.

AVAILABILITY AND IMPLEMENTATION

DeepCRISTL is available via https://github.com/OrensteinLab/DeepCRISTL.

摘要

动机

CRISPR/Cas9 技术正在彻底改变基因编辑领域。向导 RNA(gRNA)使 Cas9 蛋白能够靶向特定的基因组位点进行编辑。然而,gRNA 之间的编辑效率存在差异,因此开发了计算方法来预测任何感兴趣的 gRNA 的编辑效率。产生了高通量的 Cas9 编辑效率数据集来训练机器学习模型以预测编辑效率。然而,这些高通量数据集与功能和内源性数据集相关性较低,这些数据集太小,无法在其上训练准确的机器学习模型。

结果

我们开发了 DeepCRISTL,这是一种深度学习模型,可预测特定细胞环境中的编辑效率。DeepCRISTL 利用高通量数据集来学习 gRNA 编辑效率的一般模式,然后在功能或内源性数据上对模型进行微调,以适应特定的细胞环境。我们测试了两种基于高通量数据集训练的用于编辑效率预测的最先进模型,即我们新改进的 DeepHF 和 CRISPRon,以及各种迁移学习方法。CRISPRon 与调整所有模型权重的组合是整体表现最好的。DeepCRISTL 在功能和内源性数据集上预测特定细胞环境中的编辑效率方面优于最先进的方法。使用显着性图,我们在不同的细胞环境中确定并比较了 DeepCRISTL 学到的重要特征。我们相信,通过利用迁移学习来利用高通量数据集和更小、更具生物学相关性的数据集,DeepCRISTL 将提高许多其他 CRISPR/Cas9 编辑环境中的预测性能。

可用性和实现

DeepCRISTL 可通过 https://github.com/OrensteinLab/DeepCRISTL 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/e5e4440b868c/btae481f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/6561fc90e8ee/btae481f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/fb0538da58cf/btae481f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/edbe95f94f15/btae481f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/ece512c8b03d/btae481f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/6d54de0153fa/btae481f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/cba9fc74e715/btae481f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/e5e4440b868c/btae481f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/6561fc90e8ee/btae481f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/fb0538da58cf/btae481f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/edbe95f94f15/btae481f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/ece512c8b03d/btae481f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/6d54de0153fa/btae481f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/cba9fc74e715/btae481f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30cf/11319645/e5e4440b868c/btae481f7.jpg

相似文献

1
DeepCRISTL: deep transfer learning to predict CRISPR/Cas9 on-target editing efficiency in specific cellular contexts.DeepCRISTL:用于在特定细胞环境中预测 CRISPR/Cas9 靶向编辑效率的深度迁移学习。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae481.
2
DeepCRISTL: deep transfer learning to predict CRISPR/Cas9 functional and endogenous on-target editing efficiency.DeepCRISTL:深度迁移学习预测 CRISPR/Cas9 功能和内源性靶标编辑效率。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i161-i168. doi: 10.1093/bioinformatics/btac218.
3
Enhancing CRISPR-Cas9 gRNA efficiency prediction by data integration and deep learning.通过数据集成和深度学习提高 CRISPR-Cas9 gRNA 效率预测。
Nat Commun. 2021 May 28;12(1):3238. doi: 10.1038/s41467-021-23576-0.
4
CRISPR-Cas9 gRNA efficiency prediction: an overview of predictive tools and the role of deep learning.CRISPR-Cas9 gRNA 效率预测:预测工具概述及深度学习的作用。
Nucleic Acids Res. 2022 Apr 22;50(7):3616-3637. doi: 10.1093/nar/gkac192.
5
Machine learning-based prediction models to guide the selection of Cas9 variants for efficient gene editing.基于机器学习的预测模型,用于指导 Cas9 变体的选择,以实现高效的基因编辑。
Cell Rep. 2024 Feb 27;43(2):113765. doi: 10.1016/j.celrep.2024.113765. Epub 2024 Feb 14.
6
Off-target predictions in CRISPR-Cas9 gene editing using deep learning.使用深度学习进行 CRISPR-Cas9 基因编辑中的脱靶预测。
Bioinformatics. 2018 Sep 1;34(17):i656-i663. doi: 10.1093/bioinformatics/bty554.
7
WheatCRISPR: a web-based guide RNA design tool for CRISPR/Cas9-mediated genome editing in wheat.小麦 CRISPR:一个基于网络的 guide RNA 设计工具,用于小麦中 CRISPR/Cas9 介导的基因组编辑。
BMC Plant Biol. 2019 Nov 6;19(1):474. doi: 10.1186/s12870-019-2097-z.
8
A Multiplexed CRISPR/Cas9 Editing System Based on the Endogenous tRNA Processing.一种基于内源性tRNA加工的多重CRISPR/Cas9编辑系统。
Methods Mol Biol. 2019;1917:63-73. doi: 10.1007/978-1-4939-8991-1_5.
9
CROTON: an automated and variant-aware deep learning framework for predicting CRISPR/Cas9 editing outcomes.克罗顿:一个自动化且变体感知的深度学习框架,用于预测 CRISPR/Cas9 编辑结果。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i342-i348. doi: 10.1093/bioinformatics/btab268.
10
gRNA Design: How Its Evolution Impacted on CRISPR/Cas9 Systems Refinement.gRNA 设计:其进化如何影响 CRISPR/Cas9 系统的改进。
Biomolecules. 2023 Nov 24;13(12):1698. doi: 10.3390/biom13121698.

引用本文的文献

1
Unlocking the potential of CRISPR tools and databases for precision genome editing.释放CRISPR工具和数据库在精准基因组编辑方面的潜力。
Front Plant Sci. 2025 Jul 21;16:1563711. doi: 10.3389/fpls.2025.1563711. eCollection 2025.
2
Off-target interactions in the CRISPR-Cas9 Machinery: mechanisms and outcomes.CRISPR-Cas9机制中的脱靶相互作用:机制与结果
Biochem Biophys Rep. 2025 Jul 5;43:102134. doi: 10.1016/j.bbrep.2025.102134. eCollection 2025 Sep.
3
From Code to Life: The AI-Driven Revolution in Genome Editing.从代码到生命:基因组编辑中的人工智能驱动革命

本文引用的文献

1
Generating, modeling and evaluating a large-scale set of CRISPR/Cas9 off-target sites with bulges.生成、建模和评估具有凸起结构的大规模 CRISPR/Cas9 脱靶位点集。
Nucleic Acids Res. 2024 Jul 8;52(12):6777-6790. doi: 10.1093/nar/gkae428.
2
Letter to the editor: Testing on external independent datasets is necessary to corroborate machine learning model improvement.致编辑的信:对外部独立数据集进行测试对于证实机器学习模型的改进是必要的。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad327.
3
DeepZF: improved DNA-binding prediction of C2H2-zinc-finger proteins by deep transfer learning.
Adv Sci (Weinh). 2025 Aug;12(30):e17029. doi: 10.1002/advs.202417029. Epub 2025 Jun 19.
DeepZF:通过深度迁移学习改进 C2H2-锌指蛋白的 DNA 结合预测。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii62-ii67. doi: 10.1093/bioinformatics/btac469.
4
A systematic evaluation of data processing and problem formulation of CRISPR off-target site prediction.CRISPR脱靶位点预测的数据处理与问题表述的系统评估
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac157.
5
CRISPR-Cas9 gRNA efficiency prediction: an overview of predictive tools and the role of deep learning.CRISPR-Cas9 gRNA 效率预测:预测工具概述及深度学习的作用。
Nucleic Acids Res. 2022 Apr 22;50(7):3616-3637. doi: 10.1093/nar/gkac192.
6
Enhancing CRISPR-Cas9 gRNA efficiency prediction by data integration and deep learning.通过数据集成和深度学习提高 CRISPR-Cas9 gRNA 效率预测。
Nat Commun. 2021 May 28;12(1):3238. doi: 10.1038/s41467-021-23576-0.
7
Avocado: a multi-scale deep tensor factorization method learns a latent representation of the human epigenome.鳄梨:一种多尺度深度张量分解方法,可学习人类表观基因组的潜在表示。
Genome Biol. 2020 Mar 30;21(1):81. doi: 10.1186/s13059-020-01977-6.
8
SpCas9 activity prediction by DeepSpCas9, a deep learning-based model with high generalization performance.基于深度学习的 SpCas9 活性预测模型 DeepSpCas9,具有出色的泛化性能。
Sci Adv. 2019 Nov 6;5(11):eaax9249. doi: 10.1126/sciadv.aax9249. eCollection 2019 Nov.
9
Optimized CRISPR guide RNA design for two high-fidelity Cas9 variants by deep learning.通过深度学习优化两个高保真 Cas9 变体的 CRISPR 引导 RNA 设计。
Nat Commun. 2019 Sep 19;10(1):4284. doi: 10.1038/s41467-019-12281-8.
10
An overview and metanalysis of machine and deep learning-based CRISPR gRNA design tools.基于机器和深度学习的 CRISPR gRNA 设计工具的概述和荟萃分析。
RNA Biol. 2020 Jan;17(1):13-22. doi: 10.1080/15476286.2019.1669406. Epub 2019 Sep 27.