• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用CRISPR-Cas9基因编辑中的新型sgRNA-DNA序列编码进行准确的深度学习脱靶预测。

Accurate deep learning off-target prediction with novel sgRNA-DNA sequence encoding in CRISPR-Cas9 gene editing.

作者信息

Charlier Jeremy, Nadon Robert, Makarenkov Vladimir

机构信息

Département d'Informatique, Université du Québec à Montréal, Montréal, QC H2X 3Y7, Canada.

McGill University and Genome Quebec Innovation Centre, Montreal, QC H3A 0C7, Canada.

出版信息

Bioinformatics. 2021 Aug 25;37(16):2299-2307. doi: 10.1093/bioinformatics/btab112.

DOI:10.1093/bioinformatics/btab112
PMID:33599251
Abstract

MOTIVATION

Off-target predictions are crucial in gene editing research. Recently, significant progress has been made in the field of prediction of off-target mutations, particularly with CRISPR-Cas9 data, thanks to the use of deep learning. CRISPR-Cas9 is a gene editing technique which allows manipulation of DNA fragments. The sgRNA-DNA (single guide RNA-DNA) sequence encoding for deep neural networks, however, has a strong impact on the prediction accuracy. We propose a novel encoding of sgRNA-DNA sequences that aggregates sequence data with no loss of information.

RESULTS

In our experiments, we compare the proposed sgRNA-DNA sequence encoding applied in a deep learning prediction framework with state-of-the-art encoding and prediction methods. We demonstrate the superior accuracy of our approach in a simulation study involving Feedforward Neural Networks (FNNs), Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) as well as the traditional Random Forest (RF), Naive Bayes (NB) and Logistic Regression (LR) classifiers. We highlight the quality of our results by building several FNNs, CNNs and RNNs with various layer depths and performing predictions on two popular gene editing datasets (CRISPOR and GUIDE-seq). In all our experiments, the new encoding led to more accurate off-target prediction results, providing an improvement of the area under the Receiver Operating Characteristic (ROC) curve up to 35%.

AVAILABILITY AND IMPLEMENTATION

The code and data used in this study are available at: https://github.com/dagrate/dl-offtarget.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

脱靶预测在基因编辑研究中至关重要。近年来,得益于深度学习的应用,脱靶突变预测领域取得了显著进展,尤其是在CRISPR-Cas9数据方面。CRISPR-Cas9是一种允许对DNA片段进行操作的基因编辑技术。然而,用于深度神经网络编码的sgRNA-DNA(单导向RNA-DNA)序列对预测准确性有很大影响。我们提出了一种新颖的sgRNA-DNA序列编码方法,该方法在不损失信息的情况下聚合序列数据。

结果

在我们的实验中,我们将应用于深度学习预测框架的sgRNA-DNA序列编码方法与最先进的编码和预测方法进行了比较。我们在涉及前馈神经网络(FNN)、卷积神经网络(CNN)和循环神经网络(RNN)以及传统随机森林(RF)、朴素贝叶斯(NB)和逻辑回归(LR)分类器的模拟研究中证明了我们方法的卓越准确性。我们通过构建具有不同层深度的多个FNN、CNN和RNN,并在两个流行的基因编辑数据集(CRISPOR和GUIDE-seq)上进行预测,突出了我们结果的质量。在我们所有的实验中,新编码都带来了更准确的脱靶预测结果,使受试者工作特征(ROC)曲线下面积提高了35%。

可用性和实现方式

本研究中使用的代码和数据可在以下网址获取:https://github.com/dagrate/dl-offtarget。

补充信息

补充数据可在《生物信息学》在线版获取。

相似文献

1
Accurate deep learning off-target prediction with novel sgRNA-DNA sequence encoding in CRISPR-Cas9 gene editing.利用CRISPR-Cas9基因编辑中的新型sgRNA-DNA序列编码进行准确的深度学习脱靶预测。
Bioinformatics. 2021 Aug 25;37(16):2299-2307. doi: 10.1093/bioinformatics/btab112.
2
Off-target predictions in CRISPR-Cas9 gene editing using deep learning.使用深度学习进行 CRISPR-Cas9 基因编辑中的脱靶预测。
Bioinformatics. 2018 Sep 1;34(17):i656-i663. doi: 10.1093/bioinformatics/bty554.
3
Prediction of CRISPR/Cas9 single guide RNA cleavage efficiency and specificity by attention-based convolutional neural networks.基于注意力机制的卷积神经网络预测CRISPR/Cas9单向导RNA切割效率和特异性
Comput Struct Biotechnol J. 2021 Mar 7;19:1445-1457. doi: 10.1016/j.csbj.2021.03.001. eCollection 2021.
4
C-RNNCrispr: Prediction of CRISPR/Cas9 sgRNA activity using convolutional and recurrent neural networks.C-RNNCrispr:使用卷积神经网络和循环神经网络预测CRISPR/Cas9 sgRNA活性。
Comput Struct Biotechnol J. 2020 Feb 12;18:344-354. doi: 10.1016/j.csbj.2020.01.013. eCollection 2020.
5
Using traditional machine learning and deep learning methods for on- and off-target prediction in CRISPR/Cas9: a review.基于传统机器学习和深度学习方法的 CRISPR/Cas9 脱靶和靶标预测:综述。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad131.
6
Deep learning improves the ability of sgRNA off-target propensity prediction.深度学习提高 sgRNA 脱靶倾向预测能力。
BMC Bioinformatics. 2020 Feb 10;21(1):51. doi: 10.1186/s12859-020-3395-z.
7
CROTON: an automated and variant-aware deep learning framework for predicting CRISPR/Cas9 editing outcomes.克罗顿:一个自动化且变体感知的深度学习框架,用于预测 CRISPR/Cas9 编辑结果。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i342-i348. doi: 10.1093/bioinformatics/btab268.
8
Prediction of sgRNA Off-Target Activity in CRISPR/Cas9 Gene Editing Using Graph Convolution Network.利用图卷积网络预测CRISPR/Cas9基因编辑中sgRNA的脱靶活性
Entropy (Basel). 2021 May 14;23(5):608. doi: 10.3390/e23050608.
9
sgRNACNN: identifying sgRNA on-target activity in four crops using ensembles of convolutional neural networks.sgRNA CNN:使用卷积神经网络集合鉴定四种作物中的 sgRNA 靶标活性。
Plant Mol Biol. 2021 Mar;105(4-5):483-495. doi: 10.1007/s11103-020-01102-y. Epub 2021 Jan 1.
10
Effective use of sequence information to predict CRISPR-Cas9 off-target.有效利用序列信息预测CRISPR-Cas9脱靶效应。
Comput Struct Biotechnol J. 2022 Jan 19;20:650-661. doi: 10.1016/j.csbj.2022.01.006. eCollection 2022.

引用本文的文献

1
The Interface of Gene Editing with Regenerative Medicine.基因编辑与再生医学的界面
Engineering (Beijing). 2025 Mar;46:73-100. doi: 10.1016/j.eng.2024.10.019. Epub 2024 Nov 30.
2
Off-target sequence variations driven by the intrinsic properties of the Cas-sgRNA-DNA complex in genome editing.基因组编辑中由Cas-sgRNA-DNA复合物的内在特性驱动的脱靶序列变异。
PLoS One. 2025 Jul 18;20(7):e0328905. doi: 10.1371/journal.pone.0328905. eCollection 2025.
3
Deep Learning Based Models for CRISPR/Cas Off-Target Prediction.基于深度学习的CRISPR/Cas脱靶预测模型
Small Methods. 2025 Jul;9(7):e2500122. doi: 10.1002/smtd.202500122. Epub 2025 Jun 4.
4
Utilizing Target Sequences with Multiple Flanking Protospacer Adjacent Motif (PAM) Sites Reduces Off-Target Effects of the Cas9 Enzyme in Pineapple.利用具有多个侧翼原间隔相邻基序(PAM)位点的靶序列可降低菠萝中Cas9酶的脱靶效应。
Genes (Basel). 2025 Feb 13;16(2):217. doi: 10.3390/genes16020217.
5
Engineering a New Generation of Gene Editors: Integrating Synthetic Biology and AI Innovations.打造新一代基因编辑器:融合合成生物学与人工智能创新成果
ACS Synth Biol. 2025 Mar 21;14(3):636-647. doi: 10.1021/acssynbio.4c00686. Epub 2025 Feb 25.
6
Overview and Prospects of DNA Sequence Visualization.DNA序列可视化概述与展望
Int J Mol Sci. 2025 Jan 8;26(2):477. doi: 10.3390/ijms26020477.
7
Predicting CRISPR-Cas9 off-target effects in human primary cells using bidirectional LSTM with BERT embedding.使用具有BERT嵌入的双向长短期记忆网络预测人类原代细胞中的CRISPR-Cas9脱靶效应。
Bioinform Adv. 2024 Dec 30;5(1):vbae184. doi: 10.1093/bioadv/vbae184. eCollection 2025.
8
The Evolution of Nucleic Acid-Based Diagnosis Methods from the (pre-)CRISPR to CRISPR era and the Associated Machine/Deep Learning Approaches in Relevant RNA Design.从(前)CRISPR 时代到 CRISPR 时代的核酸诊断方法的演变,以及相关 RNA 设计中的机器/深度学习方法。
Methods Mol Biol. 2025;2847:241-300. doi: 10.1007/978-1-0716-4079-1_17.
9
Overcoming CRISPR-Cas9 off-target prediction hurdles: A novel approach with ESB rebalancing strategy and CRISPR-MCA model.克服 CRISPR-Cas9 脱靶预测障碍:一种新的方法,结合 ESB 再平衡策略和 CRISPR-MCA 模型。
PLoS Comput Biol. 2024 Sep 3;20(9):e1012340. doi: 10.1371/journal.pcbi.1012340. eCollection 2024 Sep.
10
Generating, modeling and evaluating a large-scale set of CRISPR/Cas9 off-target sites with bulges.生成、建模和评估具有凸起结构的大规模 CRISPR/Cas9 脱靶位点集。
Nucleic Acids Res. 2024 Jul 8;52(12):6777-6790. doi: 10.1093/nar/gkae428.