• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

sgRNA CNN:使用卷积神经网络集合鉴定四种作物中的 sgRNA 靶标活性。

sgRNACNN: identifying sgRNA on-target activity in four crops using ensembles of convolutional neural networks.

机构信息

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.

Department of System Integration, Sparebanken Vest, Bergen, Norway.

出版信息

Plant Mol Biol. 2021 Mar;105(4-5):483-495. doi: 10.1007/s11103-020-01102-y. Epub 2021 Jan 1.

DOI:10.1007/s11103-020-01102-y
PMID:33385273
Abstract

We proposed an ensemble convolutional neural network model to identify sgRNA high on-target activity in four crops and we used one-hot encoding and k-mers for sequence encoding. As an important component of the CRISPR/Cas9 system, single-guide RNA (sgRNA) plays an important role in gene redirection and editing. sgRNA has played an important role in the improvement of agronomic species, but there is a lack of effective bioinformatics tools to identify the activity of sgRNA in agronomic species. Therefore, it is necessary to develop a method based on machine learning to identify sgRNA high on-target activity. In this work, we proposed a simple convolutional neural network method to identify sgRNA high on-target activity. Our study used one-hot encoding and k-mers for sequence data conversion and a voting algorithm for constructing the convolutional neural network ensemble model sgRNACNN for the prediction of sgRNA activity. The ensemble model sgRNACNN was used for predictions in four crops: Glycine max, Zea mays, Sorghum bicolor and Triticum aestivum. The accuracy rates of the four crops in the sgRNACNN model were 82.43%, 80.33%, 78.25% and 87.49%, respectively. The experimental results showed that sgRNACNN realizes the identification of high on-target activity sgRNA of agronomic data and can meet the demands of sgRNA activity prediction in agronomy to a certain extent. These results have certain significance for guiding crop gene editing and academic research. The source code and relevant dataset can be found in the following link: https://github.com/nmt315320/sgRNACNN.git .

摘要

我们提出了一个集成卷积神经网络模型,用于识别四种作物中 sgRNA 的高靶标活性,我们使用独热编码和 K-mer 进行序列编码。作为 CRISPR/Cas9 系统的重要组成部分,单导向 RNA(sgRNA)在基因重定向和编辑中发挥着重要作用。sgRNA 在改良农作物品种方面发挥了重要作用,但缺乏有效的生物信息学工具来识别农作物中的 sgRNA 活性。因此,有必要开发一种基于机器学习的方法来识别 sgRNA 的高靶标活性。在这项工作中,我们提出了一种简单的卷积神经网络方法来识别 sgRNA 的高靶标活性。我们的研究使用独热编码和 K-mer 进行序列数据转换,并使用投票算法构建卷积神经网络集成模型 sgRNACNN 用于 sgRNA 活性预测。集成模型 sgRNACNN 用于四种作物的预测:大豆、玉米、高粱和小麦。sgRNACNN 模型在四种作物中的准确率分别为 82.43%、80.33%、78.25%和 87.49%。实验结果表明,sgRNACNN 实现了对农业数据中高靶标活性 sgRNA 的识别,在一定程度上能够满足农业中 sgRNA 活性预测的需求。这些结果对于指导作物基因编辑和学术研究具有一定的意义。源代码和相关数据集可在以下链接中找到:https://github.com/nmt315320/sgRNACNN.git。

相似文献

1
sgRNACNN: identifying sgRNA on-target activity in four crops using ensembles of convolutional neural networks.sgRNA CNN:使用卷积神经网络集合鉴定四种作物中的 sgRNA 靶标活性。
Plant Mol Biol. 2021 Mar;105(4-5):483-495. doi: 10.1007/s11103-020-01102-y. Epub 2021 Jan 1.
2
CrnnCrispr: An Interpretable Deep Learning Method for CRISPR/Cas9 sgRNA On-Target Activity Prediction.CrnnCrispr:一种用于CRISPR/Cas9 sgRNA靶向活性预测的可解释深度学习方法。
Int J Mol Sci. 2024 Apr 17;25(8):4429. doi: 10.3390/ijms25084429.
3
Prediction of sgRNA on-target activity in bacteria by deep learning.通过深度学习预测细菌中 sgRNA 的靶向活性。
BMC Bioinformatics. 2019 Oct 24;20(1):517. doi: 10.1186/s12859-019-3151-4.
4
Prediction of CRISPR sgRNA Activity Using a Deep Convolutional Neural Network.利用深度卷积神经网络预测 CRISPR sgRNA 活性。
J Chem Inf Model. 2019 Jan 28;59(1):615-624. doi: 10.1021/acs.jcim.8b00368. Epub 2018 Dec 7.
5
CNN-XG: A Hybrid Framework for sgRNA On-Target Prediction.CNN-XG:一种用于 sgRNA 靶标预测的混合框架。
Biomolecules. 2022 Mar 7;12(3):409. doi: 10.3390/biom12030409.
6
[Prediction of CRISPR/Cas9 off-target activity using multi-scale convolutional neural network].[使用多尺度卷积神经网络预测CRISPR/Cas9脱靶活性]
Sheng Wu Gong Cheng Xue Bao. 2024 Mar 25;40(3):858-876. doi: 10.13345/j.cjb.230382.
7
CRISPR-M: Predicting sgRNA off-target effect using a multi-view deep learning network.CRISPR-M:使用多视图深度学习网络预测 sgRNA 脱靶效应。
PLoS Comput Biol. 2024 Mar 14;20(3):e1011972. doi: 10.1371/journal.pcbi.1011972. eCollection 2024 Mar.
8
TransCrispr: Transformer Based Hybrid Model for Predicting CRISPR/Cas9 Single Guide RNA Cleavage Efficiency.TransCrispr:用于预测 CRISPR/Cas9 单指导 RNA 切割效率的基于 Transformer 的混合模型。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1518-1528. doi: 10.1109/TCBB.2022.3201631. Epub 2023 Apr 3.
9
CRISPRpred(SEQ): a sequence-based method for sgRNA on target activity prediction using traditional machine learning.CRISPRpred(SEQ):一种基于序列的 sgRNA 靶活性预测方法,使用传统机器学习。
BMC Bioinformatics. 2020 Jun 1;21(1):223. doi: 10.1186/s12859-020-3531-9.
10
Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature.利用注意力增强深度学习和基于网络的基因特征预测 CRISPR-Cas 系统的脱靶特异性和细胞特异性适应性。
PLoS Comput Biol. 2019 Oct 28;15(10):e1007480. doi: 10.1371/journal.pcbi.1007480. eCollection 2019 Oct.

引用本文的文献

1
Leveraging protein language models for cross-variant CRISPR/Cas9 sgRNA activity prediction.利用蛋白质语言模型进行跨变体CRISPR/Cas9 sgRNA活性预测。
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf385.
2
Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.从湿实验室到人工智能的转变:对CRISPR中人工智能预测因子的系统综述
J Transl Med. 2025 Feb 4;23(1):153. doi: 10.1186/s12967-024-06013-w.
3
DeepMEns: an ensemble model for predicting sgRNA on-target activity based on multiple features.

本文引用的文献

1
The Past, Present, and Future of Maize Improvement: Domestication, Genomics, and Functional Genomic Routes toward Crop Enhancement.玉米改良的过去、现在和未来:驯化、基因组学和功能基因组途径促进作物改良。
Plant Commun. 2019 Nov 27;1(1):100010. doi: 10.1016/j.xplc.2019.100010. eCollection 2020 Jan 13.
2
NOREVA: enhanced normalization and evaluation of time-course and multi-class metabolomic data.NOREVA:时间进程和多类代谢组学数据的增强标准化和评估。
Nucleic Acids Res. 2020 Jul 2;48(W1):W436-W448. doi: 10.1093/nar/gkaa258.
3
CirRNAPL: A web server for the identification of circRNA based on extreme learning machine.
DeepMEns:一种基于多种特征预测sgRNA靶向活性的集成模型。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae043.
4
GBMPhos: A Gating Mechanism and Bi-GRU-Based Method for Identifying Phosphorylation Sites of SARS-CoV-2 Infection.GBMPhos:一种基于门控机制和双向门控循环单元的新冠病毒感染磷酸化位点识别方法。
Biology (Basel). 2024 Oct 6;13(10):798. doi: 10.3390/biology13100798.
5
The Evolution of Nucleic Acid-Based Diagnosis Methods from the (pre-)CRISPR to CRISPR era and the Associated Machine/Deep Learning Approaches in Relevant RNA Design.从(前)CRISPR 时代到 CRISPR 时代的核酸诊断方法的演变,以及相关 RNA 设计中的机器/深度学习方法。
Methods Mol Biol. 2025;2847:241-300. doi: 10.1007/978-1-0716-4079-1_17.
6
Optimizing protein sequence classification: integrating deep learning models with Bayesian optimization for enhanced biological analysis.优化蛋白质序列分类:将深度学习模型与贝叶斯优化相结合,以增强生物分析。
BMC Med Inform Decis Mak. 2024 Aug 27;24(1):236. doi: 10.1186/s12911-024-02631-y.
7
Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings.深度序列到蛋白预测(Deep-STP):一种基于深度学习的方法,通过词嵌入来预测蛇毒蛋白。
Front Med (Lausanne). 2024 Jan 17;10:1291352. doi: 10.3389/fmed.2023.1291352. eCollection 2023.
8
Deep learning in CRISPR-Cas systems: a review of recent studies.CRISPR-Cas系统中的深度学习:近期研究综述
Front Bioeng Biotechnol. 2023 Jul 3;11:1226182. doi: 10.3389/fbioe.2023.1226182. eCollection 2023.
9
Empirical comparison and recent advances of computational prediction of hormone binding proteins using machine learning methods.使用机器学习方法对激素结合蛋白进行计算预测的实证比较与最新进展
Comput Struct Biotechnol J. 2023 Mar 17;21:2253-2261. doi: 10.1016/j.csbj.2023.03.024. eCollection 2023.
10
CRISPR-mediated technology for seed oil improvement in rapeseed: Challenges and future perspectives.用于油菜籽种子油改良的CRISPR介导技术:挑战与未来展望。
Front Plant Sci. 2023 Mar 21;14:1086847. doi: 10.3389/fpls.2023.1086847. eCollection 2023.
CirRNAPL:一个基于极限学习机的环状RNA识别网络服务器。
Comput Struct Biotechnol J. 2020 Apr 2;18:834-842. doi: 10.1016/j.csbj.2020.03.028. eCollection 2020.
4
sgRNA-PSM: Predict sgRNAs On-Target Activity Based on Position-Specific Mismatch.sgRNA-PSM:基于位置特异性错配预测sgRNA的靶向活性。
Mol Ther Nucleic Acids. 2020 Jun 5;20:323-330. doi: 10.1016/j.omtn.2020.01.029. Epub 2020 Jan 31.
5
C-RNNCrispr: Prediction of CRISPR/Cas9 sgRNA activity using convolutional and recurrent neural networks.C-RNNCrispr:使用卷积神经网络和循环神经网络预测CRISPR/Cas9 sgRNA活性。
Comput Struct Biotechnol J. 2020 Feb 12;18:344-354. doi: 10.1016/j.csbj.2020.01.013. eCollection 2020.
6
High-Throughput CRISPR/Cas9 Mutagenesis Streamlines Trait Gene Identification in Maize.高通量 CRISPR/Cas9 诱变技术简化了玉米性状基因的鉴定。
Plant Cell. 2020 May;32(5):1397-1413. doi: 10.1105/tpc.19.00934. Epub 2020 Feb 25.
7
SSizer: Determining the Sample Sufficiency for Comparative Biological Study.SSizer:确定比较生物学研究的样本充足性。
J Mol Biol. 2020 May 15;432(11):3411-3421. doi: 10.1016/j.jmb.2020.01.027. Epub 2020 Feb 7.
8
iEnhancer-ECNN: identifying enhancers and their strength using ensembles of convolutional neural networks.iEnhancer-ECNN:使用卷积神经网络的集合来识别增强子及其强度。
BMC Genomics. 2019 Dec 24;20(Suppl 9):951. doi: 10.1186/s12864-019-6336-3.
9
Computational and Biological Methods for Gene Therapy.基因治疗的计算与生物学方法
Curr Gene Ther. 2019;19(4):210. doi: 10.2174/156652321904191022113307.
10
Therapeutic target database 2020: enriched resource for facilitating research and early development of targeted therapeutics.治疗靶点数据库 2020 年版:一个丰富的资源,有助于靶向治疗的研究和早期开发。
Nucleic Acids Res. 2020 Jan 8;48(D1):D1031-D1041. doi: 10.1093/nar/gkz981.