• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习方法识别 CRISPR/Cas9 活性预测新特征的重要性。

A Machine Learning Approach to Identify the Importance of Novel Features for CRISPR/Cas9 Activity Prediction.

机构信息

Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110016, India.

Yardi School of Artificial Intelligence, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110016, India.

出版信息

Biomolecules. 2022 Aug 16;12(8):1123. doi: 10.3390/biom12081123.

DOI:10.3390/biom12081123
PMID:36009017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9405635/
Abstract

The reprogrammable CRISPR/Cas9 genome editing tool's growing popularity is hindered by unwanted off-target effects. Efforts have been directed toward designing efficient guide RNAs as well as identifying potential off-target threats, yet factors that determine efficiency and off-target activity remain obscure. Based on sequence features, previous machine learning models performed poorly on new datasets, thus there is a need for the incorporation of novel features. The binding energy estimation of the gRNA-DNA hybrid as well as the Cas9-gRNA-DNA hybrid allowed generating better performing machine learning models for the prediction of Cas9 activity. The analysis of feature contribution towards the model output on a limited dataset indicated that energy features played a determining role along with the sequence features. The binding energy features proved essential for the prediction of on-target activity and off-target sites. The plateau, in the performance on unseen datasets, of current machine learning models could be overcome by incorporating novel features, such as binding energy, among others. The models are provided on GitHub (GitHub Inc., San Francisco, CA, USA).

摘要

可重编程的 CRISPR/Cas9 基因组编辑工具的日益普及受到了脱靶效应的阻碍。人们一直在努力设计高效的向导 RNA,以及识别潜在的脱靶威胁,但决定效率和脱靶活性的因素仍然不清楚。基于序列特征,以前的机器学习模型在新数据集上表现不佳,因此需要结合新的特征。gRNA-DNA 杂交体和 Cas9-gRNA-DNA 杂交体的结合能估计可以生成性能更好的机器学习模型,用于预测 Cas9 的活性。在有限的数据集上对特征对模型输出的贡献进行分析表明,能量特征与序列特征一样起着决定性的作用。结合能特征对于预测靶活性和脱靶位点是必不可少的。通过结合新的特征,如结合能等,可以克服当前机器学习模型在未见数据集上的性能平台期。模型已在 GitHub(美国加利福尼亚州旧金山的 GitHub Inc.)上提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/a4cd2a79426a/biomolecules-12-01123-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/685926299cf6/biomolecules-12-01123-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/80f9819537b7/biomolecules-12-01123-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/71a86416431c/biomolecules-12-01123-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/0b746c768f6a/biomolecules-12-01123-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/c4554f427edb/biomolecules-12-01123-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/a4cd2a79426a/biomolecules-12-01123-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/685926299cf6/biomolecules-12-01123-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/80f9819537b7/biomolecules-12-01123-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/71a86416431c/biomolecules-12-01123-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/0b746c768f6a/biomolecules-12-01123-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/c4554f427edb/biomolecules-12-01123-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c77b/9405635/a4cd2a79426a/biomolecules-12-01123-g006.jpg

相似文献

1
A Machine Learning Approach to Identify the Importance of Novel Features for CRISPR/Cas9 Activity Prediction.机器学习方法识别 CRISPR/Cas9 活性预测新特征的重要性。
Biomolecules. 2022 Aug 16;12(8):1123. doi: 10.3390/biom12081123.
2
DeepCRISTL: deep transfer learning to predict CRISPR/Cas9 functional and endogenous on-target editing efficiency.DeepCRISTL:深度迁移学习预测 CRISPR/Cas9 功能和内源性靶标编辑效率。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i161-i168. doi: 10.1093/bioinformatics/btac218.
3
CRISPR-Cas9 off-targeting assessment with nucleic acid duplex energy parameters.CRISPR-Cas9 脱靶评估与核酸双链能参数。
Genome Biol. 2018 Oct 26;19(1):177. doi: 10.1186/s13059-018-1534-x.
4
DeepCRISTL: deep transfer learning to predict CRISPR/Cas9 on-target editing efficiency in specific cellular contexts.DeepCRISTL:用于在特定细胞环境中预测 CRISPR/Cas9 靶向编辑效率的深度迁移学习。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae481.
5
Machine learning-based prediction models to guide the selection of Cas9 variants for efficient gene editing.基于机器学习的预测模型,用于指导 Cas9 变体的选择,以实现高效的基因编辑。
Cell Rep. 2024 Feb 27;43(2):113765. doi: 10.1016/j.celrep.2024.113765. Epub 2024 Feb 14.
6
CRISPR/Cas9 Guide RNA Design Rules for Predicting Activity.CRISPR/Cas9 引导 RNA 设计规则预测活性。
Methods Mol Biol. 2020;2115:351-364. doi: 10.1007/978-1-0716-0290-4_19.
7
Cas9 Cuts and Consequences; Detecting, Predicting, and Mitigating CRISPR/Cas9 On- and Off-Target Damage: Techniques for Detecting, Predicting, and Mitigating the On- and off-target Effects of Cas9 Editing.Cas9 切割与后果;检测、预测和减轻 CRISPR/Cas9 的脱靶效应:检测、预测和减轻 Cas9 编辑的脱靶效应的技术。
Bioessays. 2020 Sep;42(9):e2000047. doi: 10.1002/bies.202000047. Epub 2020 Jul 9.
8
Hybrid Multitask Learning Reveals Sequence Features Driving Specificity in the CRISPR/Cas9 System.混合多任务学习揭示了 CRISPR/Cas9 系统中特异性驱动序列特征。
Biomolecules. 2023 Apr 3;13(4):641. doi: 10.3390/biom13040641.
9
SeqCor: correct the effect of guide RNA sequences in clustered regularly interspaced short palindromic repeats/Cas9 screening by machine learning algorithm.SeqCor:通过机器学习算法纠正簇状规则间隔短回文重复序列/Cas9 筛选中引导 RNA 序列的影响。
J Genet Genomics. 2020 Nov 20;47(11):672-680. doi: 10.1016/j.jgg.2020.10.007. Epub 2020 Nov 28.
10
An overview and metanalysis of machine and deep learning-based CRISPR gRNA design tools.基于机器和深度学习的 CRISPR gRNA 设计工具的概述和荟萃分析。
RNA Biol. 2020 Jan;17(1):13-22. doi: 10.1080/15476286.2019.1669406. Epub 2019 Sep 27.

引用本文的文献

1
Gene therapy for genetic diseases: challenges and future directions.用于治疗遗传疾病的基因疗法:挑战与未来方向。
MedComm (2020). 2025 Feb 13;6(2):e70091. doi: 10.1002/mco2.70091. eCollection 2025 Feb.
2
Improved Machine Learning-Based Model for the Classification of Off-Targets in the CRISPR/Cpf1 System.基于机器学习的改进模型用于CRISPR/Cpf1系统中脱靶效应的分类
ACS Omega. 2023 Nov 17;8(48):45578-45588. doi: 10.1021/acsomega.3c05691. eCollection 2023 Dec 5.

本文引用的文献

1
Structural basis for mismatch surveillance by CRISPR-Cas9.CRISPR-Cas9 错配监控的结构基础。
Nature. 2022 Mar;603(7900):343-347. doi: 10.1038/s41586-022-04470-1. Epub 2022 Mar 2.
2
Establishing the allosteric mechanism in CRISPR-Cas9.确定CRISPR-Cas9中的变构机制。
Wiley Interdiscip Rev Comput Mol Sci. 2021 May-Jun;11(3). doi: 10.1002/wcms.1503. Epub 2020 Oct 26.
3
Catalytic Mechanism of Non-Target DNA Cleavage in CRISPR-Cas9 Revealed by Molecular Dynamics.分子动力学揭示CRISPR-Cas9中非靶向DNA切割的催化机制
ACS Catal. 2020 Nov 20;10(22):13596-13605. doi: 10.1021/acscatal.0c03566. Epub 2020 Nov 10.
4
From Local Explanations to Global Understanding with Explainable AI for Trees.利用可解释人工智能实现从局部解释到树木的全局理解
Nat Mach Intell. 2020 Jan;2(1):56-67. doi: 10.1038/s42256-019-0138-9. Epub 2020 Jan 17.
5
Evaluation of off-targets predicted by sgRNA design tools.对sgRNA设计工具预测的脱靶效应的评估。
Genomics. 2020 Sep;112(5):3609-3614. doi: 10.1016/j.ygeno.2020.04.024. Epub 2020 Apr 27.
6
Spontaneous Embedding of DNA Mismatches Within the RNA:DNA Hybrid of CRISPR-Cas9.DNA错配在CRISPR-Cas9的RNA:DNA杂交体中的自发嵌入
Front Mol Biosci. 2020 Mar 17;7:39. doi: 10.3389/fmolb.2020.00039. eCollection 2020.
7
Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature.利用注意力增强深度学习和基于网络的基因特征预测 CRISPR-Cas 系统的脱靶特异性和细胞特异性适应性。
PLoS Comput Biol. 2019 Oct 28;15(10):e1007480. doi: 10.1371/journal.pcbi.1007480. eCollection 2019 Oct.
8
An overview and metanalysis of machine and deep learning-based CRISPR gRNA design tools.基于机器和深度学习的 CRISPR gRNA 设计工具的概述和荟萃分析。
RNA Biol. 2020 Jan;17(1):13-22. doi: 10.1080/15476286.2019.1669406. Epub 2019 Sep 27.
9
Exploring alternative catalytic mechanisms of the Cas9 HNH domain.探索 Cas9 HNH 结构域的替代催化机制。
Proteins. 2020 Feb;88(2):260-264. doi: 10.1002/prot.25796. Epub 2019 Sep 6.
10
Prediction of activity and specificity of CRISPR-Cpf1 using convolutional deep learning neural networks.基于卷积深度学习神经网络的 CRISPR-Cpf1 活性和特异性预测。
BMC Bioinformatics. 2019 Jun 13;20(1):332. doi: 10.1186/s12859-019-2939-6.