• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

sgRNA-PSM:基于位置特异性错配预测sgRNA的靶向活性。

sgRNA-PSM: Predict sgRNAs On-Target Activity Based on Position-Specific Mismatch.

作者信息

Liu Bin, Luo Zhihua, He Juan

机构信息

School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China; Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing, China.

Affiliated Shenzhen Maternity & Child Healthcare Hospital, Southern Medical University, Shenzhen, Guangdong, China.

出版信息

Mol Ther Nucleic Acids. 2020 Jun 5;20:323-330. doi: 10.1016/j.omtn.2020.01.029. Epub 2020 Jan 31.

DOI:10.1016/j.omtn.2020.01.029
PMID:32199128
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7083770/
Abstract

As a key technique for the CRISPR-Cas9 system, identification of single-guide RNAs (sgRNAs) on-target activity is critical for both theoretical research (investigation of RNA functions) and real-world applications (genome editing and synthetic biology). Because of its importance, several computational predictors have been proposed to predict sgRNAs on-target activity. All of these methods have clearly contributed to the developments of this very important field. However, they are suffering from certain limitations. We proposed two new methods called "sgRNA-PSM" and "sgRNA-ExPSM" for sgRNAs on-target activity prediction via capturing the long-range sequence information and evolutionary information using a new way to reduce the dimension of the feature vector to avoid the risk of overfitting. Rigorous leave-one-gene-out cross-validation on a benchmark dataset with 11 human genes and 6 mouse genes, as well as an independent dataset, indicated that the two new methods outperformed other competing methods. To make it easier for users to use the proposed sgRNA-PSM predictor, we have established a corresponding web server, which is available at http://bliulab.net/sgRNA-PSM/.

摘要

作为CRISPR-Cas9系统的一项关键技术,单导向RNA(sgRNA)靶向活性的鉴定对于理论研究(RNA功能研究)和实际应用(基因组编辑与合成生物学)都至关重要。鉴于其重要性,人们已经提出了几种计算预测方法来预测sgRNA的靶向活性。所有这些方法都为这个非常重要的领域的发展做出了明确贡献。然而,它们存在一定的局限性。我们提出了两种名为“sgRNA-PSM”和“sgRNA-ExPSM”的新方法,通过以一种新的方式捕捉长程序列信息和进化信息来减少特征向量的维度,从而避免过拟合风险,以预测sgRNA的靶向活性。在一个包含11个人类基因和6个小鼠基因的基准数据集以及一个独立数据集上进行的严格留一基因交叉验证表明,这两种新方法优于其他竞争方法。为了方便用户使用所提出的sgRNA-PSM预测器,我们建立了一个相应的网络服务器,可在http://bliulab.net/sgRNA-PSM/上访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/363eb2681ef8/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/c27a0ce24245/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/e80ae9a7891d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/2fa764d22d8a/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/363eb2681ef8/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/c27a0ce24245/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/e80ae9a7891d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/2fa764d22d8a/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf8b/7083770/363eb2681ef8/gr4.jpg

相似文献

1
sgRNA-PSM: Predict sgRNAs On-Target Activity Based on Position-Specific Mismatch.sgRNA-PSM:基于位置特异性错配预测sgRNA的靶向活性。
Mol Ther Nucleic Acids. 2020 Jun 5;20:323-330. doi: 10.1016/j.omtn.2020.01.029. Epub 2020 Jan 31.
2
sgRNA-2wPSM: Identify sgRNAs on-target activity by combining two-window-based position specific mismatch and synthetic minority oversampling technique.sgRNA-2wPSM:通过结合基于双窗口的位置特异性错配和合成少数类过采样技术来识别sgRNA的靶向活性。
Comput Biol Med. 2023 Mar;155:106489. doi: 10.1016/j.compbiomed.2022.106489. Epub 2022 Dec 29.
3
SgRNA-RF: Identification of SgRNA On-Target Activity With Imbalanced Datasets.SgRNA-RF:利用不平衡数据集识别 sgRNA 的靶标活性。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2442-2453. doi: 10.1109/TCBB.2021.3079116. Epub 2022 Aug 8.
4
Selection of highly efficient sgRNAs for CRISPR/Cas9-based plant genome editing.用于基于CRISPR/Cas9的植物基因组编辑的高效sgRNA的筛选
Sci Rep. 2016 Feb 19;6:21451. doi: 10.1038/srep21451.
5
ExsgRNA: reduce off-target efficiency by on-target mismatched sgRNA.ExsgRNA:通过与靶标错配的 sgRNA 降低脱靶效率。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac183.
6
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm.通过一种高效的计算算法对 CRISPR Cas9 sgRNA 脱靶同源物进行全基因组分析。
BMC Genomics. 2017 Nov 17;18(Suppl 9):826. doi: 10.1186/s12864-017-4225-1.
7
CRISPy-web: An online resource to design sgRNAs for CRISPR applications.CRISPy-web:一个用于设计CRISPR应用中sgRNA的在线资源。
Synth Syst Biotechnol. 2016 Feb 12;1(2):118-121. doi: 10.1016/j.synbio.2016.01.003. eCollection 2016 Jun.
8
CRISPRlnc: a machine learning method for lncRNA-specific single-guide RNA design of CRISPR/Cas9 system.CRISPRlnc:一种用于 CRISPR/Cas9 系统的长非编码 RNA 特异性单指导 RNA 设计的机器学习方法。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae066.
9
Improved sgRNA design in bacteria via genome-wide activity profiling.通过全基因组活性分析提高细菌中的 sgRNA 设计。
Nucleic Acids Res. 2018 Aug 21;46(14):7052-7069. doi: 10.1093/nar/gky572.
10
Prediction of CRISPR sgRNA Activity Using a Deep Convolutional Neural Network.利用深度卷积神经网络预测 CRISPR sgRNA 活性。
J Chem Inf Model. 2019 Jan 28;59(1):615-624. doi: 10.1021/acs.jcim.8b00368. Epub 2018 Dec 7.

引用本文的文献

1
A systematic mapping study on machine learning techniques for the prediction of CRISPR/Cas9 sgRNA target cleavage.一项关于用于预测CRISPR/Cas9 sgRNA靶点切割的机器学习技术的系统映射研究。
Comput Struct Biotechnol J. 2022 Oct 21;20:5813-5823. doi: 10.1016/j.csbj.2022.10.013. eCollection 2022.
2
Designing libraries for pooled CRISPR functional screens of long noncoding RNAs.设计用于长非编码 RNA pooled CRISPR 功能筛选的文库。
Mamm Genome. 2022 Jun;33(2):312-327. doi: 10.1007/s00335-021-09918-9. Epub 2021 Sep 17.
3
Accurate identification of RNA D modification using multiple features.

本文引用的文献

1
Fold-LTR-TCP: protein fold recognition based on triadic closure principle.Fold-LTR-TCP:基于三元闭合原理的蛋白质折叠识别。
Brief Bioinform. 2020 Dec 1;21(6):2185-2193. doi: 10.1093/bib/bbz139.
2
MotifCNN-fold: protein fold recognition based on fold-specific features extracted by motif-based convolutional neural networks.MotifCNN-fold:基于基于模体的卷积神经网络提取的折叠特异特征的蛋白质折叠识别。
Brief Bioinform. 2020 Dec 1;21(6):2133-2141. doi: 10.1093/bib/bbz133.
3
A novel molecular representation with BiGRU neural networks for learning atom.
使用多种特征准确识别 RNA D 修饰。
RNA Biol. 2021 Dec;18(12):2236-2246. doi: 10.1080/15476286.2021.1898160. Epub 2021 Mar 17.
4
sgRNACNN: identifying sgRNA on-target activity in four crops using ensembles of convolutional neural networks.sgRNA CNN:使用卷积神经网络集合鉴定四种作物中的 sgRNA 靶标活性。
Plant Mol Biol. 2021 Mar;105(4-5):483-495. doi: 10.1007/s11103-020-01102-y. Epub 2021 Jan 1.
5
An Interpretable Prediction Model for Identifying N-Methylguanosine Sites Based on XGBoost and SHAP.一种基于XGBoost和SHAP的用于识别N-甲基鸟苷位点的可解释预测模型。
Mol Ther Nucleic Acids. 2020 Aug 25;22:362-372. doi: 10.1016/j.omtn.2020.08.022. eCollection 2020 Dec 4.
6
Prediction of G Protein-Coupled Receptors With CTDC Extraction and MRMD2.0 Dimension-Reduction Methods.基于CTDC提取和MRMD2.0降维方法的G蛋白偶联受体预测
Front Bioeng Biotechnol. 2020 Jun 25;8:635. doi: 10.3389/fbioe.2020.00635. eCollection 2020.
用于学习原子的 BiGRU 神经网络的新型分子表示。
Brief Bioinform. 2020 Dec 1;21(6):2099-2111. doi: 10.1093/bib/bbz125.
4
DeepSVM-fold: protein fold recognition by combining support vector machines and pairwise sequence similarity scores generated by deep learning networks.DeepSVM-fold:通过结合支持向量机和深度学习网络生成的成对序列相似性得分来进行蛋白质折叠识别。
Brief Bioinform. 2020 Sep 25;21(5):1733-1741. doi: 10.1093/bib/bbz098.
5
Identifying enhancer-promoter interactions with neural network based on pre-trained DNA vectors and attention mechanism.基于预训练 DNA 向量和注意力机制的神经网络识别增强子-启动子相互作用。
Bioinformatics. 2020 Feb 15;36(4):1037-1043. doi: 10.1093/bioinformatics/btz694.
6
A Consensus Community-Based Particle Swarm Optimization for Dynamic Community Detection.基于共识的粒子群优化的动态社区发现算法。
IEEE Trans Cybern. 2020 Jun;50(6):2502-2513. doi: 10.1109/TCYB.2019.2938895. Epub 2019 Sep 23.
7
iPromoter-2L2.0: Identifying Promoters and Their Types by Combining Smoothing Cutting Window Algorithm and Sequence-Based Features.iPromoter-2L2.0:结合平滑切割窗口算法和基于序列的特征识别启动子及其类型
Mol Ther Nucleic Acids. 2019 Dec 6;18:80-87. doi: 10.1016/j.omtn.2019.08.008. Epub 2019 Aug 14.
8
BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.BioSeq-Analysis2.0:一个基于机器学习方法的更新平台,用于在序列水平和残基水平上分析 DNA、RNA 和蛋白质序列。
Nucleic Acids Res. 2019 Nov 18;47(20):e127. doi: 10.1093/nar/gkz740.
9
Deep Collaborative Filtering for Prediction of Disease Genes.深度协同过滤在疾病基因预测中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Sep-Oct;17(5):1639-1647. doi: 10.1109/TCBB.2019.2907536. Epub 2019 Mar 26.
10
Improved Pre-miRNAs Identification Through Mutual Information of Pre-miRNA Sequences and Structures.通过前体miRNA序列和结构的互信息改进前体miRNA识别
Front Genet. 2019 Feb 25;10:119. doi: 10.3389/fgene.2019.00119. eCollection 2019.