Suppr超能文献

SgRNA-RF:利用不平衡数据集识别 sgRNA 的靶标活性。

SgRNA-RF: Identification of SgRNA On-Target Activity With Imbalanced Datasets.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2442-2453. doi: 10.1109/TCBB.2021.3079116. Epub 2022 Aug 8.

Abstract

Single-guide RNA is a guide RNA (gRNA), which guides the insertion or deletion of uridine residues into kinetoplastid during RNA editing. It is a small non-coding RNA that can be combined with pre -mRNA pairing. SgRNA is a critical component of the CRISPR/Cas9 gene knockout system and play an important role in gene editing and gene regulation. It is important to accurately and quickly identify highly on-target activity sgRNAs. Due to its importance, several computational predictors have been proposed to predict sgRNAs on-target activity. All these methods have clearly contributed to the development of this very important field. However, they also have certain limitations. In the paper, we developed a new classifier SgRNA-RF, which extracts the features of nucleic acid composition and structure of on-target activity sgRNA sequence and identified by random forest algorithm. In addition to solving an imbalanced dataset, this paper proposed a new method called CS-Smote. We compared sgRNA-RF with state-of-the-art predictors on the five datasets, and found SgRNA-RF significantly improved the identification accuracy, with accuracies of 0.8636,0.9161,0.894,0.938,0.965,0.77,0.979,0.973, respectively. The user-friendly web server that implements sgRNA-RF is freely available at http://server.malab.cn/sgRNA-RF/.

摘要

单指导 RNA 是一种指导 RNA(gRNA),它在 RNA 编辑过程中指导尿嘧啶残基插入或缺失动基体。它是一种小的非编码 RNA,可以与前 mRNA 配对结合。sgRNA 是 CRISPR/Cas9 基因敲除系统的关键组成部分,在基因编辑和基因调控中发挥重要作用。准确快速地识别高靶活性 sgRNA 非常重要。由于其重要性,已经提出了几种计算预测器来预测 sgRNA 的靶活性。所有这些方法都明显促进了这个非常重要领域的发展。然而,它们也有一定的局限性。在本文中,我们开发了一种新的分类器 SgRNA-RF,它提取了靶活性 sgRNA 序列的核酸组成和结构特征,并通过随机森林算法进行识别。除了解决不平衡数据集的问题外,本文还提出了一种称为 CS-Smote 的新方法。我们将 sgRNA-RF 与五种数据集上的最新预测器进行了比较,发现 SgRNA-RF 显著提高了识别精度,分别达到了 0.8636、0.9161、0.894、0.938、0.965、0.77、0.979、0.973。实现 sgRNA-RF 的用户友好型网络服务器可在 http://server.malab.cn/sgRNA-RF/ 免费获得。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验