Alipanahi Roghayyeh, Safari Leila, Khanteymoori Alireza
Department of Computer Engineering, University of Zanjan, Zanjan, Iran.
Department of Psychology, University of Freiburg, Freiburg, Germany.
Mol Ther Nucleic Acids. 2024 Oct 28;35(4):102370. doi: 10.1016/j.omtn.2024.102370. eCollection 2024 Dec 10.
Prime editors are CRISPR-based genome engineering tools with significant potential for rectifying patient mutations. However, their usage requires experimental optimization of the prime editing guide RNA (PegRNA) to achieve high editing efficiency. This paper introduces the deep transformer-based model for predicting prime editing efficiency (DTMP-Prime), a tool specifically designed to predict PegRNA activity and prime editing (PE) efficiency. DTMP-Prime facilitates the design of appropriate PegRNA and ngRNA. A transformer-based model was constructed to scrutinize a wide-ranging set of PE data, enabling the extraction of effective features of PegRNAs and target DNA sequences. The integration of these features with the proposed encoding strategy and DNABERT-based embedding has notably improved the predictive capabilities of DTMP-Prime for off-target sites. Moreover, DTMP-Prime is a promising tool for precisely predicting off-target sites in CRISPR experiments. The integration of a multi-head attention framework has additionally improved the precision and generalizability of DTMP-Prime across various PE models and cell lines. Evaluation results based on the Pearson and Spearman correlation coefficient demonstrate that DTMP-Prime outperforms other state-of-the-art models in predicting the efficiency and outcomes of PE experiments.
引导编辑是基于CRISPR的基因组工程工具,在纠正患者突变方面具有巨大潜力。然而,其应用需要对引导编辑向导RNA(PegRNA)进行实验优化,以实现高编辑效率。本文介绍了基于深度变压器的引导编辑效率预测模型(DTMP-Prime),这是一种专门设计用于预测PegRNA活性和引导编辑(PE)效率的工具。DTMP-Prime有助于设计合适的PegRNA和ngRNA。构建了一个基于变压器的模型来仔细研究大量的PE数据,从而能够提取PegRNA和目标DNA序列的有效特征。这些特征与所提出的编码策略以及基于DNABERT的嵌入相结合,显著提高了DTMP-Prime对脱靶位点的预测能力。此外,DTMP-Prime是在CRISPR实验中精确预测脱靶位点的一个有前景的工具。多头注意力框架的整合进一步提高了DTMP-Prime在各种PE模型和细胞系中的精度和通用性。基于皮尔逊和斯皮尔曼相关系数的评估结果表明,DTMP-Prime在预测PE实验的效率和结果方面优于其他最先进的模型。