Division of Genome Science and Cancer and The Shine-Dalgarno Centre for RNA Innovation, The John Curtin School of Medical Research, College of Health and Medicine, The Australian National University, Canberra, ACT, Australia.
Commonwealth Scientific and Industrial Research (CSIRO) Health and Biosecurity, Adelaide, SA, Australia.
PLoS One. 2023 Oct 17;18(10):e0292924. doi: 10.1371/journal.pone.0292924. eCollection 2023.
Genome editing through the development of CRISPR (Clustered Regularly Interspaced Short Palindromic Repeat)-Cas technology has revolutionized many fields in biology. Beyond Cas9 nucleases, Cas12a (formerly Cpf1) has emerged as a promising alternative to Cas9 for editing AT-rich genomes. Despite the promises, guide RNA efficiency prediction through computational tools search still lacks accuracy. Through a computational meta-analysis, here we report that Cas12a target and off-target cleavage behavior are a factor of nucleotide bias combined with nucleotide mismatches relative to the protospacer adjacent motif (PAM) site. These features helped to train a Random Forest machine learning model to improve the accuracy by at least 15% over existing algorithms to predict guide RNA efficiency for the Cas12a enzyme. Despite the progresses, our report underscores the need for more representative datasets and further benchmarking to reliably and accurately predict guide RNA efficiency and off-target effects for Cas12a enzymes.
通过开发 CRISPR(成簇规律间隔短回文重复)-Cas 技术进行基因组编辑,已经彻底改变了生物学的许多领域。除了 Cas9 核酸酶外,Cas12a(以前称为 Cpf1)已成为 Cas9 编辑富含 AT 基因组的有前途的替代物。尽管有很多承诺,但通过计算工具搜索预测向导 RNA 效率仍然缺乏准确性。通过计算元分析,我们在这里报告称,Cas12a 的靶标和脱靶切割行为是核苷酸偏置与相对于原间隔基序(PAM)位点的核苷酸错配相结合的因素。这些特征有助于训练随机森林机器学习模型,从而将 Cas12a 酶的向导 RNA 效率预测的准确性提高至少 15%,超过现有的算法。尽管取得了这些进展,但我们的报告强调需要更具代表性的数据集和进一步的基准测试,以可靠和准确地预测 Cas12a 酶的向导 RNA 效率和脱靶效应。