Hwang Gue-Ho, Yu Jihyeon, Yang Soyeon, Son Woo Jae, Lim Kayeong, Kim Heon Seok, Kim Jin-Soo, Bae Sangsu
Department of Chemistry, Hanyang University, Seoul 04763, South Korea.
Research Institute for Convergence of Basic Sciences, Hanyang University, Seoul 04763, South Korea.
Comput Struct Biotechnol J. 2020 Jun 25;18:1686-1694. doi: 10.1016/j.csbj.2020.06.026. eCollection 2020.
CRISPR-Cas9 induces DNA cleavages at desired target sites in a guide RNA-dependent manner; DNA editing occurs through the resulting activity of DNA repair processes including non-homologous end joining (NHEJ), which is dominant in mammalian cells. NHEJ repair frequently causes small insertions and deletions (indels) near DNA cleavage sites but only rarely causes nucleotide substitutions. High-throughput sequencing is the primary means of assessing indel and substitution frequencies in bulk populations of cells in the gene editing field. However, it is difficult to detect bona fide substitutions, which are embedded among experimentally-induced substitution errors, in high-throughput sequencing data. Here, we developed a novel analysis method, named CRISPR-Sub, to statistically detect Cas9-mediated substitutions in high-throughput sequencing data by comparing Mock- and CRISPR-treated samples. We first pinpointed 'hotspot positions' in target sequences at which substitution mutations were quantitatively observed much more often (p > 0.001) in CRISPR- versus Mock-treated samples. We refer to the substitution mutations in defined hotspot positions as 'apparent substitutions' and ultimately calculated 'apparent substitution frequencies' for each target. By examining 51 endogenous target sites in HeLa cells, we found that the average apparent substitution frequency was 0.8% in all queries, that apparent substitutions frequently occur near CRISPR-Cas9 cleavage sites, and that nucleotide conversion showed no meaningful nucleotide preference patterns. Furthermore, we generated NHEJ-inhibited cell lines ( ) by knockout of the gene encoding ligase IV and found that the apparent substitution frequencies were significantly decreased in cells, strongly suggesting that DNA substitutions are generated by the NHEJ pathway.
CRISPR-Cas9以一种依赖向导RNA的方式在所需靶位点诱导DNA切割;DNA编辑通过包括非同源末端连接(NHEJ)在内的DNA修复过程的活性产生,NHEJ在哺乳动物细胞中占主导地位。NHEJ修复经常在DNA切割位点附近导致小的插入和缺失(indel),但很少导致核苷酸替换。高通量测序是评估基因编辑领域大量细胞群体中indel和替换频率的主要手段。然而,在高通量测序数据中很难检测到真正的替换,这些替换夹杂在实验诱导的替换错误中。在这里,我们开发了一种名为CRISPR-Sub的新型分析方法,通过比较Mock处理和CRISPR处理的样本,在高通量测序数据中统计检测Cas9介导的替换。我们首先在靶序列中确定“热点位置”,在这些位置,与Mock处理的样本相比,CRISPR处理的样本中定量观察到的替换突变要频繁得多(p>0.001)。我们将定义热点位置中的替换突变称为“明显替换”,并最终计算每个靶标的“明显替换频率”。通过检查HeLa细胞中的51个内源性靶位点,我们发现所有查询中的平均明显替换频率为0.8%,明显替换经常发生在CRISPR-Cas9切割位点附近,并且核苷酸转换没有显示出有意义的核苷酸偏好模式。此外,我们通过敲除编码连接酶IV的基因生成了NHEJ抑制细胞系( ),并发现 细胞中的明显替换频率显著降低,这强烈表明DNA替换是由NHEJ途径产生的。