Wang Qingbo, Ui-Tei Kumiko
Department of Bioinformatics and Systems Biology, Faculty of Science, The University of Tokyo, Tokyo, Japan.
Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan.
Methods Mol Biol. 2017;1630:43-53. doi: 10.1007/978-1-4939-7128-2_4.
The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated (Cas) system is a prominent genome engineering technology. In the CRISPR/Cas system, the RNA-guided endonuclease Cas protein introduces a DNA double-stranded break at the genome position recognized by a guide RNA (gRNA) based on complementary base-pairing of about 20-nucleotides in length. The 8- or 12-mer gRNA sequence in the proximal region is especially important for target recognition, and the genes with sequence complementarity to such regions are often disrupted. To carry out target site-specific genome editing, we released the CRISPRdirect ( http://crispr.dbcls.jp /) website. This website allows us to select target site-specific gRNA sequences by performing exhaustive searches against entire genomic sequences. In this study, target site-specific gRNA sequences were designed for human, mouse, Drosophila melanogaster, and Caenorhabditis elegans. The calculation results revealed that at least five gRNA sequences, each of them having only one perfectly complementary site in the whole genome, could be designed for more than 95% of genes, regardless of the organism. Next, among those gRNAs, we selected gRNAs that did not have any other complementary site to the unique 12-mer proximal sequences to avoid possible off-target effects. This computational prediction revealed that target site-specific gRNAs are selectable for the majority of genes in D. melanogaster and C. elegans. However, for >50% of genes in humans and mice, there are no target sites without possible off-target effects.
成簇规律间隔短回文重复序列(CRISPR)/CRISPR相关蛋白(Cas)系统是一项卓越的基因组工程技术。在CRISPR/Cas系统中,RNA引导的核酸内切酶Cas蛋白会在基因组位置引入DNA双链断裂,该位置由引导RNA(gRNA)基于长度约为20个核苷酸的互补碱基配对识别。近端区域8或12聚体的gRNA序列对于靶标识别尤为重要,与这些区域具有序列互补性的基因常被破坏。为了进行靶位点特异性基因组编辑,我们发布了CRISPRdirect(http://crispr.dbcls.jp/)网站。该网站使我们能够通过对整个基因组序列进行详尽搜索来选择靶位点特异性gRNA序列。在本研究中,针对人类、小鼠、黑腹果蝇和秀丽隐杆线虫设计了靶位点特异性gRNA序列。计算结果表明,无论生物体如何,超过95%的基因都可以设计出至少五个gRNA序列,每个序列在整个基因组中仅有一个完全互补位点。接下来,在这些gRNA中,我们选择了与独特的12聚体近端序列没有任何其他互补位点的gRNA,以避免可能的脱靶效应。这种计算预测表明,对于黑腹果蝇和秀丽隐杆线虫中的大多数基因,都可以选择靶位点特异性gRNA。然而,对于人类和小鼠中超过50%的基因,不存在没有可能脱靶效应的靶位点。