bcSeq：一个用于高通量 shRNA 和 CRISPR 筛选中快速序列映射的 R 包。

bcSeq: an R package for fast sequence mapping in high-throughput shRNA and CRISPR screens.

机构信息

Biostatistics and Bioinformatics, Duke University Medical Center, Durham, NC, USA.

Duke Cancer Institute, Duke University Medical Center, Durham, NC, USA.

出版信息

Bioinformatics. 2018 Oct 15;34(20):3581-3583. doi: 10.1093/bioinformatics/bty402.

DOI:10.1093/bioinformatics/bty402

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6184561/

Abstract

SUMMARY

CRISPR-Cas9 and shRNA high-throughput sequencing screens have abundant applications for basic and translational research. Methods and tools for the analysis of these screens must properly account for sequencing error, resolve ambiguous mappings among similar sequences in the barcode library in a statistically principled manner, and be computationally efficient. Herein we present bcSeq, an open source R package that implements a fast and parallelized algorithm for mapping high-throughput sequencing reads to a barcode library while tolerating sequencing error. The algorithm uses a Trie data structure for speed and resolves ambiguous mappings by using a statistical sequencing error model based on Phred scores for each read.

AVAILABILITY AND IMPLEMENTATION

The package source code and an accompanying tutorial are available at http://bioconductor.org/packages/bcSeq/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

CRISPR-Cas9 和 shRNA 高通量测序筛选在基础研究和转化研究中有广泛的应用。分析这些筛选的方法和工具必须正确考虑测序错误，以统计上合理的方式解决条形码库中相似序列之间的模糊映射，并具有计算效率。本文介绍了 bcSeq，这是一个开源的 R 包，它实现了一种快速并行的算法，用于将高通量测序读取映射到条形码库，同时容忍测序错误。该算法使用 Trie 数据结构来提高速度，并使用基于每个读取的 Phred 分数的统计测序错误模型来解决模糊映射。

可用性和实现

软件包的源代码和一个附带的教程可在 http://bioconductor.org/packages/bcSeq/ 获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1

bcSeq: an R package for fast sequence mapping in high-throughput shRNA and CRISPR screens.bcSeq：一个用于高通量 shRNA 和 CRISPR 筛选中快速序列映射的 R 包。

Bioinformatics. 2018 Oct 15;34(20):3581-3583. doi: 10.1093/bioinformatics/bty402.

2

Enhancing the throughput and multiplexing capabilities of next generation sequencing for efficient implementation of pooled shRNA and CRISPR screens.提高下一代测序的通量和多重检测能力，以有效地实现汇集 shRNA 和 CRISPR 筛选。

Sci Rep. 2017 Apr 21;7(1):1040. doi: 10.1038/s41598-017-01170-z.

3

Systematic comparison of CRISPR/Cas9 and RNAi screens for essential genes.对用于必需基因的CRISPR/Cas9和RNA干扰筛选进行系统比较。

Nat Biotechnol. 2016 Jun;34(6):634-6. doi: 10.1038/nbt.3567. Epub 2016 May 9.

4

ScreenBEAM: a novel meta-analysis algorithm for functional genomics screens via Bayesian hierarchical modeling.ScreenBEAM：一种通过贝叶斯层次模型进行功能基因组筛选的新型荟萃分析算法。

Bioinformatics. 2016 Jan 15;32(2):260-7. doi: 10.1093/bioinformatics/btv556. Epub 2015 Sep 28.

5

Genome-wide functional analysis using the barcode sequence alignment and statistical analysis (Barcas) tool.使用条形码序列比对和统计分析（Barcas）工具进行全基因组功能分析。

BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):475. doi: 10.1186/s12859-016-1326-9.

6

QuasR: quantification and annotation of short reads in R.QuasR：R语言中短读长的定量与注释

Bioinformatics. 2015 Apr 1;31(7):1130-2. doi: 10.1093/bioinformatics/btu781. Epub 2014 Nov 21.

7

ReCo: automated NGS read-counting of single and combinatorial CRISPR gRNAs.ReCo：用于单靶和组合 CRISPR gRNA 的自动化 NGS 读段计数。

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad448.

8

A Comprehensive Protocol Resource for Performing Pooled shRNA and CRISPR Screens.用于进行汇集式短发夹RNA（shRNA）和CRISPR筛选的综合方案资源

Methods Mol Biol. 2018;1725:201-227. doi: 10.1007/978-1-4939-7568-6_17.

9

TrieDedup: a fast trie-based deduplication algorithm to handle ambiguous bases in high-throughput sequencing.TrieDedup：一种基于 Trie 的快速去重算法，用于处理高通量测序中的模糊碱基。

BMC Bioinformatics. 2024 Apr 18;25(1):154. doi: 10.1186/s12859-024-05775-w.

10

Nubeam-dedup: a fast and RAM-efficient tool to de-duplicate sequencing reads without mapping.Nubeam-dedup：一款快速且节省内存的去重工具，无需进行测序读取映射。

Bioinformatics. 2020 May 1;36(10):3254-3256. doi: 10.1093/bioinformatics/btaa112.

引用本文的文献

1

Cooperative regulation of coupled oncoprotein synthesis and stability in triple-negative breast cancer by EGFR and CDK12/13.EGFR 和 CDK12/13 协同调节三阴性乳腺癌中偶联癌蛋白的合成和稳定性。

Proc Natl Acad Sci U S A. 2023 Sep 19;120(38):e2221448120. doi: 10.1073/pnas.2221448120. Epub 2023 Sep 11.

2

CBF-Beta Mitigates PI3K-Alpha-Specific Inhibitor Killing through PIM1 in PIK3CA-Mutant Gastric Cancer.CBF-β 通过 PIM1 减轻 PI3K-α 特异性抑制剂在 PIK3CA 突变型胃癌中的杀伤作用。

Mol Cancer Res. 2023 Nov 1;21(11):1148-1162. doi: 10.1158/1541-7786.MCR-23-0034.

3

ABL allosteric inhibitors synergize with statins to enhance apoptosis of metastatic lung cancer cells.ABL 变构抑制剂与他汀类药物协同作用增强转移性肺癌细胞的凋亡。

Cell Rep. 2021 Oct 26;37(4):109880. doi: 10.1016/j.celrep.2021.109880.

本文引用的文献

1

Genome-wide functional analysis using the barcode sequence alignment and statistical analysis (Barcas) tool.使用条形码序列比对和统计分析（Barcas）工具进行全基因组功能分析。

BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):475. doi: 10.1186/s12859-016-1326-9.

2

Orchestrating high-throughput genomic analysis with Bioconductor.使用Bioconductor编排高通量基因组分析。

Nat Methods. 2015 Feb;12(2):115-21. doi: 10.1038/nmeth.3252.

3

MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens.MAGeCK能够从全基因组规模的CRISPR/Cas9基因敲除筛选中可靠地鉴定必需基因。

Genome Biol. 2014;15(12):554. doi: 10.1186/s13059-014-0554-4.

4

Genome editing. The new frontier of genome engineering with CRISPR-Cas9.基因组编辑。CRISPR-Cas9 技术引领的基因组工程新前沿。

Science. 2014 Nov 28;346(6213):1258096. doi: 10.1126/science.1258096.

5

edgeR: a versatile tool for the analysis of shRNA-seq and CRISPR-Cas9 genetic screens.edgeR：一种用于分析shRNA测序和CRISPR-Cas9基因筛选的多功能工具。

F1000Res. 2014 Apr 24;3:95. doi: 10.12688/f1000research.3928.2. eCollection 2014.

6

High-throughput RNA interference screening using pooled shRNA libraries and next generation sequencing.高通量 RNA 干扰筛选技术：使用汇集 shRNA 文库和下一代测序。

Genome Biol. 2011 Oct 21;12(10):R104. doi: 10.1186/gb-2011-12-10-r104.

7

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.短DNA序列与人类基因组的超快速且内存高效比对。

Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. Epub 2009 Mar 4.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验