Suppr超能文献

使用条形码序列比对和统计分析(Barcas)工具进行全基因组功能分析。

Genome-wide functional analysis using the barcode sequence alignment and statistical analysis (Barcas) tool.

作者信息

Mun Jihyeob, Kim Dong-Uk, Hoe Kwang-Lae, Kim Seon-Young

机构信息

Korea Research Institute of Bioscience and Biotechnology (KRIBB), Personalized Genomic Medicine Research Center, Daejeon, Republic of Korea.

Department of Functional Genomics, University of Science and Technology, Daejeon, Republic of Korea.

出版信息

BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):475. doi: 10.1186/s12859-016-1326-9.

Abstract

BACKGROUND

Pooled library screen analysis using shRNAs or CRISPR-Cas9 hold great promise to genome-wide functional studies. While pooled library screens are effective tools, erroneous barcodes can potentially be generated during the production of many barcodes. However, no current tools can distinguish erroneous barcodes from PCR or sequencing errors in a data preprocessing step.

RESULTS

We developed the Barcas program, a specialized program for the mapping and analysis of multiplexed barcode sequencing (barcode-seq) data. For fast and efficient mapping, Barcas uses a trie data structure based imperfect matching algorithm which generates precise mapping results containing mismatches, shifts, insertions and deletions (indel) in a flexible manner. Barcas provides three functions for quality control (QC) of a barcode library and distinguishes erroneous barcodes from PCR or sequencing errors. It also provides useful functions for data analysis and visualization.

CONCLUSIONS

Barcas is an all-in-one package providing useful functions including mapping, data QC, library QC, statistical analysis and visualization in genome-wide pooled screens.

摘要

背景

使用短发夹RNA(shRNAs)或CRISPR-Cas9进行的混合文库筛选分析在全基因组功能研究中具有巨大潜力。虽然混合文库筛选是有效的工具,但在生成众多条形码的过程中可能会产生错误的条形码。然而,目前没有工具能够在数据预处理步骤中区分由PCR或测序错误导致的错误条形码。

结果

我们开发了Barcas程序,这是一个专门用于多重条形码测序(barcode-seq)数据映射和分析的程序。为了实现快速高效的映射,Barcas使用基于前缀树数据结构的不完全匹配算法,该算法能够灵活地生成包含错配、移位、插入和缺失(indel)的精确映射结果。Barcas为条形码文库的质量控制(QC)提供了三种功能,并能区分错误的条形码与PCR或测序错误。它还为数据分析和可视化提供了有用的功能。

结论

Barcas是一个一体化软件包,在全基因组混合筛选中提供包括映射、数据QC、文库QC、统计分析和可视化等有用功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0212/5260075/f9d0c76cd99d/12859_2016_1326_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验