Zhao Lili, Anderson Mark T, Wu Weisheng, T Mobley Harry L, Bachman Michael A
Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, USA.
Department of Microbiology and Immunology, School of medicine, University of Michigan, Ann Arbor, USA.
BMC Bioinformatics. 2017 Jul 6;18(1):326. doi: 10.1186/s12859-017-1745-2.
Tn-Seq is a high throughput technique for analysis of transposon mutant libraries to determine conditional essentiality of a gene under an experimental condition. A special feature of the Tn-seq data is that multiple mutants in a gene provides independent evidence to prioritize that gene as being essential. The existing methods do not account for this feature or rely on a high-density transposon library. Moreover, these methods are unable to accommodate complex designs.
The method proposed here is specifically designed for the analysis of Tn-Seq data. It utilizes two steps to estimate the conditional essentiality for each gene in the genome. First, it collects evidence of conditional essentiality for each insertion by comparing read counts of that insertion between conditions. Second, it combines insertion-level evidence for the corresponding gene. It deals with data from both low- and high-density transposon libraries and accommodates complex designs. Moreover, it is very fast to implement. The performance of the proposed method was tested on simulated data and experimental Tn-Seq data from Serratia marcescens transposon mutant library used to identify genes that contribute to fitness in a murine model of infection.
We describe a new, efficient method for identifying conditionally essential genes in Tn-Seq experiments with high detection sensitivity and specificity. It is implemented as TnseqDiff function in R package Tnseq and can be installed from the Comprehensive R Archive Network, CRAN.
转座子测序(Tn-Seq)是一种用于分析转座子突变文库以确定基因在实验条件下的条件必需性的高通量技术。Tn-Seq数据的一个特殊特征是基因中的多个突变提供了独立的证据,将该基因列为必需基因的优先级。现有方法没有考虑到这一特征,或者依赖于高密度转座子文库。此外,这些方法无法适应复杂的设计。
这里提出的方法是专门为分析Tn-Seq数据而设计的。它利用两个步骤来估计基因组中每个基因的条件必需性。首先,通过比较不同条件下该插入位点的读数计数,收集每个插入位点的条件必需性证据。其次,它结合了相应基因的插入水平证据。它处理来自低密度和高密度转座子文库的数据,并适应复杂的设计。此外,它的实现速度非常快。所提出方法的性能在模拟数据和来自粘质沙雷氏菌转座子突变文库的实验Tn-Seq数据上进行了测试,该文库用于鉴定在小鼠感染模型中对适应性有贡献的基因。
我们描述了一种新的、高效的方法,用于在Tn-Seq实验中鉴定条件必需基因,具有高检测灵敏度和特异性。它在R包Tnseq中作为TnseqDiff函数实现,可以从综合R存档网络(CRAN)安装。