Cambridge Institute CRUK, University of Cambridge Cambridge, UK.
Lymphocyte Development, MRC Clinical Sciences Centre, Imperial College London, UK.
Front Genet. 2014 Apr 10;5:75. doi: 10.3389/fgene.2014.00075. eCollection 2014.
With the advent of ChIP-seq multiplexing technologies and the subsequent increase in ChIP-seq throughput, the development of working standards for the quality assessment of ChIP-seq studies has received significant attention. The ENCODE consortium's large scale analysis of transcription factor binding and epigenetic marks as well as concordant work on ChIP-seq by other laboratories has established a new generation of ChIP-seq quality control measures. The use of these metrics alongside common processing steps has however not been evaluated. In this study, we investigate the effects of blacklisting and removal of duplicated reads on established metrics of ChIP-seq quality and show that the interpretation of these metrics is highly dependent on the ChIP-seq preprocessing steps applied. Further to this we perform the first investigation of the use of these metrics for ChIP-exo data and make recommendations for the adaptation of the NSC statistic to allow for the assessment of ChIP-exo efficiency.
随着 ChIP-seq 多重分析技术的出现以及 ChIP-seq 通量的随后增加,用于评估 ChIP-seq 研究质量的工作标准的开发受到了极大关注。ENCODE 联盟对转录因子结合和表观遗传标记的大规模分析以及其他实验室在 ChIP-seq 方面的一致性工作,确立了新一代的 ChIP-seq 质量控制措施。然而,这些指标的使用以及常见的处理步骤尚未得到评估。在这项研究中,我们调查了黑名单和去除重复读取对 ChIP-seq 质量的既定指标的影响,并表明这些指标的解释高度依赖于应用的 ChIP-seq 预处理步骤。此外,我们首次对这些指标在 ChIP-exo 数据中的使用进行了调查,并为 NSC 统计的适应提出了建议,以允许评估 ChIP-exo 的效率。