Baumgarten Sebastian, Bryant Jessica
Plasmodium RNA Biology Group, Pasteur Institute, Paris, Paris, 75015, France.
Biology of Host-Parasite Interactions Unit, Pasteur Institute, Paris, Paris, 75015, France.
Open Res Eur. 2022 Sep 15;2:75. doi: 10.12688/openreseurope.14836.2. eCollection 2022.
The maintenance, regulation, and dynamics of heterochromatin in the human malaria parasite, has drawn increasing attention due to its regulatory role in mutually exclusive virulence gene expression and the silencing of key developmental regulators. The advent of genome-wide analyses such as chromatin-immunoprecipitation followed by sequencing (ChIP-seq) has been instrumental in understanding chromatin composition; however, even in model organisms, ChIP-seq experiments are susceptible to intrinsic experimental biases arising from underlying chromatin structure. We performed a control ChIP-seq experiment, re-analyzed previously published ChIP-seq datasets and compared different analysis approaches to characterize biases of genome-wide analyses in . We found that heterochromatic regions in input control samples used for ChIP-seq normalization are systematically underrepresented in regard to sequencing coverage across the genome. This underrepresentation, in combination with a non-specific or inefficient immunoprecipitation, can lead to the identification of false enrichment and peaks across these regions. We observed that such biases can also be seen at background levels in specific and efficient ChIP-seq experiments. We further report on how different read mapping approaches can also skew sequencing coverage within highly similar subtelomeric regions and virulence gene families. To ameliorate these issues, we discuss orthogonal methods that can be used to characterize chromatin-associated proteins. Our results highlight the impact of chromatin structure on genome-wide analyses in the parasite and the need for caution when characterizing chromatin-associated proteins and features.
人类疟原虫中异染色质的维持、调控及动态变化,因其在互斥毒力基因表达和关键发育调控因子沉默中的调节作用而受到越来越多的关注。诸如染色质免疫沉淀测序(ChIP-seq)等全基因组分析方法的出现,有助于理解染色质组成;然而,即使在模式生物中,ChIP-seq实验也容易受到由潜在染色质结构引起的内在实验偏差的影响。我们进行了一项对照ChIP-seq实验,重新分析了先前发表的ChIP-seq数据集,并比较了不同的分析方法,以表征全基因组分析中的偏差。我们发现,用于ChIP-seq标准化的输入对照样本中的异染色质区域,在全基因组测序覆盖方面系统性地代表性不足。这种代表性不足,再加上非特异性或低效的免疫沉淀,可能导致在这些区域识别出假阳性富集和峰。我们观察到,在特定且高效的ChIP-seq实验的背景水平上也能看到这种偏差。我们还报告了不同的读段映射方法如何在高度相似的亚端粒区域和毒力基因家族中扭曲测序覆盖。为了改善这些问题,我们讨论了可用于表征染色质相关蛋白的正交方法。我们的结果突出了染色质结构对疟原虫全基因组分析的影响,以及在表征染色质相关蛋白和特征时需要谨慎行事。