Cao Yuwei, Patel Lauren, Alcoser Lauren, Mendenhall Eric, Benner Christopher, Heinz Sven, Goren Alon
Department of Medicine, Division of Genomics & Precision Medicine, University of California San Diego, La Jolla, CA.
Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA.
bioRxiv. 2025 Aug 21:2025.08.14.670415. doi: 10.1101/2025.08.14.670415.
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a well-established method for studying the genomic localization of DNA-associated proteins. Yet, while useful, most ChIP-seq protocols include multiple manual steps that can introduce inconsistency and make it burdensome to analyze large sample sets, limiting the inclusion of appropriate replicates and other controls. Although some of these challenges were addressed by incorporation of liquid handling platforms, most of those previous efforts have automated only a subset of the ChIP-seq steps. Further, using automation for efficiently mapping non-histone proteins, such as chromatin regulators, has been challenging. We recently developed a single-pot ChIP-seq protocol. Here we established a liquid handler operation for this protocol and created an end-to-end fully automated version that is scalable from 8 to 96 ChIP-seq reactions. Our single-pot automated (spa-ChIP-seq) protocol requires about 3 days from cross-linked cells to library, and the costs to produce indexed libraries is approximately $70 per sample. We first benchmarked spa-ChIP-seq against manual ChIP-seq that was performed in parallel, showing a nearly indistinguishable signal-to-noise ratio between the two workflows. Next, we used spa-ChIP-seq to systematically evaluate multiple parameters including shearing and crosslinking conditions, buffer compositions, and antibody to cell-number ratios. Our method allowed us to identify optimal conditions for double crosslinked chromatin as well as easily and robustly conduct titrations and screening of different antibodies and reagents, eliminating many laborious and costly steps. We show, for the first time to our knowledge, that the effect of the ratio of antibody to cell-number is most pronounced in detecting weak genomic localization signals. In particular, while strong signal seems to be unimpacted by low antibody to cell-number ratio, weaker signals are sensitive to this ratio. We note the importance of maintaining a consistent antibody to cell-number ratio, especially when conducting comparative studies, e.g., between treatments such as small molecules, or individuals such as for chromatin-QTL mapping. Our automated ChIP-seq protocol is publicly available, including specific deck setups, software files and parameters. Lastly, we envision that our robust, cost-efficient protocol can advance research via multiple fronts, e.g., by (i) allowing the scaling up the number of replicates and conditions tested, (ii) improving quantification precision when using spike-in normalization in ChIP-seq experiments, (iii) enabling core facilities to provide high-throughput ChIP-seq as a service and (iv) being incorporated into antibody evaluation procedures, compound screening, population genomics and diagnostic frameworks.
染色质免疫沉淀测序(ChIP-seq)是一种成熟的研究与DNA相关蛋白基因组定位的方法。然而,尽管ChIP-seq很有用,但大多数ChIP-seq方案包含多个手动步骤,这些步骤可能会引入不一致性,并且分析大量样本集时会很繁琐,限制了适当重复样本和其他对照的纳入。尽管通过引入液体处理平台解决了其中一些挑战,但之前的大多数努力仅自动化了ChIP-seq步骤的一个子集。此外,使用自动化有效地绘制非组蛋白(如染色质调节因子)的图谱一直具有挑战性。我们最近开发了一种单管ChIP-seq方案。在此,我们为该方案建立了液体处理操作,并创建了一个端到端的全自动版本,可扩展至8至96个ChIP-seq反应。我们的单管自动化(spa-ChIP-seq)方案从交联细胞到文库大约需要3天,生成索引文库的成本约为每个样本70美元。我们首先将spa-ChIP-seq与并行进行的手动ChIP-seq进行了基准测试,结果表明两种工作流程之间的信噪比几乎没有差别。接下来,我们使用spa-ChIP-seq系统地评估了多个参数,包括剪切和交联条件、缓冲液组成以及抗体与细胞数量的比例。我们的方法使我们能够确定双交联染色质的最佳条件,并轻松且稳健地进行不同抗体和试剂的滴定和筛选,省去了许多费力且昂贵的步骤。据我们所知,我们首次表明抗体与细胞数量的比例在检测弱基因组定位信号时影响最为显著。特别是,虽然强信号似乎不受低抗体与细胞数量比例影响,但较弱信号对此比例敏感。我们指出保持一致的抗体与细胞数量比例的重要性,尤其是在进行比较研究时,例如在小分子等处理之间,或在染色质数量性状位点(chromatin-QTL)图谱绘制等个体之间。我们的自动化ChIP-seq方案已公开可用,包括特定的平台设置、软件文件和参数。最后,我们设想我们强大且经济高效的方案可以通过多个方面推动研究,例如:(i)允许扩大测试的重复样本数量和条件;(ii)在ChIP-seq实验中使用掺入归一化时提高定量精度;(iii)使核心设施能够提供高通量ChIP-seq服务;(iv)被纳入抗体评估程序、化合物筛选、群体基因组学和诊断框架。