• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChIP-seq 实验中功效计算的统计框架。

A statistical framework for power calculations in ChIP-seq experiments.

机构信息

Department of Statistics, and Department of Biostatistics and Medical Informatics, 1300 University Avenue, Madison, WI 53706, USA.

出版信息

Bioinformatics. 2014 Mar 15;30(6):753-60. doi: 10.1093/bioinformatics/btt200. Epub 2013 May 10.

DOI:10.1093/bioinformatics/btt200
PMID:23665773
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3957067/
Abstract

MOTIVATION

ChIP-seq technology enables investigators to study genome-wide binding of transcription factors and mapping of epigenomic marks. Although the availability of basic analysis tools for ChIP-seq data is rapidly increasing, there has not been much progress on the related design issues. A challenging question for designing a ChIP-seq experiment is how deeply should the ChIP and the control samples be sequenced? The answer depends on multiple factors some of which can be set by the experimenter based on pilot/preliminary data. The sequencing depth of a ChIP-seq experiment is one of the key factors that determine whether all the underlying targets (e.g. binding locations or epigenomic profiles) can be identified with a targeted power.

RESULTS

We developed a statistical framework named CSSP (ChIP-seq Statistical Power) for power calculations in ChIP-seq experiments by considering a local Poisson model, which is commonly adopted by many peak callers. Evaluations with simulations and data-driven computational experiments demonstrate that this framework can reliably estimate the power of a ChIP-seq experiment at different sequencing depths based on pilot data. Furthermore, it provides an analytical approach for calculating the required depth for a targeted power while controlling the false discovery rate at a user-specified level. Hence, our results enable researchers to use their own or publicly available data for determining required sequencing depths of their ChIP-seq experiments and potentially make better use of the multiplexing functionality of the sequencers. Evaluation of power for multiple public ChIP-seq datasets indicate that, currently, typical ChIP-seq studies are powered well for detecting large fold changes of ChIP enrichment over the control sample, but they have considerably less power for detecting smaller fold changes.

AVAILABILITY

Available at www.stat.wisc.edu/~zuo/CSSP.

CONTACT

keles@stat.wisc.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

ChIP-seq 技术使研究人员能够研究转录因子的全基因组结合和表观基因组标记的作图。尽管 ChIP-seq 数据的基本分析工具的可用性正在迅速增加,但在相关设计问题上并没有太大进展。设计 ChIP-seq 实验的一个具有挑战性的问题是 ChIP 和对照样本应该测序多深?答案取决于多个因素,其中一些因素可以根据实验者的初步数据进行设置。ChIP-seq 实验的测序深度是决定所有潜在目标(例如结合位置或表观基因组图谱)是否可以用靶向功率识别的关键因素之一。

结果

我们开发了一个名为 CSSP(ChIP-seq 统计功效)的统计框架,用于通过考虑常用的许多峰调用器采用的局部泊松模型来计算 ChIP-seq 实验的功效。模拟和数据驱动的计算实验评估表明,该框架可以根据初步数据可靠地估计不同测序深度的 ChIP-seq 实验的功效。此外,它提供了一种分析方法,用于在控制用户指定水平的假发现率的同时,计算靶向功效所需的深度。因此,我们的结果使研究人员能够使用自己或公开可用的数据来确定其 ChIP-seq 实验的所需测序深度,并有可能更好地利用测序仪的多路复用功能。对多个公共 ChIP-seq 数据集的功效评估表明,目前,典型的 ChIP-seq 研究在检测 ChIP 富集相对于对照样本的大倍数变化方面具有很好的功效,但在检测较小倍数变化方面的功效则要差得多。

可用性

可在 www.stat.wisc.edu/~zuo/CSSP 上获得。

联系方式

keles@stat.wisc.edu

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
A statistical framework for power calculations in ChIP-seq experiments.ChIP-seq 实验中功效计算的统计框架。
Bioinformatics. 2014 Mar 15;30(6):753-60. doi: 10.1093/bioinformatics/btt200. Epub 2013 May 10.
2
CNV-guided multi-read allocation for ChIP-seq.基于 CNV 的 ChIP-seq 多读取分配
Bioinformatics. 2014 Oct 15;30(20):2860-7. doi: 10.1093/bioinformatics/btu402. Epub 2014 Jun 24.
3
Data exploration, quality control and statistical analysis of ChIP-exo/nexus experiments.ChIP-exo/nexus实验的数据探索、质量控制与统计分析。
Nucleic Acids Res. 2017 Sep 6;45(15):e145. doi: 10.1093/nar/gkx594.
4
A MAD-Bayes Algorithm for State-Space Inference and Clustering with Application to Querying Large Collections of ChIP-Seq Data Sets.一种用于状态空间推理和聚类的MAD-贝叶斯算法及其在查询大量ChIP-Seq数据集方面的应用
J Comput Biol. 2017 Jun;24(6):472-485. doi: 10.1089/cmb.2016.0138. Epub 2016 Nov 11.
5
Collaborative Completion of Transcription Factor Binding Profiles via Local Sensitive Unified Embedding.通过局部敏感统一嵌入实现转录因子结合谱的协同完成
IEEE Trans Nanobioscience. 2016 Dec;15(8):946-958. doi: 10.1109/TNB.2016.2625823. Epub 2016 Nov 7.
6
ChiLin: a comprehensive ChIP-seq and DNase-seq quality control and analysis pipeline.麒麟:一个全面的染色质免疫沉淀测序(ChIP-seq)和DNA酶超敏感位点测序(DNase-seq)质量控制与分析流程。
BMC Bioinformatics. 2016 Oct 3;17(1):404. doi: 10.1186/s12859-016-1274-4.
7
A novel statistical method for quantitative comparison of multiple ChIP-seq datasets.一种用于多个ChIP-seq数据集定量比较的新型统计方法。
Bioinformatics. 2015 Jun 15;31(12):1889-96. doi: 10.1093/bioinformatics/btv094. Epub 2015 Feb 13.
8
dPeak: high resolution identification of transcription factor binding sites from PET and SET ChIP-Seq data.dPeak:从 PET 和 SET ChIP-Seq 数据中高分辨率识别转录因子结合位点。
PLoS Comput Biol. 2013;9(10):e1003246. doi: 10.1371/journal.pcbi.1003246. Epub 2013 Oct 17.
9
HiChIP: a high-throughput pipeline for integrative analysis of ChIP-Seq data.HiChIP:一种用于 ChIP-Seq 数据综合分析的高通量管道。
BMC Bioinformatics. 2014 Aug 15;15(1):280. doi: 10.1186/1471-2105-15-280.
10
An integrated ChIP-seq analysis platform with customizable workflows.一个具有可定制工作流程的集成 ChIP-seq 分析平台。
BMC Bioinformatics. 2011 Jul 7;12:277. doi: 10.1186/1471-2105-12-277.

引用本文的文献

1
Guiding the design of well-powered Hi-C experiments to detect differential loops.指导高动力Hi-C实验的设计以检测差异环。
Bioinform Adv. 2023 Oct 16;3(1):vbad152. doi: 10.1093/bioadv/vbad152. eCollection 2023.
2
Guiding the design of well-powered Hi-C experiments to detect differential loops.指导高动力Hi-C实验的设计以检测差异环。
bioRxiv. 2023 Mar 16:2023.03.15.532762. doi: 10.1101/2023.03.15.532762.
3
A Hierarchical Framework for State-Space Matrix Inference and Clustering.用于状态空间矩阵推理与聚类的分层框架
Ann Appl Stat. 2016 Sep;10(3):1348-1372. doi: 10.1214/16-AOAS938. Epub 2016 Sep 28.
4
Power and sample size calculations for high-throughput sequencing-based experiments.基于高通量测序的实验的功效和样本量计算。
Brief Bioinform. 2018 Nov 27;19(6):1247-1255. doi: 10.1093/bib/bbx061.
5
Sequencing on the SOLiD 5500xl System - in-depth characterization of the GC bias.SOLiD 5500xl 系统测序 - GC 偏倚的深入特征分析。
Nucleus. 2017 Jul 4;8(4):370-380. doi: 10.1080/19491034.2017.1320461. Epub 2017 Apr 27.
6
A MAD-Bayes Algorithm for State-Space Inference and Clustering with Application to Querying Large Collections of ChIP-Seq Data Sets.一种用于状态空间推理和聚类的MAD-贝叶斯算法及其在查询大量ChIP-Seq数据集方面的应用
J Comput Biol. 2017 Jun;24(6):472-485. doi: 10.1089/cmb.2016.0138. Epub 2016 Nov 11.
7
Recent advances in ChIP-seq analysis: from quality management to whole-genome annotation.染色质免疫沉淀测序(ChIP-seq)分析的最新进展:从质量管理到全基因组注释
Brief Bioinform. 2017 Mar 1;18(2):279-290. doi: 10.1093/bib/bbw023.
8
Systematic evaluation of the impact of ChIP-seq read designs on genome coverage, peak identification, and allele-specific binding detection.对ChIP-seq读取设计对基因组覆盖度、峰识别和等位基因特异性结合检测的影响进行系统评估。
BMC Bioinformatics. 2016 Feb 24;17:96. doi: 10.1186/s12859-016-0957-1.
9
Unifying the analysis of high-throughput sequencing datasets: characterizing RNA-seq, 16S rRNA gene sequencing and selective growth experiments by compositional data analysis.统一高通量测序数据集的分析:通过组合数据分析描述 RNA-seq、16S rRNA 基因测序和选择性生长实验。
Microbiome. 2014 May 5;2:15. doi: 10.1186/2049-2618-2-15. eCollection 2014.

本文引用的文献

1
A Statistical Framework for the Analysis of ChIP-Seq Data.用于ChIP-Seq数据分析的统计框架
J Am Stat Assoc. 2011;106(495):891-903. doi: 10.1198/jasa.2011.ap09706. Epub 2012 Jan 24.
2
Genome-scale analysis of escherichia coli FNR reveals complex features of transcription factor binding.大肠杆菌 FNR 的全基因组分析揭示了转录因子结合的复杂特征。
PLoS Genet. 2013 Jun;9(6):e1003565. doi: 10.1371/journal.pgen.1003565. Epub 2013 Jun 20.
3
GENCODE: the reference human genome annotation for The ENCODE Project.GENCODE:ENCODE 项目的人类参考基因组注释。
Genome Res. 2012 Sep;22(9):1760-74. doi: 10.1101/gr.135350.111.
4
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
5
Normalization of ChIP-seq data with control.使用对照样本进行 ChIP-seq 数据标准化处理。
BMC Bioinformatics. 2012 Aug 10;13:199. doi: 10.1186/1471-2105-13-199.
6
Systematic evaluation of factors influencing ChIP-seq fidelity.系统评估影响 ChIP-seq 保真度的因素。
Nat Methods. 2012 Jun;9(6):609-14. doi: 10.1038/nmeth.1985. Epub 2012 Apr 22.
7
Dynamics of the epigenetic landscape during erythroid differentiation after GATA1 restoration.GATA1 恢复后红系分化过程中的表观遗传景观动态变化。
Genome Res. 2011 Oct;21(10):1659-71. doi: 10.1101/gr.125088.111. Epub 2011 Jul 27.
8
ZINBA integrates local covariates with DNA-seq data to identify broad and narrow regions of enrichment, even within amplified genomic regions.ZINBA 将局部协变量与 DNA 测序数据相结合,以识别广泛和狭窄的富集区域,即使在扩增的基因组区域内也是如此。
Genome Biol. 2011 Jul 25;12(7):R67. doi: 10.1186/gb-2011-12-7-r67.
9
A user's guide to the encyclopedia of DNA elements (ENCODE).DNA 元件百科全书(ENCODE)使用指南
PLoS Biol. 2011 Apr;9(4):e1001046. doi: 10.1371/journal.pbio.1001046. Epub 2011 Apr 19.
10
ChIP-chip versus ChIP-seq: lessons for experimental design and data analysis.ChIP-chip 与 ChIP-seq:实验设计和数据分析的经验教训。
BMC Genomics. 2011 Feb 28;12:134. doi: 10.1186/1471-2164-12-134.