• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OccuPeak:基于内部背景建模的ChIP-Seq峰检测

OccuPeak: ChIP-Seq peak calling based on internal background modelling.

作者信息

de Boer Bouke A, van Duijvenboden Karel, van den Boogaard Malou, Christoffels Vincent M, Barnett Phil, Ruijter Jan M

机构信息

Department of Anatomy, Embryology & Physiology, Academic Medical Centre, Amsterdam, The Netherlands.

出版信息

PLoS One. 2014 Jun 17;9(6):e99844. doi: 10.1371/journal.pone.0099844. eCollection 2014.

DOI:10.1371/journal.pone.0099844
PMID:24936875
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4061025/
Abstract

UNLABELLED

ChIP-seq has become a major tool for the genome-wide identification of transcription factor binding or histone modification sites. Most peak-calling algorithms require input control datasets to model the occurrence of background reads to account for local sequencing and GC bias. However, the GC-content of reads in Input-seq datasets deviates significantly from that in ChIP-seq datasets. Moreover, we observed that a commonly used peak calling program performed equally well when the use of a simulated uniform background set was compared to an Input-seq dataset. This contradicts the assumption that input control datasets are necessary to fatefully reflect the background read distribution. Because the GC-content of the abundant single reads in ChIP-seq datasets is similar to those of randomly sampled regions we designed a peak-calling algorithm with a background model based on overlapping single reads. The application, OccuPeak, uses the abundant low frequency tags present in each ChIP-seq dataset to model the background, thereby avoiding the need for additional datasets. Analysis of the performance of OccuPeak showed robust model parameters. Its measure of peak significance, the excess ratio, is only dependent on the tag density of a peak and the global noise levels. Compared to the commonly used peak-calling applications MACS and CisGenome, OccuPeak had the highest sensitivity in an enhancer identification benchmark test, and performed similar in an overlap tests of transcription factor occupation with DNase I hypersensitive sites and H3K27ac sites. Moreover, peaks called by OccuPeak were significantly enriched with cardiac disease-associated SNPs. OccuPeak runs as a standalone application and does not require extensive tweaking of parameters, making its use straightforward and user friendly.

AVAILABILITY

http://occupeak.hfrc.nl.

摘要

未加标签

染色质免疫沉淀测序(ChIP-seq)已成为全基因组范围内鉴定转录因子结合位点或组蛋白修饰位点的主要工具。大多数峰值检测算法需要输入对照数据集来模拟背景读数的出现情况,以考虑局部测序和GC偏差。然而,输入序列(Input-seq)数据集中读数的GC含量与ChIP-seq数据集中的GC含量有显著差异。此外,我们观察到,当将模拟均匀背景集与Input-seq数据集进行比较时,一个常用的峰值检测程序表现同样良好。这与输入对照数据集对于准确反映背景读数分布是必要的这一假设相矛盾。由于ChIP-seq数据集中丰富的单读数的GC含量与随机采样区域的GC含量相似,我们设计了一种基于重叠单读数的背景模型的峰值检测算法。该应用程序OccuPeak使用每个ChIP-seq数据集中存在的丰富低频标签来模拟背景,从而无需额外的数据集。对OccuPeak性能的分析显示其模型参数稳健。其峰值显著性度量,即过量比率,仅取决于峰值的标签密度和全局噪声水平。与常用的峰值检测应用程序MACS和CisGenome相比,OccuPeak在增强子鉴定基准测试中具有最高的灵敏度,并且在转录因子占据与DNase I超敏位点和H3K27ac位点的重叠测试中表现相似。此外,OccuPeak检测到的峰值在与心脏病相关的单核苷酸多态性(SNP)中显著富集。OccuPeak作为一个独立应用程序运行,不需要对参数进行大量调整,使用起来简单且用户友好。

可用性

http://occupeak.hfrc.nl 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/7762d94d33fc/pone.0099844.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/30e23a6dba72/pone.0099844.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/3c73f03d8bd3/pone.0099844.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/8491a5f3cb07/pone.0099844.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/721b889e5941/pone.0099844.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/f951171fc0f6/pone.0099844.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/1dc9802888eb/pone.0099844.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/b6140ff88078/pone.0099844.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/6bb190e6ee4e/pone.0099844.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/7762d94d33fc/pone.0099844.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/30e23a6dba72/pone.0099844.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/3c73f03d8bd3/pone.0099844.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/8491a5f3cb07/pone.0099844.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/721b889e5941/pone.0099844.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/f951171fc0f6/pone.0099844.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/1dc9802888eb/pone.0099844.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/b6140ff88078/pone.0099844.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/6bb190e6ee4e/pone.0099844.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f926/4061025/7762d94d33fc/pone.0099844.g009.jpg

相似文献

1
OccuPeak: ChIP-Seq peak calling based on internal background modelling.OccuPeak:基于内部背景建模的ChIP-Seq峰检测
PLoS One. 2014 Jun 17;9(6):e99844. doi: 10.1371/journal.pone.0099844. eCollection 2014.
2
RECAP reveals the true statistical significance of ChIP-seq peak calls.RECAP 揭示了 ChIP-seq 峰调用的真实统计意义。
Bioinformatics. 2019 Oct 1;35(19):3592-3598. doi: 10.1093/bioinformatics/btz150.
3
WACS: improving ChIP-seq peak calling by optimally weighting controls.WACS:通过最优加权对照来提高 ChIP-seq 峰调用。
BMC Bioinformatics. 2021 Feb 15;22(1):69. doi: 10.1186/s12859-020-03927-2.
4
Is this the right normalization? A diagnostic tool for ChIP-seq normalization.这是正确的标准化方法吗?一种用于ChIP-seq标准化的诊断工具。
BMC Bioinformatics. 2015 May 9;16:150. doi: 10.1186/s12859-015-0579-z.
5
Accounting for GC-content bias reduces systematic errors and batch effects in ChIP-seq data.考虑 GC 含量偏倚可减少 ChIP-seq 数据中的系统误差和批次效应。
Genome Res. 2017 Nov;27(11):1930-1938. doi: 10.1101/gr.220673.117. Epub 2017 Oct 12.
6
A non-parametric peak calling algorithm for DamID-Seq.一种用于DamID-Seq的非参数峰检测算法。
PLoS One. 2015 Mar 18;10(3):e0117415. doi: 10.1371/journal.pone.0117415. eCollection 2015.
7
Comparative analysis of commonly used peak calling programs for ChIP-Seq analysis.用于ChIP-Seq分析的常用峰检测程序的比较分析。
Genomics Inform. 2020 Dec;18(4):e42. doi: 10.5808/GI.2020.18.4.e42. Epub 2020 Dec 14.
8
Identifying ChIP-seq enrichment using MACS.使用 MACS 识别 ChIP-seq 富集
Nat Protoc. 2012 Sep;7(9):1728-40. doi: 10.1038/nprot.2012.101. Epub 2012 Aug 30.
9
Use model-based Analysis of ChIP-Seq (MACS) to analyze short reads generated by sequencing protein-DNA interactions in embryonic stem cells.使用基于模型的ChIP-Seq分析方法(MACS)来分析通过对胚胎干细胞中蛋白质-DNA相互作用进行测序而产生的短序列 reads。
Methods Mol Biol. 2014;1150:81-95. doi: 10.1007/978-1-4939-0512-6_4.
10
A comparison of peak callers used for DNase-Seq data.用于DNase-Seq数据的峰检测工具比较。
PLoS One. 2014 May 8;9(5):e96303. doi: 10.1371/journal.pone.0096303. eCollection 2014.

引用本文的文献

1
Improving rigor and reproducibility in chromatin immunoprecipitation assay data analysis workflows with Rocketchip.利用Rocketchip提高染色质免疫沉淀分析数据分析工作流程的严谨性和可重复性。
bioRxiv. 2024 Jul 16:2024.07.10.602975. doi: 10.1101/2024.07.10.602975.
2
Challenges and best practices in omics benchmarking.组学基准测试中的挑战和最佳实践。
Nat Rev Genet. 2024 May;25(5):326-339. doi: 10.1038/s41576-023-00679-6. Epub 2024 Jan 12.
3
Analysis of super-enhancer using machine learning and its application to medical biology.

本文引用的文献

1
Architectural protein subclasses shape 3D organization of genomes during lineage commitment.结构蛋白亚类在谱系确定过程中塑造基因组的 3D 结构。
Cell. 2013 Jun 6;153(6):1281-95. doi: 10.1016/j.cell.2013.04.053. Epub 2013 May 23.
2
Combining multiple ChIP-seq peak detection systems using combinatorial fusion.组合使用 ChIP-seq 峰检测系统进行组合融合。
BMC Genomics. 2012;13 Suppl 8(Suppl 8):S12. doi: 10.1186/1471-2164-13-S8-S12. Epub 2012 Dec 17.
3
Peak identification for ChIP-seq data with no controls.无对照的ChIP-seq数据的峰识别
基于机器学习的超级增强子分析及其在医学生物学中的应用。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad107.
4
An Optimized ChIP-Seq Protocol to Determine Chromatin Binding of Estrogen Receptor Beta.一种用于确定雌激素受体β染色质结合的优化染色质免疫沉淀测序(ChIP-Seq)方案。
Methods Mol Biol. 2022;2418:203-221. doi: 10.1007/978-1-0716-1920-9_13.
5
Genome-wide analysis of primary microRNA expression using H3K36me3 ChIP-seq data.利用H3K36me3染色质免疫沉淀测序数据对初级微小RNA表达进行全基因组分析。
Comput Struct Biotechnol J. 2021 Apr 5;19:1944-1955. doi: 10.1016/j.csbj.2021.03.035. eCollection 2021.
6
Genome-wide histone modification profiling of inner cell mass and trophectoderm of bovine blastocysts by RAT-ChIP.通过 RAT-ChIP 对牛囊胚的内细胞团和滋养外胚层进行全基因组组蛋白修饰谱分析。
PLoS One. 2019 Nov 25;14(11):e0225801. doi: 10.1371/journal.pone.0225801. eCollection 2019.
7
Chromatin Conformation Links Putative Enhancers in Intracranial Aneurysm-Associated Regions to Potential Candidate Genes.染色质构象将颅内动脉瘤相关区域的假定增强子与潜在候选基因联系起来。
J Am Heart Assoc. 2019 May 7;8(9):e011201. doi: 10.1161/JAHA.118.011201.
8
Spatiotemporal regulation of enhancers during cardiogenesis.心脏发生过程中增强子的时空调控
Cell Mol Life Sci. 2017 Jan;74(2):257-265. doi: 10.1007/s00018-016-2322-y. Epub 2016 Aug 6.
9
EMERGE: a flexible modelling framework to predict genomic regulatory elements from genomic signatures.EMERGE:一个用于从基因组特征预测基因组调控元件的灵活建模框架。
Nucleic Acids Res. 2016 Mar 18;44(5):e42. doi: 10.1093/nar/gkv1144. Epub 2015 Nov 3.
10
Chromatin analyses of Zymoseptoria tritici: Methods for chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq).小麦黄斑叶枯病菌的染色质分析:染色质免疫沉淀测序(ChIP-seq)方法
Fungal Genet Biol. 2015 Jun;79:63-70. doi: 10.1016/j.fgb.2015.03.006. Epub 2015 Apr 7.
Dongwuxue Yanjiu. 2012 Dec;33(E5-6):E121-8. doi: 10.3724/SP.J.1141.2012.E120-06E121.
4
ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia.ENC 和 modENCODE 联盟的 ChIP-seq 指南和实践。
Genome Res. 2012 Sep;22(9):1813-31. doi: 10.1101/gr.136184.111.
5
Systematic localization of common disease-associated variation in regulatory DNA.调控 DNA 中常见疾病相关变异的系统定位。
Science. 2012 Sep 7;337(6099):1190-5. doi: 10.1126/science.1222794. Epub 2012 Sep 5.
6
Identifying ChIP-seq enrichment using MACS.使用 MACS 识别 ChIP-seq 富集
Nat Protoc. 2012 Sep;7(9):1728-40. doi: 10.1038/nprot.2012.101. Epub 2012 Aug 30.
7
An encyclopedia of mouse DNA elements (Mouse ENCODE).小鼠DNA元件百科全书(小鼠ENCODE)。
Genome Biol. 2012 Aug 13;13(8):418. doi: 10.1186/gb-2012-13-8-418.
8
A map of the cis-regulatory sequences in the mouse genome.小鼠基因组中顺式调控序列的图谱。
Nature. 2012 Aug 2;488(7409):116-20. doi: 10.1038/nature11243.
9
Genetic variation in T-box binding element functionally affects SCN5A/SCN10A enhancer.T 盒结合元件的遗传变异对 SCN5A/SCN10A 增强子具有功能影响。
J Clin Invest. 2012 Jul;122(7):2519-30. doi: 10.1172/JCI62613. Epub 2012 Jun 18.
10
Systematic evaluation of factors influencing ChIP-seq fidelity.系统评估影响 ChIP-seq 保真度的因素。
Nat Methods. 2012 Jun;9(6):609-14. doi: 10.1038/nmeth.1985. Epub 2012 Apr 22.