PePr：一种峰值检测优先级排序流程，用于从重复的ChIP-Seq数据中识别一致或差异峰值。

PePr: a peak-calling prioritization pipeline to identify consistent or differential peaks from replicated ChIP-Seq data.

作者信息

Zhang Yanxiao, Lin Yu-Hsuan, Johnson Timothy D, Rozek Laura S, Sartor Maureen A

机构信息

Department of Computational Medicine and Bioinformatics, Department of Biostatistics and Department of Environmental Health Sciences, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA Department of Computational Medicine and Bioinformatics, Department of Biostatistics and Department of Environmental Health Sciences, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA.

出版信息

Bioinformatics. 2014 Sep 15;30(18):2568-75. doi: 10.1093/bioinformatics/btu372. Epub 2014 Jun 3.

DOI:10.1093/bioinformatics/btu372

PMID:24894502

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4155259/

Abstract

MOTIVATION

ChIP-Seq is the standard method to identify genome-wide DNA-binding sites for transcription factors (TFs) and histone modifications. There is a growing need to analyze experiments with biological replicates, especially for epigenomic experiments where variation among biological samples can be substantial. However, tools that can perform group comparisons are currently lacking.

RESULTS

We present a peak-calling prioritization pipeline (PePr) for identifying consistent or differential binding sites in ChIP-Seq experiments with biological replicates. PePr models read counts across the genome among biological samples with a negative binomial distribution and uses a local variance estimation method, ranking consistent or differential binding sites more favorably than sites with greater variability. We compared PePr with commonly used and recently proposed approaches on eight TF datasets and show that PePr uniquely identifies consistent regions with enriched read counts, high motif occurrence rate and known characteristics of TF binding based on visual inspection. For histone modification data with broadly enriched regions, PePr identified differential regions that are consistent within groups and outperformed other methods in scaling False Discovery Rate (FDR) analysis.

AVAILABILITY AND IMPLEMENTATION

http://code.google.com/p/pepr-chip-seq/.

摘要

动机

染色质免疫沉淀测序（ChIP-Seq）是识别全基因组范围内转录因子（TFs）的DNA结合位点和组蛋白修饰的标准方法。对于生物重复实验的分析需求日益增长，特别是对于生物样本间差异可能很大的表观基因组实验。然而，目前缺乏能够进行组间比较的工具。

结果

我们提出了一种峰检测优先级流程（PePr），用于在具有生物重复的ChIP-Seq实验中识别一致或差异结合位点。PePr使用负二项分布对生物样本间全基因组的读数计数进行建模，并采用局部方差估计方法，相比于具有更大变异性的位点，更有利于对一致或差异结合位点进行排名。我们在八个TF数据集上，将PePr与常用和最近提出的方法进行了比较，结果表明，基于目视检查，PePr能独特地识别出具有富集读数计数、高基序出现率以及TF结合已知特征的一致区域。对于具有广泛富集区域的组蛋白修饰数据，PePr识别出组内一致的差异区域，并且在错误发现率（FDR）分析的规模上优于其他方法。

可用性和实现方式

http://code.google.com/p/pepr-chip-seq/ 。

相似文献

PePr: a peak-calling prioritization pipeline to identify consistent or differential peaks from replicated ChIP-Seq data.PePr：一种峰值检测优先级排序流程，用于从重复的ChIP-Seq数据中识别一致或差异峰值。

Bioinformatics. 2014 Sep 15;30(18):2568-75. doi: 10.1093/bioinformatics/btu372. Epub 2014 Jun 3.

HiChIP: a high-throughput pipeline for integrative analysis of ChIP-Seq data.HiChIP：一种用于 ChIP-Seq 数据综合分析的高通量管道。

BMC Bioinformatics. 2014 Aug 15;15(1):280. doi: 10.1186/1471-2105-15-280.

Sierra platinum: a fast and robust peak-caller for replicated ChIP-seq experiments with visual quality-control and -steering.塞拉铂：一种用于复制染色质免疫沉淀测序实验的快速且强大的峰识别工具，具备可视化质量控制与引导功能。

BMC Bioinformatics. 2016 Sep 15;17(1):377. doi: 10.1186/s12859-016-1248-6.

DiffChIPL: a differential peak analysis method for high-throughput sequencing data with biological replicates based on limma.DiffChIPL：一种基于 limma 的具有生物学重复的高通量测序数据差异峰分析方法。

Bioinformatics. 2022 Sep 2;38(17):4062-4069. doi: 10.1093/bioinformatics/btac498.

Using combined evidence from replicates to evaluate ChIP-seq peaks.使用来自重复样本的综合证据评估染色质免疫沉淀测序（ChIP-seq）峰。

Bioinformatics. 2015 Sep 1;31(17):2761-9. doi: 10.1093/bioinformatics/btv293. Epub 2015 May 7.

Ritornello: high fidelity control-free chromatin immunoprecipitation peak calling.利托内洛：高保真无对照染色质免疫沉淀峰检测

Nucleic Acids Res. 2017 Dec 1;45(21):e173. doi: 10.1093/nar/gkx799.

An improved ChIP-seq peak detection system for simultaneously identifying post-translational modified transcription factors by combinatorial fusion, using SUMOylation as an example.一种改良的 ChIP-seq 峰检测系统，用于通过组合融合，以 SUMOylation 为例，同时鉴定翻译后修饰的转录因子。

BMC Genomics. 2014;15 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-15-S1-S1. Epub 2014 Jan 24.

Identification of C2H2-ZF binding preferences from ChIP-seq data using RCADE.使用RCADE从ChIP-seq数据中鉴定C2H2锌指蛋白的结合偏好。

Bioinformatics. 2015 Sep 1;31(17):2879-81. doi: 10.1093/bioinformatics/btv284. Epub 2015 May 6.

Cell-type specificity of ChIP-predicted transcription factor binding sites.ChIP 预测转录因子结合位点的细胞类型特异性。

BMC Genomics. 2012 Aug 3;13:372. doi: 10.1186/1471-2164-13-372.

Integrative analysis of ChIP-chip and ChIP-seq dataset.芯片结合位点分析（ChIP-chip）和染色质免疫沉淀测序（ChIP-seq）数据集的综合分析。

Methods Mol Biol. 2013;1067:105-24. doi: 10.1007/978-1-62703-607-8_8.

引用本文的文献

Sex-stratified piRNA expression analysis reveals shared functional impacts of perinatal lead (Pb) exposure in murine hearts.性别分层的piRNA表达分析揭示了围产期铅（Pb）暴露对小鼠心脏的共同功能影响。

Epigenetics. 2025 Dec;20(1):2542879. doi: 10.1080/15592294.2025.2542879. Epub 2025 Aug 10.

The derived transposase 5 (PGBD5) can interact with human -like elements.衍生转座酶5（PGBD5）可与人源样元件相互作用。

bioRxiv. 2025 Aug 2:2025.07.31.667870. doi: 10.1101/2025.07.31.667870.

Wnt signaling activation induces CTCF binding and loop formation at -regulatory elements of target genes.Wnt信号激活诱导CTCF在靶基因的调控元件处结合并形成环。

Genome Res. 2025 Aug 1;35(8):1701-1716. doi: 10.1101/gr.279684.124.

DNA methylation confers a cerebellum-specific identity in non-human primates.DNA甲基化赋予非人类灵长类动物小脑特异性特征。

Zool Res. 2025 Mar 18;46(2):414-428. doi: 10.24272/j.issn.2095-8137.2024.325.

Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery.植物转录因子结合区域发现中计算方法的综合分析

Heliyon. 2024 Oct 10;10(20):e39140. doi: 10.1016/j.heliyon.2024.e39140. eCollection 2024 Oct 30.

CRISPRepi: a multi-omic atlas for CRISPR-based epigenome editing.CRISPRepi：基于CRISPR的表观基因组编辑的多组学图谱。

Nucleic Acids Res. 2025 Jan 6;53(D1):D901-D913. doi: 10.1093/nar/gkae1039.

5-Hydroxymethylcytosine in circulating cell-free DNA as a potential diagnostic biomarker for SLE.循环无细胞 DNA 中的 5-羟甲基胞嘧啶作为 SLE 的潜在诊断生物标志物。

Lupus Sci Med. 2024 Oct 4;11(2):e001286. doi: 10.1136/lupus-2024-001286.

Improving rigor and reproducibility in chromatin immunoprecipitation assay data analysis workflows with Rocketchip.利用Rocketchip提高染色质免疫沉淀分析数据分析工作流程的严谨性和可重复性。

bioRxiv. 2024 Jul 16:2024.07.10.602975. doi: 10.1101/2024.07.10.602975.

PRDM16 co-operates with LHX2 to shape the human brain.PRDM16与LHX2协同作用塑造人类大脑。

Oxf Open Neurosci. 2024 Jan 24;3:kvae001. doi: 10.1093/oons/kvae001. eCollection 2024.

Quantitative transcriptomic and epigenomic data analysis: a primer.定量转录组学和表观基因组数据分析：入门指南

Bioinform Adv. 2024 Feb 10;4(1):vbae019. doi: 10.1093/bioadv/vbae019. eCollection 2024.

本文引用的文献

Targeting the epigenome in lung cancer: expanding approaches to epigenetic therapy.靶向肺癌中的表观基因组：拓展表观遗传治疗方法

Front Oncol. 2013 Oct 9;3:261. doi: 10.3389/fonc.2013.00261.

Functions, aberrations, and advances for chromatin modulation in cancer.癌症中染色质调控的功能、异常及进展

Cancer Treat Res. 2014;159:227-39. doi: 10.1007/978-3-642-38007-5_13.

diffReps: detecting differential chromatin modification sites from ChIP-seq data with biological replicates.diffReps：使用具有生物学重复的 ChIP-seq 数据检测差异染色质修饰位点。

PLoS One. 2013 Jun 10;8(6):e65598. doi: 10.1371/journal.pone.0065598. Print 2013.

ER-stress-induced transcriptional regulation increases protein synthesis leading to cell death.内质网应激诱导的转录调控增加蛋白质合成，导致细胞死亡。

Nat Cell Biol. 2013 May;15(5):481-90. doi: 10.1038/ncb2738. Epub 2013 Apr 28.

BroadPeak: a novel algorithm for identifying broad peaks in diffuse ChIP-seq datasets.BroadPeak：一种用于识别弥散 ChIP-seq 数据集的宽峰的新算法。

Bioinformatics. 2013 Feb 15;29(4):492-3. doi: 10.1093/bioinformatics/bts722. Epub 2013 Jan 7.

Differential analysis of gene regulation at transcript resolution with RNA-seq.基于 RNA-seq 的转录分辨率下基因调控的差异分析。

Nat Biotechnol. 2013 Jan;31(1):46-53. doi: 10.1038/nbt.2450. Epub 2012 Dec 9.

ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia.ENC 和 modENCODE 联盟的 ChIP-seq 指南和实践。

Genome Res. 2012 Sep;22(9):1813-31. doi: 10.1101/gr.136184.111.

An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。

Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.

Functional analysis of transcription factor binding sites in human promoters.转录因子结合位点在人类启动子中的功能分析。

Genome Biol. 2012 Sep 26;13(9):R50. doi: 10.1186/gb-2012-13-9-r50.

The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding.三态算法：提高 ChIP-Seq 峰发现的灵敏度和特异性。

BMC Bioinformatics. 2012 Jul 24;13:176. doi: 10.1186/1471-2105-13-176.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。