• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChIP-seq 中链交叉关联的理论特征描述。

Theoretical characterisation of strand cross-correlation in ChIP-seq.

机构信息

Graduate School of Information Sciences, Tohoku University, Sendai, Miyagi, Japan.

Advanced Research Laboratory, Canon Medical Systems Corporation, Otawara, Tochigi, Japan.

出版信息

BMC Bioinformatics. 2020 Sep 22;21(1):417. doi: 10.1186/s12859-020-03729-6.

DOI:10.1186/s12859-020-03729-6
PMID:32962634
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7510163/
Abstract

BACKGROUND

Strand cross-correlation profiles are used for both peak calling pre-analysis and quality control (QC) in chromatin immunoprecipitation followed by sequencing (ChIP-seq) analysis. Despite its potential for robust and accurate assessments of signal-to-noise ratio (S/N) because of its peak calling independence, it remains unclear what aspects of quality such strand cross-correlation profiles actually measure.

RESULTS

We introduced a simple model to simulate the mapped read-density of ChIP-seq and then derived the theoretical maximum and minimum of cross-correlation coefficients between strands. The results suggest that the maximum coefficient of typical ChIP-seq samples is directly proportional to the number of total mapped reads and the square of the ratio of signal reads, and inversely proportional to the number of peaks and the length of read-enriched regions. Simulation analysis supported our results and evaluation using 790 ChIP-seq data obtained from the public database demonstrated high consistency between calculated cross-correlation coefficients and estimated coefficients based on the theoretical relations and peak calling results. In addition, we found that the mappability-bias-correction improved sensitivity, enabling differentiation of maximum coefficients from the noise level. Based on these insights, we proposed virtual S/N (VSN), a novel peak call-free metric for S/N assessment. We also developed PyMaSC, a tool to calculate strand cross-correlation and VSN efficiently. VSN achieved most consistent S/N estimation for various ChIP targets and sequencing read depths. Furthermore, we demonstrated that a combination of VSN and pre-existing peak calling results enable the estimation of the numbers of detectable peaks for posterior experiments and assess peak calling results.

CONCLUSIONS

We present the first theoretical insights into the strand cross-correlation, and the results reveal the potential and the limitations of strand cross-correlation analysis. Our quality assessment framework using VSN provides peak call-independent QC and will help in the evaluation of peak call analysis in ChIP-seq experiments.

摘要

背景

链间相关轮廓用于预分析峰调用和质量控制(QC),在染色质免疫沉淀测序(ChIP-seq)分析中。尽管由于其峰调用独立性,它具有稳健和准确评估信号噪声比(S/N)的潜力,但仍不清楚链间相关轮廓实际上测量了哪些质量方面。

结果

我们引入了一个简单的模型来模拟 ChIP-seq 的映射读密度,然后推导出链间交叉相关系数的理论最大值和最小值。结果表明,典型 ChIP-seq 样本的最大相关系数与总映射读数量成正比,与信号读的比例的平方成正比,与峰的数量和读富集区域的长度成反比。模拟分析支持了我们的结果,并且使用来自公共数据库的 790 个 ChIP-seq 数据的评估表明,计算出的交叉相关系数与基于理论关系和峰调用结果的估计系数之间具有高度一致性。此外,我们发现可映射性偏差校正提高了灵敏度,使最大系数能够从噪声水平中区分出来。基于这些见解,我们提出了虚拟 S/N(VSN),一种用于 S/N 评估的新型无峰调用指标。我们还开发了 PyMaSC,一种有效计算链间交叉相关和 VSN 的工具。VSN 实现了对各种 ChIP 靶标和测序读深度的最一致的 S/N 估计。此外,我们证明了 VSN 和现有的峰调用结果的组合能够估计后续实验中可检测峰的数量,并评估峰调用结果。

结论

我们首次提出了链间相关的理论见解,结果揭示了链间相关分析的潜力和局限性。我们使用 VSN 的质量评估框架提供了无峰调用的 QC,并将有助于评估 ChIP-seq 实验中的峰调用分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/3c87bcc60a04/12859_2020_3729_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/7454033edf3c/12859_2020_3729_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/a3de5803f503/12859_2020_3729_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/805a54a5dbb1/12859_2020_3729_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/bbcfae9ea1b3/12859_2020_3729_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/3c87bcc60a04/12859_2020_3729_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/7454033edf3c/12859_2020_3729_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/a3de5803f503/12859_2020_3729_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/805a54a5dbb1/12859_2020_3729_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/bbcfae9ea1b3/12859_2020_3729_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f5/7510163/3c87bcc60a04/12859_2020_3729_Fig5_HTML.jpg

相似文献

1
Theoretical characterisation of strand cross-correlation in ChIP-seq.ChIP-seq 中链交叉关联的理论特征描述。
BMC Bioinformatics. 2020 Sep 22;21(1):417. doi: 10.1186/s12859-020-03729-6.
2
Identification of factors associated with duplicate rate in ChIP-seq data.鉴定与 ChIP-seq 数据中重复率相关的因素。
PLoS One. 2019 Apr 3;14(4):e0214723. doi: 10.1371/journal.pone.0214723. eCollection 2019.
3
Sensitive and robust assessment of ChIP-seq read distribution using a strand-shift profile.使用链位移谱图对 ChIP-seq 读取分布进行灵敏且稳健的评估。
Bioinformatics. 2018 Jul 15;34(14):2356-2363. doi: 10.1093/bioinformatics/bty137.
4
WACS: improving ChIP-seq peak calling by optimally weighting controls.WACS:通过最优加权对照来提高 ChIP-seq 峰调用。
BMC Bioinformatics. 2021 Feb 15;22(1):69. doi: 10.1186/s12859-020-03927-2.
5
Unified Analysis of Multiple ChIP-Seq Datasets.多个 ChIP-Seq 数据集的统一分析。
Methods Mol Biol. 2021;2198:451-465. doi: 10.1007/978-1-0716-0876-0_33.
6
Comparative analysis of ChIP-exo peak-callers: impact of data quality, read duplication and binding subtypes.比较 ChIP-exo 峰调用程序:数据质量、读取重复和结合亚型的影响。
BMC Bioinformatics. 2020 Feb 21;21(1):65. doi: 10.1186/s12859-020-3403-3.
7
Exploring the Genomic Landscape: An In-depth ChIP-seq Analysis Protocol for Uncovering Protein-DNA Interactions.探索基因组景观:一种深入的 ChIP-seq 分析方案,用于揭示蛋白质-DNA 相互作用。
Curr Protoc. 2023 Oct;3(10):e909. doi: 10.1002/cpz1.909.
8
OccuPeak: ChIP-Seq peak calling based on internal background modelling.OccuPeak:基于内部背景建模的ChIP-Seq峰检测
PLoS One. 2014 Jun 17;9(6):e99844. doi: 10.1371/journal.pone.0099844. eCollection 2014.
9
RECAP reveals the true statistical significance of ChIP-seq peak calls.RECAP 揭示了 ChIP-seq 峰调用的真实统计意义。
Bioinformatics. 2019 Oct 1;35(19):3592-3598. doi: 10.1093/bioinformatics/btz150.
10
Shape-based peak identification for ChIP-Seq.基于形状的 ChIP-Seq 峰识别。
BMC Bioinformatics. 2011 Jan 12;12:15. doi: 10.1186/1471-2105-12-15.

引用本文的文献

1
Current progress and future perspective of super-enhancers: a viable and effective bridge between the transcriptional apparatus and disease.超级增强子的当前进展与未来展望:转录装置与疾病之间可行且有效的桥梁
Front Genet. 2025 Jul 2;16:1611905. doi: 10.3389/fgene.2025.1611905. eCollection 2025.

本文引用的文献

1
Sensitive and robust assessment of ChIP-seq read distribution using a strand-shift profile.使用链位移谱图对 ChIP-seq 读取分布进行灵敏且稳健的评估。
Bioinformatics. 2018 Jul 15;34(14):2356-2363. doi: 10.1093/bioinformatics/bty137.
2
The Encyclopedia of DNA elements (ENCODE): data portal update.《DNA 元件百科全书》(ENCODE):数据门户更新。
Nucleic Acids Res. 2018 Jan 4;46(D1):D794-D801. doi: 10.1093/nar/gkx1081.
3
Ritornello: high fidelity control-free chromatin immunoprecipitation peak calling.利托内洛:高保真无对照染色质免疫沉淀峰检测
Nucleic Acids Res. 2017 Dec 1;45(21):e173. doi: 10.1093/nar/gkx799.
4
Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.对GRCh38和从头单倍体基因组组装的评估证明了参考组装的持久质量。
Genome Res. 2017 May;27(5):849-864. doi: 10.1101/gr.213611.116. Epub 2017 Apr 10.
5
ChiLin: a comprehensive ChIP-seq and DNase-seq quality control and analysis pipeline.麒麟:一个全面的染色质免疫沉淀测序(ChIP-seq)和DNA酶超敏感位点测序(DNase-seq)质量控制与分析流程。
BMC Bioinformatics. 2016 Oct 3;17(1):404. doi: 10.1186/s12859-016-1274-4.
6
Recent advances in ChIP-seq analysis: from quality management to whole-genome annotation.染色质免疫沉淀测序(ChIP-seq)分析的最新进展:从质量管理到全基因组注释
Brief Bioinform. 2017 Mar 1;18(2):279-290. doi: 10.1093/bib/bbw023.
7
Systematic evaluation of the impact of ChIP-seq read designs on genome coverage, peak identification, and allele-specific binding detection.对ChIP-seq读取设计对基因组覆盖度、峰识别和等位基因特异性结合检测的影响进行系统评估。
BMC Bioinformatics. 2016 Feb 24;17:96. doi: 10.1186/s12859-016-0957-1.
8
A comprehensive comparison of tools for differential ChIP-seq analysis.用于差异染色质免疫沉淀测序(ChIP-seq)分析的工具的全面比较。
Brief Bioinform. 2016 Nov;17(6):953-966. doi: 10.1093/bib/bbv110. Epub 2016 Jan 13.
9
H3K9me3-Dependent Heterochromatin: Barrier to Cell Fate Changes.H3K9me3依赖的异染色质:细胞命运改变的障碍。
Trends Genet. 2016 Jan;32(1):29-41. doi: 10.1016/j.tig.2015.11.001. Epub 2015 Dec 8.
10
Integrative analysis of 111 reference human epigenomes.111 个人类参考基因组的综合分析。
Nature. 2015 Feb 19;518(7539):317-30. doi: 10.1038/nature14248.