• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 ChIP-Seq 数据中鉴定体内蛋白质-DNA 结合位点的有效方法。

An effective approach for identification of in vivo protein-DNA binding sites from paired-end ChIP-Seq data.

机构信息

School of Life Science and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.

出版信息

BMC Bioinformatics. 2010 Feb 9;11:81. doi: 10.1186/1471-2105-11-81.

DOI:10.1186/1471-2105-11-81
PMID:20144209
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2831849/
Abstract

BACKGROUND

ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used for identification of protein-DNA interactions in vivo in the genome. However, to maximize the effectiveness of data analysis of such sequences requires the development of new algorithms that are able to accurately predict DNA-protein binding sites.

RESULTS

Here, we present SIPeS (Site Identification from Paired-end Sequencing), a novel algorithm for precise identification of binding sites from short reads generated by paired-end solexa ChIP-Seq technology. In this paper we used ChIP-Seq data from the Arabidopsis basic helix-loop-helix transcription factor ABORTED MICROSPORES (AMS), which is expressed within the anther during pollen development, the results show that SIPeS has better resolution for binding site identification compared to two existing ChIP-Seq peak detection algorithms, Cisgenome and MACS.

CONCLUSIONS

When compared to Cisgenome and MACS, SIPeS shows better resolution for binding site discovery. Moreover, SIPeS is designed to calculate the mappable genome length accurately with the fragment length based on the paired-end reads. Dynamic baselines are also employed to effectively discriminate closely adjacent binding sites, for effective binding sites discovery, which is of particular value when working with high-density genomes.

摘要

背景

ChIP-Seq 技术结合了染色质免疫沉淀(ChIP)和高通量平行测序,越来越多地用于在体内鉴定基因组中蛋白质与 DNA 的相互作用。然而,为了最大限度地提高此类序列数据分析的效果,需要开发能够准确预测 DNA-蛋白质结合位点的新算法。

结果

在这里,我们提出了 SIPeS(来自配对末端测序的位点识别),这是一种用于从基于配对末端焦磷酸测序的 ChIP-Seq 技术生成的短读序列中精确识别结合位点的新算法。在本文中,我们使用了拟南芥基本螺旋-环-螺旋转录因子 ABORTED MICROSPORES(AMS)的 ChIP-Seq 数据,该转录因子在花粉发育过程中在花药内表达,结果表明,与现有的两种 ChIP-Seq 峰检测算法 Cisgenome 和 MACS 相比,SIPeS 具有更好的结合位点识别分辨率。

结论

与 Cisgenome 和 MACS 相比,SIPeS 显示出更好的结合位点发现分辨率。此外,SIPeS 旨在根据基于配对末端读取的片段长度准确计算可映射基因组长度。还采用了动态基线来有效区分紧密相邻的结合位点,以便有效地发现结合位点,这在处理高密度基因组时具有特别重要的价值。

相似文献

1
An effective approach for identification of in vivo protein-DNA binding sites from paired-end ChIP-Seq data.从 ChIP-Seq 数据中鉴定体内蛋白质-DNA 结合位点的有效方法。
BMC Bioinformatics. 2010 Feb 9;11:81. doi: 10.1186/1471-2105-11-81.
2
Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data.从ChIP-Seq数据中进行全基因组范围内体内蛋白质-DNA结合位点的鉴定。
Nucleic Acids Res. 2008 Sep;36(16):5221-31. doi: 10.1093/nar/gkn488. Epub 2008 Aug 6.
3
ChIP-Seq data analysis: identification of protein-DNA binding sites with SISSRs peak-finder.染色质免疫沉淀测序(ChIP-Seq)数据分析:使用SISSRs峰检测工具鉴定蛋白质-DNA结合位点
Methods Mol Biol. 2012;802:305-22. doi: 10.1007/978-1-61779-400-1_20.
4
dPeak: high resolution identification of transcription factor binding sites from PET and SET ChIP-Seq data.dPeak:从 PET 和 SET ChIP-Seq 数据中高分辨率识别转录因子结合位点。
PLoS Comput Biol. 2013;9(10):e1003246. doi: 10.1371/journal.pcbi.1003246. Epub 2013 Oct 17.
5
Use model-based Analysis of ChIP-Seq (MACS) to analyze short reads generated by sequencing protein-DNA interactions in embryonic stem cells.使用基于模型的ChIP-Seq分析方法(MACS)来分析通过对胚胎干细胞中蛋白质-DNA相互作用进行测序而产生的短序列 reads。
Methods Mol Biol. 2014;1150:81-95. doi: 10.1007/978-1-4939-0512-6_4.
6
Computational analysis of ChIP-seq data.染色质免疫沉淀测序(ChIP-seq)数据的计算分析
Methods Mol Biol. 2010;674:143-59. doi: 10.1007/978-1-60761-854-6_9.
7
ChIPulate: A comprehensive ChIP-seq simulation pipeline.ChIPulate:一个全面的 ChIP-seq 模拟管道。
PLoS Comput Biol. 2019 Mar 21;15(3):e1006921. doi: 10.1371/journal.pcbi.1006921. eCollection 2019 Mar.
8
Computational analysis of protein-DNA interactions from ChIP-seq data.基于染色质免疫沉淀测序(ChIP-seq)数据的蛋白质-DNA相互作用的计算分析。
Methods Mol Biol. 2012;786:263-73. doi: 10.1007/978-1-61779-292-2_16.
9
Using CisGenome to analyze ChIP-chip and ChIP-seq data.使用CisGenome分析染色质免疫沉淀芯片(ChIP-chip)和染色质免疫沉淀测序(ChIP-seq)数据。
Curr Protoc Bioinformatics. 2011 Mar;Chapter 2:Unit2.13. doi: 10.1002/0471250953.bi0213s33.
10
Design and analysis of ChIP-seq experiments for DNA-binding proteins.用于DNA结合蛋白的ChIP-seq实验的设计与分析。
Nat Biotechnol. 2008 Dec;26(12):1351-9. doi: 10.1038/nbt.1508. Epub 2008 Nov 16.

引用本文的文献

1
Ritornello: high fidelity control-free chromatin immunoprecipitation peak calling.利托内洛:高保真无对照染色质免疫沉淀峰检测
Nucleic Acids Res. 2017 Dec 1;45(21):e173. doi: 10.1093/nar/gkx799.
2
Depletion of Arabidopsis SC35 and SC35-like serine/arginine-rich proteins affects the transcription and splicing of a subset of genes.拟南芥SC35及类SC35富含丝氨酸/精氨酸蛋白的缺失会影响一部分基因的转录和剪接。
PLoS Genet. 2017 Mar 8;13(3):e1006663. doi: 10.1371/journal.pgen.1006663. eCollection 2017 Mar.
3
A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana.

本文引用的文献

1
The ABORTED MICROSPORES regulatory network is required for postmeiotic male reproductive development in Arabidopsis thaliana.ABORTED MICROSPORES 调控网络对拟南芥减数分裂后雄性生殖发育是必需的。
Plant Cell. 2010 Jan;22(1):91-107. doi: 10.1105/tpc.109.071803. Epub 2010 Jan 29.
2
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.短DNA序列与人类基因组的超快速且内存高效比对。
Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. Epub 2009 Mar 4.
3
Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data.
拟南芥中转录因子结合的功能与进化视角
Plant Cell. 2014 Oct;26(10):3894-910. doi: 10.1105/tpc.114.130591. Epub 2014 Oct 31.
4
CMT: a constrained multi-level thresholding approach for ChIP-Seq data analysis.CMT:一种用于ChIP-Seq数据分析的约束多阈值方法。
PLoS One. 2014 Apr 15;9(4):e93873. doi: 10.1371/journal.pone.0093873. eCollection 2014.
5
A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.一个关于 motif 发现网络工具的调查,用于检测 ChIP-Seq 数据中的结合位点 motif。
Biol Direct. 2014 Feb 20;9:4. doi: 10.1186/1745-6150-9-4.
6
Practical guidelines for the comprehensive analysis of ChIP-seq data.《ChIP-seq 数据综合分析实用指南》
PLoS Comput Biol. 2013;9(11):e1003326. doi: 10.1371/journal.pcbi.1003326. Epub 2013 Nov 14.
7
dPeak: high resolution identification of transcription factor binding sites from PET and SET ChIP-Seq data.dPeak:从 PET 和 SET ChIP-Seq 数据中高分辨率识别转录因子结合位点。
PLoS Comput Biol. 2013;9(10):e1003246. doi: 10.1371/journal.pcbi.1003246. Epub 2013 Oct 17.
8
ZINBA integrates local covariates with DNA-seq data to identify broad and narrow regions of enrichment, even within amplified genomic regions.ZINBA 将局部协变量与 DNA 测序数据相结合,以识别广泛和狭窄的富集区域,即使在扩增的基因组区域内也是如此。
Genome Biol. 2011 Jul 25;12(7):R67. doi: 10.1186/gb-2011-12-7-r67.
9
Detection and removal of biases in the analysis of next-generation sequencing reads.下一代测序读段分析中的偏差检测与去除。
PLoS One. 2011 Jan 31;6(1):e16685. doi: 10.1371/journal.pone.0016685.
10
Evaluation of algorithm performance in ChIP-seq peak detection.评估 ChIP-seq 峰检测算法的性能。
PLoS One. 2010 Jul 8;5(7):e11471. doi: 10.1371/journal.pone.0011471.
基于染色质免疫沉淀测序(ChIP-Seq)数据的转录因子结合位点全基因组分析。
Nat Methods. 2008 Sep;5(9):829-34. doi: 10.1038/nmeth.1246.
4
Genome-wide identification of DNA-protein interactions using chromatin immunoprecipitation coupled with flow cell sequencing.利用染色质免疫沉淀结合流动池测序进行全基因组DNA-蛋白质相互作用的鉴定。
J Endocrinol. 2009 Apr;201(1):1-13. doi: 10.1677/JOE-08-0526. Epub 2009 Jan 9.
5
PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.PeakSeq能够相对于对照对ChIP-seq实验进行系统评分。
Nat Biotechnol. 2009 Jan;27(1):66-75. doi: 10.1038/nbt.1518. Epub 2009 Jan 4.
6
Design and analysis of ChIP-seq experiments for DNA-binding proteins.用于DNA结合蛋白的ChIP-seq实验的设计与分析。
Nat Biotechnol. 2008 Dec;26(12):1351-9. doi: 10.1038/nbt.1508. Epub 2008 Nov 16.
7
An integrated software system for analyzing ChIP-chip and ChIP-seq data.一个用于分析染色质免疫沉淀芯片(ChIP-chip)和染色质免疫沉淀测序(ChIP-seq)数据的集成软件系统。
Nat Biotechnol. 2008 Nov;26(11):1293-300. doi: 10.1038/nbt.1505. Epub 2008 Nov 2.
8
Model-based analysis of ChIP-Seq (MACS).基于模型的染色质免疫沉淀测序分析(MACS)
Genome Biol. 2008;9(9):R137. doi: 10.1186/gb-2008-9-9-r137. Epub 2008 Sep 17.
9
F-Seq: a feature density estimator for high-throughput sequence tags.F-Seq:一种用于高通量序列标签的特征密度估计器。
Bioinformatics. 2008 Nov 1;24(21):2537-8. doi: 10.1093/bioinformatics/btn480. Epub 2008 Sep 10.
10
Mapping short DNA sequencing reads and calling variants using mapping quality scores.使用比对质量分数比对短DNA测序读数并识别变异。
Genome Res. 2008 Nov;18(11):1851-8. doi: 10.1101/gr.078212.108. Epub 2008 Aug 19.