• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于ChIP-chip和ChIP-seq数据联合分析的分层隐马尔可夫模型

Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data.

作者信息

Choi Hyungwon, Nesvizhskii Alexey I, Ghosh Debashis, Qin Zhaohui S

机构信息

Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA.

出版信息

Bioinformatics. 2009 Jul 15;25(14):1715-21. doi: 10.1093/bioinformatics/btp312. Epub 2009 May 14.

DOI:10.1093/bioinformatics/btp312
PMID:19447789
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2732365/
Abstract

MOTIVATION

Chromatin immunoprecipitation (ChIP) experiments followed by array hybridization, or ChIP-chip, is a powerful approach for identifying transcription factor binding sites (TFBS) and has been widely used. Recently, massively parallel sequencing coupled with ChIP experiments (ChIP-seq) has been increasingly used as an alternative to ChIP-chip, offering cost-effective genome-wide coverage and resolution up to a single base pair. For many well-studied TFs, both ChIP-seq and ChIP-chip experiments have been applied and their data are publicly available. Previous analyses have revealed substantial technology-specific binding signals despite strong correlation between the two sets of results. Therefore, it is of interest to see whether the two data sources can be combined to enhance the detection of TFBS.

RESULTS

In this work, hierarchical hidden Markov model (HHMM) is proposed for combining data from ChIP-seq and ChIP-chip. In HHMM, inference results from individual HMMs in ChIP-seq and ChIP-chip experiments are summarized by a higher level HMM. Simulation studies show the advantage of HHMM when data from both technologies co-exist. Analysis of two well-studied TFs, NRSF and CCCTC-binding factor (CTCF), also suggests that HHMM yields improved TFBS identification in comparison to analyses using individual data sources or a simple merger of the two.

AVAILABILITY

Source code for the software ChIPmeta is freely available for download at http://www.umich.edu/~hwchoi/HHMMsoftware.zip, implemented in C and supported on linux.

摘要

动机

染色质免疫沉淀(ChIP)实验结合芯片杂交,即ChIP-chip,是一种识别转录因子结合位点(TFBS)的强大方法,已被广泛应用。最近,大规模平行测序与ChIP实验相结合(ChIP-seq)越来越多地被用作ChIP-chip的替代方法,它能以具有成本效益的方式实现全基因组覆盖,分辨率可达单碱基对。对于许多已深入研究的转录因子,ChIP-seq和ChIP-chip实验都已开展,且其数据可公开获取。先前的分析表明,尽管两组结果之间有很强的相关性,但仍存在大量技术特异性的结合信号。因此,研究这两种数据源能否结合以增强TFBS的检测很有意义。

结果

在这项工作中,提出了层次隐马尔可夫模型(HHMM)来结合ChIP-seq和ChIP-chip的数据。在HHMM中,ChIP-seq和ChIP-chip实验中各个隐马尔可夫模型的推断结果由一个更高层次的隐马尔可夫模型进行汇总。模拟研究表明,当两种技术的数据共存时,HHMM具有优势。对两个已深入研究的转录因子,即神经元限制性沉默因子(NRSF)和CCCTC结合因子(CTCF)的分析也表明,与使用单个数据源或简单合并两者的分析相比,HHMM在TFBS识别方面有改进。

可用性

软件ChIPmeta的源代码可从http://www.umich.edu/~hwchoi/HHMMsoftware.zip免费下载,用C语言实现,支持Linux系统。

相似文献

1
Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data.用于ChIP-chip和ChIP-seq数据联合分析的分层隐马尔可夫模型
Bioinformatics. 2009 Jul 15;25(14):1715-21. doi: 10.1093/bioinformatics/btp312. Epub 2009 May 14.
2
Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data.从ChIP-Seq数据中进行全基因组范围内体内蛋白质-DNA结合位点的鉴定。
Nucleic Acids Res. 2008 Sep;36(16):5221-31. doi: 10.1093/nar/gkn488. Epub 2008 Aug 6.
3
A hidden Ising model for ChIP-chip data analysis.用于 ChIP-chip 数据分析的隐藏伊辛模型。
Bioinformatics. 2010 Mar 15;26(6):777-83. doi: 10.1093/bioinformatics/btq032. Epub 2010 Jan 28.
4
Using CisGenome to analyze ChIP-chip and ChIP-seq data.使用CisGenome分析染色质免疫沉淀芯片(ChIP-chip)和染色质免疫沉淀测序(ChIP-seq)数据。
Curr Protoc Bioinformatics. 2011 Mar;Chapter 2:Unit2.13. doi: 10.1002/0471250953.bi0213s33.
5
A hidden Markov model for analyzing ChIP-chip experiments on genome tiling arrays and its application to p53 binding sequences.一种用于分析基因组平铺阵列上的染色质免疫沉淀芯片实验的隐马尔可夫模型及其在p53结合序列中的应用。
Bioinformatics. 2005 Jun;21 Suppl 1:i274-82. doi: 10.1093/bioinformatics/bti1046.
6
Computationally Tractable Multivariate HMM in Genome-Wide Mapping Studies.全基因组图谱研究中计算上易于处理的多变量隐马尔可夫模型
Methods Mol Biol. 2017;1552:135-148. doi: 10.1007/978-1-4939-6753-7_10.
7
Utilizing gene pair orientations for HMM-based analysis of promoter array ChIP-chip data.利用基因对方向进行基于 HMM 的启动子数组 ChIP-chip 数据分析。
Bioinformatics. 2009 Aug 15;25(16):2118-25. doi: 10.1093/bioinformatics/btp276. Epub 2009 Apr 28.
8
HPeak: an HMM-based algorithm for defining read-enriched regions in ChIP-Seq data.HPeak:一种基于隐马尔可夫模型的算法,用于定义 ChIP-Seq 数据中的读取富集区域。
BMC Bioinformatics. 2010 Jul 2;11:369. doi: 10.1186/1471-2105-11-369.
9
A Bayesian hidden Markov model for motif discovery through joint modeling of genomic sequence and ChIP-chip data.一种通过对基因组序列和芯片杂交数据进行联合建模来发现基序的贝叶斯隐马尔可夫模型。
Biometrics. 2009 Dec;65(4):1087-95. doi: 10.1111/j.1541-0420.2008.01180.x.
10
Modelling ChIP-seq Data Using HMMs.使用隐马尔可夫模型对染色质免疫沉淀测序(ChIP-seq)数据进行建模
Methods Mol Biol. 2017;1552:115-122. doi: 10.1007/978-1-4939-6753-7_8.

引用本文的文献

1
Statistical Issues in the Analysis of ChIP-Seq and RNA-Seq Data.ChIP-Seq 和 RNA-Seq 数据分析中的统计问题。
Genes (Basel). 2010 Sep 27;1(2):317-34. doi: 10.3390/genes1020317.
2
Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.转录因子活性的全基因组特征:连接转录因子、疾病和小分子。
PLoS Comput Biol. 2013;9(9):e1003198. doi: 10.1371/journal.pcbi.1003198. Epub 2013 Sep 5.
3
jMOSAiCS: joint analysis of multiple ChIP-seq datasets.jMOSAiCS:多个ChIP-seq数据集的联合分析
Genome Biol. 2013 Apr 29;14(4):R38. doi: 10.1186/gb-2013-14-4-r38.
4
Joint analysis of expression profiles from multiple cancers improves the identification of microRNA-gene interactions.多癌种表达谱的联合分析提高了 microRNA-基因相互作用的识别能力。
Bioinformatics. 2013 Sep 1;29(17):2137-45. doi: 10.1093/bioinformatics/btt341. Epub 2013 Jun 14.
5
Differential principal component analysis of ChIP-seq.染色质免疫沉淀测序(ChIP-seq)的差异主成分分析
Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):6789-94. doi: 10.1073/pnas.1204398110. Epub 2013 Apr 8.
6
Sparsely correlated hidden Markov models with application to genome-wide location studies.稀疏相关隐马尔可夫模型及其在全基因组定位研究中的应用。
Bioinformatics. 2013 Mar 1;29(5):533-41. doi: 10.1093/bioinformatics/btt012. Epub 2013 Jan 16.
7
Multi-profile hidden Markov model for mood, dietary intake, and physical activity in an intervention study of childhood obesity.多剖面隐马尔可夫模型在儿童肥胖干预研究中对情绪、饮食摄入和身体活动的应用。
Stat Med. 2013 Aug 30;32(19):3314-31. doi: 10.1002/sim.5719. Epub 2013 Jan 16.
8
A new approach for the joint analysis of multiple ChIP-seq libraries with application to histone modification.一种用于联合分析多个ChIP-seq文库并应用于组蛋白修饰的新方法。
Stat Appl Genet Mol Biol. 2012 Feb 10;11(3):Article 1. doi: 10.1515/1544-6115.1660.
9
MM-ChIP enables integrative analysis of cross-platform and between-laboratory ChIP-chip or ChIP-seq data.MM-ChIP 可实现跨平台和跨实验室 ChIP-chip 或 ChIP-seq 数据的综合分析。
Genome Biol. 2011;12(2):R11. doi: 10.1186/gb-2011-12-2-r11. Epub 2011 Feb 1.
10
Statistical Analyses of Next Generation Sequence Data: A Partial Overview.下一代测序数据的统计分析:部分概述
J Proteomics Bioinform. 2010 Jun 1;3(6):183-190. doi: 10.4172/jpb.1000138.

本文引用的文献

1
Epigenetics meets next-generation sequencing.表观遗传学与新一代测序技术相遇。
Epigenetics. 2008 Nov;3(6):318-21. doi: 10.4161/epi.3.6.7249. Epub 2008 Nov 22.
2
Accurate whole human genome sequencing using reversible terminator chemistry.使用可逆终止子化学法进行准确的全人类基因组测序。
Nature. 2008 Nov 6;456(7218):53-9. doi: 10.1038/nature07517.
3
An integrated software system for analyzing ChIP-chip and ChIP-seq data.一个用于分析染色质免疫沉淀芯片(ChIP-chip)和染色质免疫沉淀测序(ChIP-seq)数据的集成软件系统。
Nat Biotechnol. 2008 Nov;26(11):1293-300. doi: 10.1038/nbt.1505. Epub 2008 Nov 2.
4
Parameter estimation for robust HMM analysis of ChIP-chip data.用于ChIP-chip数据稳健隐马尔可夫模型分析的参数估计
BMC Bioinformatics. 2008 Aug 18;9:343. doi: 10.1186/1471-2105-9-343.
5
Integration of external signaling pathways with the core transcriptional network in embryonic stem cells.胚胎干细胞中外部信号通路与核心转录网络的整合。
Cell. 2008 Jun 13;133(6):1106-17. doi: 10.1016/j.cell.2008.04.043.
6
Dynamic remodeling of individual nucleosomes across a eukaryotic genome in response to transcriptional perturbation.真核生物基因组中单个核小体响应转录扰动的动态重塑。
PLoS Biol. 2008 Mar 18;6(3):e65. doi: 10.1371/journal.pbio.0060065.
7
Dynamic regulation of nucleosome positioning in the human genome.人类基因组中核小体定位的动态调控
Cell. 2008 Mar 7;132(5):887-98. doi: 10.1016/j.cell.2008.02.022.
8
Modeling recurrent DNA copy number alterations in array CGH data.阵列比较基因组杂交数据中复发性DNA拷贝数改变的建模
Bioinformatics. 2007 Jul 1;23(13):i450-8. doi: 10.1093/bioinformatics/btm221.
9
Genome-wide maps of chromatin state in pluripotent and lineage-committed cells.多能细胞和谱系定向细胞中染色质状态的全基因组图谱。
Nature. 2007 Aug 2;448(7153):553-60. doi: 10.1038/nature06008. Epub 2007 Jul 1.
10
Mapping of transcription factor binding regions in mammalian cells by ChIP: comparison of array- and sequencing-based technologies.通过染色质免疫沉淀法对哺乳动物细胞中转录因子结合区域进行定位:基于芯片和测序技术的比较
Genome Res. 2007 Jun;17(6):898-909. doi: 10.1101/gr.5583007.