Suppr超能文献

主题模型在 ChIP-Seq 数据集丛集中的应用揭示了反复出现的转录调控模块。

Application of topic models to a compendium of ChIP-Seq datasets uncovers recurrent transcriptional regulatory modules.

机构信息

Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA.

Department of Cardiovascular Medicine, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, 710061, P. R. China.

出版信息

Bioinformatics. 2020 Apr 15;36(8):2352-2358. doi: 10.1093/bioinformatics/btz975.

Abstract

MOTIVATION

The availability of thousands of genome-wide coupling chromatin immunoprecipitation (ChIP)-Seq datasets across hundreds of transcription factors (TFs) and cell lines provides an unprecedented opportunity to jointly analyze large-scale TF-binding in vivo, making possible the discovery of the potential interaction and cooperation among different TFs. The interacted and cooperated TFs can potentially form a transcriptional regulatory module (TRM) (e.g. co-binding TFs), which helps decipher the combinatorial regulatory mechanisms.

RESULTS

We develop a computational method tfLDA to apply state-of-the-art topic models to multiple ChIP-Seq datasets to decipher the combinatorial binding events of multiple TFs. tfLDA is able to learn high-order combinatorial binding patterns of TFs from multiple ChIP-Seq profiles, interpret and visualize the combinatorial patterns. We apply the tfLDA to two cell lines with a rich collection of TFs and identify combinatorial binding patterns that show well-known TRMs and related TF co-binding events.

AVAILABILITY AND IMPLEMENTATION

A software R package tfLDA is freely available at https://github.com/lichen-lab/tfLDA.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

数以千计的全基因组偶联染色质免疫沉淀(ChIP)-Seq 数据集,涵盖数百个转录因子(TF)和细胞系,为联合分析大规模体内 TF 结合提供了前所未有的机会,从而有可能发现不同 TF 之间的潜在相互作用和合作。相互作用和合作的 TF 可能潜在地形成转录调控模块(TRM)(例如共同结合的 TF),这有助于破译组合调控机制。

结果

我们开发了一种计算方法 tfLDA,将最先进的主题模型应用于多个 ChIP-Seq 数据集,以破译多个 TF 的组合结合事件。tfLDA 能够从多个 ChIP-Seq 谱中学习 TF 的高阶组合结合模式,解释和可视化组合模式。我们将 tfLDA 应用于两个具有丰富 TF 集合的细胞系,并确定了显示已知 TRM 和相关 TF 共结合事件的组合结合模式。

可用性和实现

软件 R 包 tfLDA 可在 https://github.com/lichen-lab/tfLDA 上免费获得。

补充信息

补充数据可在生物信息学在线获得。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验