• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

转录调控稀疏潜在变量模型的高效推断。

Efficient inference for sparse latent variable models of transcriptional regulation.

机构信息

Department of Computer Science, University of Sheffield, Sheffield, UK.

Amazon Research, Cambridge, UK.

出版信息

Bioinformatics. 2017 Dec 1;33(23):3776-3783. doi: 10.1093/bioinformatics/btx508.

DOI:10.1093/bioinformatics/btx508
PMID:28961802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5860323/
Abstract

MOTIVATION

Regulation of gene expression in prokaryotes involves complex co-regulatory mechanisms involving large numbers of transcriptional regulatory proteins and their target genes. Uncovering these genome-scale interactions constitutes a major bottleneck in systems biology. Sparse latent factor models, assuming activity of transcription factors (TFs) as unobserved, provide a biologically interpretable modelling framework, integrating gene expression and genome-wide binding data, but at the same time pose a hard computational inference problem. Existing probabilistic inference methods for such models rely on subjective filtering and suffer from scalability issues, thus are not well-suited for realistic genome-scale applications.

RESULTS

We present a fast Bayesian sparse factor model, which takes input gene expression and binding sites data, either from ChIP-seq experiments or motif predictions, and outputs active TF-gene links as well as latent TF activities. Our method employs an efficient variational Bayes scheme for model inference enabling its application to large datasets which was not feasible with existing MCMC-based inference methods for such models. We validate our method on synthetic data against a similar model in the literature, employing MCMC for inference, and obtain comparable results with a small fraction of the computational time. We also apply our method to large-scale data from Mycobacterium tuberculosis involving ChIP-seq data on 113 TFs and matched gene expression data for 3863 putative target genes. We evaluate our predictions using an independent transcriptomics experiment involving over-expression of TFs.

AVAILABILITY AND IMPLEMENTATION

An easy-to-use Jupyter notebook demo of our method with data is available at https://github.com/zhenwendai/SITAR.

CONTACT

mudassar.iqbal@manchester.ac.uk.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

原核生物中基因表达的调控涉及复杂的协同调控机制,涉及大量转录调控蛋白及其靶基因。揭示这些基因组规模的相互作用是系统生物学的一个主要瓶颈。稀疏潜在因子模型,假设转录因子(TFs)的活性是不可观测的,为基因表达和全基因组结合数据提供了一个具有生物学可解释性的建模框架,但同时也构成了一个困难的计算推断问题。现有的此类模型的概率推断方法依赖于主观过滤,并且存在可扩展性问题,因此不适用于现实的基因组规模应用。

结果

我们提出了一种快速贝叶斯稀疏因子模型,它接受来自 ChIP-seq 实验或基序预测的基因表达和结合位点数据作为输入,并输出活跃的 TF-基因链接以及潜在的 TF 活性。我们的方法采用了一种有效的变分贝叶斯方案进行模型推断,使其能够应用于大型数据集,而这在以前基于 MCMC 的此类模型的推断方法中是不可行的。我们在合成数据上对我们的方法进行了验证,与文献中的类似模型进行了比较,使用 MCMC 进行推断,并在计算时间的一小部分内获得了可比的结果。我们还将我们的方法应用于涉及 113 个 TF 的 ChIP-seq 数据和 3863 个假定靶基因的匹配基因表达数据的大型结核分枝杆菌数据集。我们使用涉及 TF 过表达的独立转录组学实验来评估我们的预测。

可用性和实现

我们的方法的带有数据的易于使用的 Jupyter 笔记本演示可在 https://github.com/zhenwendai/SITAR 上获得。

联系方式

mudassar.iqbal@manchester.ac.uk。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/a323771b78ac/btx508f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/8213bf22c5d4/btx508f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/1a3d2459a8c0/btx508f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/150c966dbb2e/btx508f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/8ee0f6ec3fd9/btx508f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/a323771b78ac/btx508f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/8213bf22c5d4/btx508f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/1a3d2459a8c0/btx508f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/150c966dbb2e/btx508f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/8ee0f6ec3fd9/btx508f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b700/5860323/a323771b78ac/btx508f5.jpg

相似文献

1
Efficient inference for sparse latent variable models of transcriptional regulation.转录调控稀疏潜在变量模型的高效推断。
Bioinformatics. 2017 Dec 1;33(23):3776-3783. doi: 10.1093/bioinformatics/btx508.
2
Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks.通过整合转录网络推断,预测果蝇中的调控模型。
Genome Res. 2012 Jul;22(7):1334-49. doi: 10.1101/gr.127191.111. Epub 2012 Mar 28.
3
Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data.通过整合多源生物数据基于网络基序识别转录因子-靶基因关系
BMC Bioinformatics. 2008 Apr 21;9:203. doi: 10.1186/1471-2105-9-203.
4
REUNION: transcription factor binding prediction and regulatory association inference from single-cell multi-omics data.REUNION:从单细胞多组学数据中进行转录因子结合预测和调控关联推断。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i567-i575. doi: 10.1093/bioinformatics/btae234.
5
Transcription Factor Binding Site Mapping Using ChIP-Seq.利用 ChIP-Seq 进行转录因子结合位点作图。
Microbiol Spectr. 2014 Apr;2(2). doi: 10.1128/microbiolspec.MGM2-0035-2013.
6
Improved linking of motifs to their TFs using domain information.利用域信息改进基序与其 TF 的关联。
Bioinformatics. 2020 Mar 1;36(6):1655-1662. doi: 10.1093/bioinformatics/btz855.
7
Application of topic models to a compendium of ChIP-Seq datasets uncovers recurrent transcriptional regulatory modules.主题模型在 ChIP-Seq 数据集丛集中的应用揭示了反复出现的转录调控模块。
Bioinformatics. 2020 Apr 15;36(8):2352-2358. doi: 10.1093/bioinformatics/btz975.
8
Inferring transcriptional logic from multiple dynamic experiments.从多个动态实验中推断转录逻辑。
Bioinformatics. 2017 Nov 1;33(21):3437-3444. doi: 10.1093/bioinformatics/btx407.
9
A comprehensive map of genome-wide gene regulation in Mycobacterium tuberculosis.结核分枝杆菌全基因组基因调控的综合图谱。
Sci Data. 2015 Mar 31;2:150010. doi: 10.1038/sdata.2015.10. eCollection 2015.
10
Probabilistic inference of transcription factor concentrations and gene-specific regulatory activities.转录因子浓度和基因特异性调控活性的概率推断
Bioinformatics. 2006 Nov 15;22(22):2775-81. doi: 10.1093/bioinformatics/btl473. Epub 2006 Sep 11.

引用本文的文献

1
Multi-study inference of regulatory networks for more accurate models of gene regulation.多研究推断调控网络,以更准确地构建基因调控模型。
PLoS Comput Biol. 2019 Jan 24;15(1):e1006591. doi: 10.1371/journal.pcbi.1006591. eCollection 2019 Jan.

本文引用的文献

1
A prior-based integrative framework for functional transcriptional regulatory network inference.基于先验的功能转录调控网络推断综合框架。
Nucleic Acids Res. 2017 Feb 28;45(4):e21. doi: 10.1093/nar/gkw963.
2
An experimentally supported model of the Bacillus subtilis global transcriptional regulatory network.一个经实验支持的枯草芽孢杆菌全局转录调控网络模型。
Mol Syst Biol. 2015 Nov 17;11(11):839. doi: 10.15252/msb.20156236.
3
RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond.
RegulonDB 9.0版本:基因调控、共表达、基序聚类及其他方面的高级整合。
Nucleic Acids Res. 2016 Jan 4;44(D1):D133-43. doi: 10.1093/nar/gkv1156. Epub 2015 Nov 2.
4
A comprehensive map of genome-wide gene regulation in Mycobacterium tuberculosis.结核分枝杆菌全基因组基因调控的综合图谱。
Sci Data. 2015 Mar 31;2:150010. doi: 10.1038/sdata.2015.10. eCollection 2015.
5
Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells.单细胞 RNA 测序数据中细胞间异质性的计算分析揭示了细胞的隐藏亚群。
Nat Biotechnol. 2015 Feb;33(2):155-60. doi: 10.1038/nbt.3102. Epub 2015 Jan 19.
6
The DNA-binding network of Mycobacterium tuberculosis.结核分枝杆菌的DNA结合网络。
Nat Commun. 2015 Jan 12;6:5829. doi: 10.1038/ncomms6829.
7
Mapping and manipulating the Mycobacterium tuberculosis transcriptome using a transcription factor overexpression-derived regulatory network.利用转录因子过表达衍生的调控网络对结核分枝杆菌转录组进行映射和操作。
Genome Biol. 2014;15(11):502. doi: 10.1186/PREACCEPT-1701638048134699.
8
Genomic insights into tuberculosis.基因组学对结核病的研究进展
Nat Rev Genet. 2014 May;15(5):307-20. doi: 10.1038/nrg3664. Epub 2014 Mar 25.
9
The Mycobacterium tuberculosis regulatory network and hypoxia.结核分枝杆菌调控网络与低氧。
Nature. 2013 Jul 11;499(7457):178-83. doi: 10.1038/nature12337. Epub 2013 Jul 3.
10
Wisdom of crowds for robust gene network inference.群体智慧在稳健基因网络推断中的应用。
Nat Methods. 2012 Jul 15;9(8):796-804. doi: 10.1038/nmeth.2016.