• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于贝叶斯数据整合的转录模块发现。

Discovering transcriptional modules by Bayesian data integration.

机构信息

Systems Biology Centre, University of Warwick, Coventry, CV4 7AL, UK.

出版信息

Bioinformatics. 2010 Jun 15;26(12):i158-67. doi: 10.1093/bioinformatics/btq210.

DOI:10.1093/bioinformatics/btq210
PMID:20529901
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2881394/
Abstract

MOTIVATION

We present a method for directly inferring transcriptional modules (TMs) by integrating gene expression and transcription factor binding (ChIP-chip) data. Our model extends a hierarchical Dirichlet process mixture model to allow data fusion on a gene-by-gene basis. This encodes the intuition that co-expression and co-regulation are not necessarily equivalent and hence we do not expect all genes to group similarly in both datasets. In particular, it allows us to identify the subset of genes that share the same structure of transcriptional modules in both datasets.

RESULTS

We find that by working on a gene-by-gene basis, our model is able to extract clusters with greater functional coherence than existing methods. By combining gene expression and transcription factor binding (ChIP-chip) data in this way, we are better able to determine the groups of genes that are most likely to represent underlying TMs.

AVAILABILITY

If interested in the code for the work presented in this article, please contact the authors.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

我们提出了一种通过整合基因表达和转录因子结合(ChIP-chip)数据来直接推断转录模块(TMs)的方法。我们的模型扩展了层次狄利克雷过程混合模型,允许在逐基因的基础上进行数据融合。这就体现了这样一种直觉,即共表达和共调控不一定等同,因此我们不期望两个数据集的所有基因都以相似的方式分组。特别是,它允许我们确定在两个数据集中共有的转录模块结构的基因子集。

结果

我们发现,通过逐基因的方式,我们的模型能够提取出比现有方法具有更高功能一致性的聚类。通过以这种方式结合基因表达和转录因子结合(ChIP-chip)数据,我们能够更好地确定最有可能代表潜在 TM 的基因组。

可用性

如果有兴趣了解本文中介绍的工作的代码,请联系作者。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/6330a874bc14/btq210f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/027c443d7b2c/btq210f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/989c99204d2e/btq210f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/d7737e6579a2/btq210f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/6330a874bc14/btq210f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/027c443d7b2c/btq210f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/989c99204d2e/btq210f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/d7737e6579a2/btq210f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f25/2881394/6330a874bc14/btq210f4.jpg

相似文献

1
Discovering transcriptional modules by Bayesian data integration.基于贝叶斯数据整合的转录模块发现。
Bioinformatics. 2010 Jun 15;26(12):i158-67. doi: 10.1093/bioinformatics/btq210.
2
Bayesian hierarchical model for transcriptional module discovery by jointly modeling gene expression and ChIP-chip data.通过联合建模基因表达和芯片数据发现转录模块的贝叶斯层次模型。
BMC Bioinformatics. 2007 Aug 3;8:283. doi: 10.1186/1471-2105-8-283.
3
A Gibbs sampler for the identification of gene expression and network connectivity consistency.一种用于识别基因表达和网络连通性一致性的吉布斯采样器。
Bioinformatics. 2006 Dec 15;22(24):3040-6. doi: 10.1093/bioinformatics/btl541. Epub 2006 Oct 23.
4
Efficient inference for sparse latent variable models of transcriptional regulation.转录调控稀疏潜在变量模型的高效推断。
Bioinformatics. 2017 Dec 1;33(23):3776-3783. doi: 10.1093/bioinformatics/btx508.
5
Global and threshold-free Transcriptional Regulatory Networks reconstruction through integrating ChIP-Chip and expression data.通过整合 ChIP-Chip 和表达数据进行全局和无阈值转录调控网络重建。
Curr Protein Pept Sci. 2011 Nov;12(7):631-42. doi: 10.2174/1389203711109070631.
6
Learning transcriptional networks from the integration of ChIP-chip and expression data in a non-parametric model.从 ChIP-chip 和表达数据的非参数模型集成中学习转录网络。
Bioinformatics. 2010 Aug 1;26(15):1879-86. doi: 10.1093/bioinformatics/btq289. Epub 2010 Jun 4.
7
Prioritization of gene regulatory interactions from large-scale modules in yeast.酵母大规模模块中基因调控相互作用的优先级排序
BMC Bioinformatics. 2008 Jan 22;9:32. doi: 10.1186/1471-2105-9-32.
8
Integration of known transcription factor binding site information and gene expression data to advance from co-expression to co-regulation.整合已知转录因子结合位点信息与基因表达数据,以从共表达推进到共调控。
Genomics Proteomics Bioinformatics. 2007 May;5(2):86-101. doi: 10.1016/S1672-0229(07)60019-9.
9
Defining transcription modules using large-scale gene expression data.利用大规模基因表达数据定义转录模块。
Bioinformatics. 2004 Sep 1;20(13):1993-2003. doi: 10.1093/bioinformatics/bth166. Epub 2004 Mar 25.
10
Bayesian sparse hidden components analysis for transcription regulation networks.用于转录调控网络的贝叶斯稀疏隐藏成分分析
Bioinformatics. 2006 Mar 15;22(6):739-46. doi: 10.1093/bioinformatics/btk017. Epub 2005 Dec 20.

引用本文的文献

1
BAYESIAN LEARNING OF CLINICALLY MEANINGFUL SEPSIS PHENOTYPES IN NORTHERN TANZANIA.坦桑尼亚北部临床上有意义的脓毒症表型的贝叶斯学习
Ann Appl Stat. 2025 Sep;19(3):2193-2217. doi: 10.1214/25-aoas2045. Epub 2025 Aug 28.
2
VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data.VICatMix:用于离散生物医学数据的变分贝叶斯聚类和变量选择
Bioinform Adv. 2025 Mar 17;5(1):vbaf055. doi: 10.1093/bioadv/vbaf055. eCollection 2025.
3
Bayesian profile regression for clustering analysis involving a longitudinal response and explanatory variables.

本文引用的文献

1
Modeling and visualizing uncertainty in gene expression clusters using dirichlet process mixtures.使用狄利克雷过程混合模型对基因表达聚类中的不确定性进行建模和可视化。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):615-28. doi: 10.1109/TCBB.2007.70269.
2
R/BHC: fast Bayesian hierarchical clustering for microarray data.R/BHC:用于微阵列数据的快速贝叶斯层次聚类
BMC Bioinformatics. 2009 Aug 6;10:242. doi: 10.1186/1471-2105-10-242.
3
Transcriptional programs: modelling higher order structure in transcriptional control.转录程序:转录调控中高阶结构的建模
用于涉及纵向响应和解释变量的聚类分析的贝叶斯轮廓回归
Methodology (Gott). 2024 Mar 11;73(2):314-339. doi: 10.1093/jrsssc/qlad097. Epub 2023 Nov 8.
4
A Drug Repurposing Pipeline Based on Bladder Cancer Integrated Proteotranscriptomics Signatures.基于膀胱癌综合蛋白质转录组学特征的药物重新利用流程
Methods Mol Biol. 2023;2684:59-99. doi: 10.1007/978-1-0716-3291-8_4.
5
Statistical analysis of high-dimensional biomedical data: a gentle introduction to analytical goals, common approaches and challenges.高维生物医学数据的统计分析:分析目标、常见方法和挑战简介。
BMC Med. 2023 May 15;21(1):182. doi: 10.1186/s12916-023-02858-y.
6
Telescoping bimodal latent Dirichlet allocation to identify expression QTLs across tissues.跨组织的伸缩双模态潜在狄利克雷分配识别表达 QTLs。
Life Sci Alliance. 2022 Aug 17;5(12):e202101297. doi: 10.26508/lsa.202101297.
7
Unsupervised Multi-Omics Data Integration Methods: A Comprehensive Review.无监督多组学数据整合方法:全面综述
Front Genet. 2022 Mar 22;13:854752. doi: 10.3389/fgene.2022.854752. eCollection 2022.
8
ALOHA: Aggregated local extrema splines for high-throughput dose-response analysis.ALOHA:用于高通量剂量反应分析的聚合局部极值样条
Comput Toxicol. 2022 Feb;21. doi: 10.1016/j.comtox.2021.100196. Epub 2021 Oct 13.
9
Spatiotemporal proteomic profiling of the pro-inflammatory response to lipopolysaccharide in the THP-1 human leukaemia cell line.脂多糖诱导 THP-1 人白血病细胞炎症反应的时空蛋白质组学分析。
Nat Commun. 2021 Oct 1;12(1):5773. doi: 10.1038/s41467-021-26000-9.
10
Evaluation and comparison of multi-omics data integration methods for cancer subtyping.癌症亚型的多组学数据整合方法的评估与比较。
PLoS Comput Biol. 2021 Aug 12;17(8):e1009224. doi: 10.1371/journal.pcbi.1009224. eCollection 2021 Aug.
BMC Bioinformatics. 2009 Jul 16;10:218. doi: 10.1186/1471-2105-10-218.
4
Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.使用收缩相关系数对重复微阵列进行全基因组规模的聚类分析。
BMC Bioinformatics. 2008 Jun 18;9:288. doi: 10.1186/1471-2105-9-288.
5
Automated discovery of functional generality of human gene expression programs.人类基因表达程序功能通用性的自动化发现
PLoS Comput Biol. 2007 Aug;3(8):e148. doi: 10.1371/journal.pcbi.0030148. Epub 2007 Jun 13.
6
Bayesian hierarchical model for transcriptional module discovery by jointly modeling gene expression and ChIP-chip data.通过联合建模基因表达和芯片数据发现转录模块的贝叶斯层次模型。
BMC Bioinformatics. 2007 Aug 3;8:283. doi: 10.1186/1471-2105-8-283.
7
Using GOstats to test gene lists for GO term association.使用GOstats测试基因列表与GO术语的关联性。
Bioinformatics. 2007 Jan 15;23(2):257-8. doi: 10.1093/bioinformatics/btl567. Epub 2006 Nov 10.
8
Combining sequence and time series expression data to learn transcriptional modules.结合序列和时间序列表达数据以学习转录模块。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jul-Sep;2(3):194-202. doi: 10.1109/TCBB.2005.34.
9
Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。
BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.
10
Clustering microarray gene expression data using weighted Chinese restaurant process.使用加权中国餐馆过程对微阵列基因表达数据进行聚类
Bioinformatics. 2006 Aug 15;22(16):1988-97. doi: 10.1093/bioinformatics/btl284. Epub 2006 Jun 9.