• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因调控网络和转录因子活性谱的因子分析

Factor analysis for gene regulatory networks and transcription factor activity profiles.

作者信息

Pournara Iosifina, Wernisch Lorenz

机构信息

School of Crystallography, Birkbeck College, University of London, London, UK.

出版信息

BMC Bioinformatics. 2007 Feb 23;8:61. doi: 10.1186/1471-2105-8-61.

DOI:10.1186/1471-2105-8-61
PMID:17319944
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1821042/
Abstract

BACKGROUND

Most existing algorithms for the inference of the structure of gene regulatory networks from gene expression data assume that the activity levels of transcription factors (TFs) are proportional to their mRNA levels. This assumption is invalid for most biological systems. However, one might be able to reconstruct unobserved activity profiles of TFs from the expression profiles of target genes. A simple model is a two-layer network with unobserved TF variables in the first layer and observed gene expression variables in the second layer. TFs are connected to regulated genes by weighted edges. The weights, known as factor loadings, indicate the strength and direction of regulation. Of particular interest are methods that produce sparse networks, networks with few edges, since it is known that most genes are regulated by only a small number of TFs, and most TFs regulate only a small number of genes.

RESULTS

In this paper, we explore the performance of five factor analysis algorithms, Bayesian as well as classical, on problems with biological context using both simulated and real data. Factor analysis (FA) models are used in order to describe a larger number of observed variables by a smaller number of unobserved variables, the factors, whereby all correlation between observed variables is explained by common factors. Bayesian FA methods allow one to infer sparse networks by enforcing sparsity through priors. In contrast, in the classical FA, matrix rotation methods are used to enforce sparsity and thus to increase the interpretability of the inferred factor loadings matrix. However, we also show that Bayesian FA models that do not impose sparsity through the priors can still be used for the reconstruction of a gene regulatory network if applied in conjunction with matrix rotation methods. Finally, we show the added advantage of merging the information derived from all algorithms in order to obtain a combined result.

CONCLUSION

Most of the algorithms tested are successful in reconstructing the connectivity structure as well as the TF profiles. Moreover, we demonstrate that if the underlying network is sparse it is still possible to reconstruct hidden activity profiles of TFs to some degree without prior connectivity information.

摘要

背景

大多数现有的从基因表达数据推断基因调控网络结构的算法都假定转录因子(TFs)的活性水平与其mRNA水平成正比。这一假设对大多数生物系统而言是无效的。然而,人们或许能够从靶基因的表达谱中重建未观测到的TFs活性谱。一个简单的模型是一个两层网络,第一层有未观测到的TF变量,第二层有观测到的基因表达变量。TFs通过加权边与受调控基因相连。这些权重,即因子负荷,表明调控的强度和方向。特别令人感兴趣的是能产生稀疏网络(边很少的网络)的方法,因为已知大多数基因仅受少数TFs调控,且大多数TFs仅调控少数基因。

结果

在本文中,我们使用模拟数据和真实数据,探讨了五种因子分析算法(贝叶斯算法和经典算法)在具有生物学背景的问题上的性能。因子分析(FA)模型用于通过较少数量的未观测变量(即因子)来描述大量观测变量,从而使观测变量之间的所有相关性都由公共因子来解释。贝叶斯FA方法允许通过先验来强制稀疏性,从而推断稀疏网络。相比之下,在经典FA中,使用矩阵旋转方法来强制稀疏性,进而提高推断出的因子负荷矩阵的可解释性。然而,我们还表明,如果将不通过先验强制稀疏性的贝叶斯FA模型与矩阵旋转方法结合应用,仍然可用于基因调控网络的重建。最后,我们展示了合并所有算法得出的信息以获得综合结果的额外优势。

结论

大多数测试算法在重建连接结构以及TFs谱方面都取得了成功。此外,我们证明,如果基础网络是稀疏的,那么在没有先验连接信息的情况下,仍然有可能在一定程度上重建TFs的隐藏活性谱。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/68ce9b95de11/1471-2105-8-61-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/5ef0555b6e79/1471-2105-8-61-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/6e416b8b872e/1471-2105-8-61-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/b8ba005a2c32/1471-2105-8-61-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/3e942cb5524b/1471-2105-8-61-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/5f6bf06768b3/1471-2105-8-61-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/f8dedb93aaf8/1471-2105-8-61-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/95cb43693852/1471-2105-8-61-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/ffa0f7a9c0a7/1471-2105-8-61-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/ca3db8cd57f4/1471-2105-8-61-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/68ce9b95de11/1471-2105-8-61-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/5ef0555b6e79/1471-2105-8-61-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/6e416b8b872e/1471-2105-8-61-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/b8ba005a2c32/1471-2105-8-61-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/3e942cb5524b/1471-2105-8-61-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/5f6bf06768b3/1471-2105-8-61-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/f8dedb93aaf8/1471-2105-8-61-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/95cb43693852/1471-2105-8-61-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/ffa0f7a9c0a7/1471-2105-8-61-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/ca3db8cd57f4/1471-2105-8-61-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7cb/1821042/68ce9b95de11/1471-2105-8-61-10.jpg

相似文献

1
Factor analysis for gene regulatory networks and transcription factor activity profiles.基因调控网络和转录因子活性谱的因子分析
BMC Bioinformatics. 2007 Feb 23;8:61. doi: 10.1186/1471-2105-8-61.
2
Transcriptome network component analysis with limited microarray data.利用有限微阵列数据的转录组网络成分分析
Bioinformatics. 2006 Aug 1;22(15):1886-94. doi: 10.1093/bioinformatics/btl279. Epub 2006 Jun 9.
3
Using a state-space model with hidden variables to infer transcription factor activities.使用带有隐藏变量的状态空间模型来推断转录因子活性。
Bioinformatics. 2006 Mar 15;22(6):747-54. doi: 10.1093/bioinformatics/btk034. Epub 2006 Jan 10.
4
Unraveling transcriptional regulatory programs by integrative analysis of microarray and transcription factor binding data.通过微阵列和转录因子结合数据的综合分析来揭示转录调控程序。
Bioinformatics. 2008 Sep 1;24(17):1874-80. doi: 10.1093/bioinformatics/btn332. Epub 2008 Jun 27.
5
Inferring the role of transcription factors in regulatory networks.推断转录因子在调控网络中的作用。
BMC Bioinformatics. 2008 May 6;9:228. doi: 10.1186/1471-2105-9-228.
6
A Bayesian regression approach to the inference of regulatory networks from gene expression data.一种基于贝叶斯回归方法从基因表达数据推断调控网络。
Bioinformatics. 2005 Jul 15;21(14):3131-7. doi: 10.1093/bioinformatics/bti487. Epub 2005 May 6.
7
Using temporal correlation in factor analysis for reconstructing transcription factor activities.在因子分析中利用时间相关性来重建转录因子活性。
EURASIP J Bioinform Syst Biol. 2008;2008(1):172840. doi: 10.1155/2008/172840.
8
Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data.通过整合多源生物数据基于网络基序识别转录因子-靶基因关系
BMC Bioinformatics. 2008 Apr 21;9:203. doi: 10.1186/1471-2105-9-203.
9
A Bayesian approach to reconstructing genetic regulatory networks with hidden factors.一种用于重建具有隐藏因素的基因调控网络的贝叶斯方法。
Bioinformatics. 2005 Feb 1;21(3):349-56. doi: 10.1093/bioinformatics/bti014. Epub 2004 Sep 7.
10
A causal inference approach for constructing transcriptional regulatory networks.一种用于构建转录调控网络的因果推断方法。
Bioinformatics. 2005 Nov 1;21(21):4007-13. doi: 10.1093/bioinformatics/bti648. Epub 2005 Aug 30.

引用本文的文献

1
Inferring Covariance Structure from Multiple Data Sources via Subspace Factor Analysis.通过子空间因子分析从多个数据源推断协方差结构。
J Am Stat Assoc. 2025 Jun;120(550):1239-1253. doi: 10.1080/01621459.2024.2408777. Epub 2024 Dec 5.
2
Fast Variational Inference for Bayesian Factor Analysis in Single and Multi-Study Settings.单研究和多研究环境下贝叶斯因子分析的快速变分推理
J Comput Graph Stat. 2025;34(1):96-108. doi: 10.1080/10618600.2024.2356173. Epub 2024 Jul 17.
3
Adopting Integrated Bioinformatics and Systems Biology Approaches to Pinpoint the COVID-19 Patients' Risk Factors That Uplift the Onset of Posttraumatic Stress Disorder.

本文引用的文献

1
Modeling the manifolds of images of handwritten digits.对手写数字图像的流形进行建模。
IEEE Trans Neural Netw. 1997;8(1):65-74. doi: 10.1109/72.554192.
2
Independent component analysis reveals new and biologically significant structures in micro array data.独立成分分析揭示了微阵列数据中的新的且具有生物学意义的结构。
BMC Bioinformatics. 2006 Jun 8;7:290. doi: 10.1186/1471-2105-7-290.
3
Bayesian sparse hidden components analysis for transcription regulation networks.用于转录调控网络的贝叶斯稀疏隐藏成分分析
采用综合生物信息学和系统生物学方法来确定导致COVID-19患者发生创伤后应激障碍的风险因素。
Bioinform Biol Insights. 2024 Sep 11;18:11779322241274958. doi: 10.1177/11779322241274958. eCollection 2024.
4
Empirical Bayes Matrix Factorization.经验贝叶斯矩阵分解
J Mach Learn Res. 2021;22.
5
siVAE: interpretable deep generative models for single-cell transcriptomes.siVAE:单细胞转录组的可解释深度生成模型。
Genome Biol. 2023 Feb 20;24(1):29. doi: 10.1186/s13059-023-02850-y.
6
GTB-PPI: Predict Protein-protein Interactions Based on L1-regularized Logistic Regression and Gradient Tree Boosting.GTB-PPI:基于 L1 正则化逻辑回归和梯度提升树的蛋白质相互作用预测。
Genomics Proteomics Bioinformatics. 2020 Oct;18(5):582-592. doi: 10.1016/j.gpb.2021.01.001. Epub 2021 Jan 27.
7
A Matrix-free Likelihood Method for Exploratory Factor Analysis of High-dimensional Gaussian Data.一种用于高维高斯数据探索性因子分析的无矩阵似然方法。
J Comput Graph Stat. 2020;29(3):675-680. doi: 10.1080/10618600.2019.1704296. Epub 2020 Feb 7.
8
FarmTest: Factor-adjusted robust multiple testing with approximate false discovery control.FarmTest:具有近似错误发现率控制的因子调整稳健多重检验。
J Am Stat Assoc. 2019;114(528):1880-1893. doi: 10.1080/01621459.2018.1527700. Epub 2019 Mar 20.
9
Controlling for Confounding Effects in Single Cell RNA Sequencing Studies Using both Control and Target Genes.使用对照和靶标基因控制单细胞 RNA 测序研究中的混杂效应。
Sci Rep. 2017 Oct 19;7(1):13587. doi: 10.1038/s41598-017-13665-w.
10
Efficient inference for sparse latent variable models of transcriptional regulation.转录调控稀疏潜在变量模型的高效推断。
Bioinformatics. 2017 Dec 1;33(23):3776-3783. doi: 10.1093/bioinformatics/btx508.
Bioinformatics. 2006 Mar 15;22(6):739-46. doi: 10.1093/bioinformatics/btk017. Epub 2005 Dec 20.
4
Predicting transcription factor activities from combined analysis of microarray and ChIP data: a partial least squares approach.通过微阵列和染色质免疫沉淀数据的联合分析预测转录因子活性:一种偏最小二乘法
Theor Biol Med Model. 2005 Jun 24;2:23. doi: 10.1186/1742-4682-2-23.
5
gNCA: a framework for determining transcription factor activity based on transcriptome: identifiability and numerical implementation.gNCA:一种基于转录组确定转录因子活性的框架:可识别性与数值实现
Metab Eng. 2005 Mar;7(2):128-41. doi: 10.1016/j.ymben.2004.12.001.
6
Vocabulon: a dictionary model approach for reconstruction and localization of transcription factor binding sites.Vocabulon:一种用于转录因子结合位点重建和定位的字典模型方法。
Bioinformatics. 2005 Apr 1;21(7):922-31. doi: 10.1093/bioinformatics/bti083. Epub 2004 Oct 27.
7
Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis.利用网络成分分析基于转录组测定大肠杆菌中多种转录调节因子的活性
Proc Natl Acad Sci U S A. 2004 Jan 13;101(2):641-6. doi: 10.1073/pnas.0305287101. Epub 2003 Dec 23.
8
Network component analysis: reconstruction of regulatory signals in biological systems.网络组件分析:生物系统中调控信号的重建
Proc Natl Acad Sci U S A. 2003 Dec 23;100(26):15522-7. doi: 10.1073/pnas.2136632100. Epub 2003 Dec 12.
9
A decomposition model to track gene expression signatures: preview on observer-independent classification of ovarian cancer.一种用于追踪基因表达特征的分解模型:卵巢癌独立于观察者分类的预览。
Bioinformatics. 2002 Dec;18(12):1617-24. doi: 10.1093/bioinformatics/18.12.1617.
10
Linear modes of gene expression determined by independent component analysis.通过独立成分分析确定的基因表达线性模式。
Bioinformatics. 2002 Jan;18(1):51-60. doi: 10.1093/bioinformatics/18.1.51.