Model-based method for transcription factor target identification with limited data.

Affiliation

Department of Information and Computer Science, Aalto University School of Science and Technology, Helsinki, Finland.

Publication information

Proc Natl Acad Sci U S A. 2010 Apr 27;107(17):7793-8. doi: 10.1073/pnas.0914285107. Epub 2010 Apr 12.

DOI:10.1073/pnas.0914285107

PMID:20385836

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2867914/

Abstract

We present a computational method for identifying potential targets of a transcription factor (TF) using wild-type gene expression time series data. For each putative target gene we fit a simple differential equation model of transcriptional regulation, and the model likelihood serves as a score to rank targets. The expression profile of the TF is modeled as a sample from a Gaussian process prior distribution that is integrated out using a nonparametric Bayesian procedure. This results in a parsimonious model with relatively few parameters that can be applied to short time series datasets without noticeable overfitting. We assess our method using genome-wide chromatin immunoprecipitation (ChIP-chip) and loss-of-function mutant expression data for two TFs, Twist and Mef2, controlling mesoderm development in Drosophila. Lists of top-ranked genes identified by our method are significantly enriched for genes close to bound regions identified in the ChIP-chip data and for genes that are differentially expressed in loss-of-function mutants. Targets of Twist display diverse expression profiles, and in this case a model-based approach performs significantly better than scoring based on correlation with TF expression. Our approach is found to be comparable or superior to ranking based on mutant differential expression scores. Also, we show how integrating complementary wild-type spatial expression data can further improve target ranking performance.
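The ranking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a linear model dx/dt = basal + sens * f(t) - decay * x(t), an RBF Gaussian process prior on the TF profile f, a Riemann-sum discretization of the ODE solution, and fixed rather than fitted parameters. All function and parameter names (`rbf_kernel`, `ode_response_operator`, `basal`, `sens`, `decay`) are hypothetical.

```python
import numpy as np

def rbf_kernel(t, variance=1.0, lengthscale=1.0):
    """Squared-exponential covariance of the latent TF profile f on grid t."""
    d = t[:, None] - t[None, :]
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)

def ode_response_operator(t, decay):
    """Matrix A with (A f)[i] ~ integral_0^{t_i} exp(-decay (t_i - u)) f(u) du.

    Assumes an evenly spaced grid; the integral is a simple Riemann sum.
    """
    dt = t[1] - t[0]
    lag = t[:, None] - t[None, :]
    weights = np.exp(-decay * np.clip(lag, 0.0, None))
    return np.where(lag >= 0, weights, 0.0) * dt

def log_marginal_likelihood(tf_obs, target_obs, obs_idx, t,
                            basal, sens, decay,
                            noise_var=0.01, lengthscale=1.0):
    """Joint Gaussian log-likelihood of TF and target observations.

    The latent f is integrated out analytically: both observation vectors
    are linear in f, so their joint covariance is M K M^T plus noise.
    """
    K = rbf_kernel(t, lengthscale=lengthscale)
    A = ode_response_operator(t, decay)
    n = len(obs_idx)
    # Linear map from f on the grid to [f at obs; target minus its mean at obs].
    M = np.vstack([np.eye(len(t))[obs_idx], sens * A[obs_idx]])
    C = M @ K @ M.T + noise_var * np.eye(2 * n)
    # Assumption: the target starts at its steady state basal / decay.
    mean = np.concatenate([np.zeros(n), np.full(n, basal / decay)])
    r = np.concatenate([tf_obs, target_obs]) - mean
    L = np.linalg.cholesky(C)
    alpha = np.linalg.solve(L, r)
    return (-0.5 * alpha @ alpha
            - np.log(np.diag(L)).sum()
            - 0.5 * len(r) * np.log(2.0 * np.pi))

# Toy data: a target generated by the model and a time-reversed decoy.
t = np.linspace(0.0, 10.0, 201)
obs_idx = np.arange(0, 201, 20)          # 11 sparse time points
basal, sens, decay = 0.2, 1.0, 0.5       # illustrative values only
tf_true = np.sin(t)
target_true = basal / decay + sens * ode_response_operator(t, decay) @ tf_true
decoy = target_true[::-1]

score_true = log_marginal_likelihood(
    tf_true[obs_idx], target_true[obs_idx], obs_idx, t, basal, sens, decay)
score_decoy = log_marginal_likelihood(
    tf_true[obs_idx], decoy[obs_idx], obs_idx, t, basal, sens, decay)
```

Ranking candidate genes then amounts to sorting them by this score. A fuller treatment would optimize the kinetic and kernel parameters per gene before scoring, which this sketch skips.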

Similar articles

Model-based method for transcription factor target identification with limited data.

Proc Natl Acad Sci U S A. 2010 Apr 27;107(17):7793-8. doi: 10.1073/pnas.0914285107. Epub 2010 Apr 12.

A core transcriptional network for early mesoderm development in Drosophila melanogaster.

Genes Dev. 2007 Feb 15;21(4):436-49. doi: 10.1101/gad.1509007.

The myogenic repressor gene Holes in muscles is a direct transcriptional target of Twist and Tinman in the Drosophila embryonic mesoderm.

Dev Biol. 2015 Apr 15;400(2):266-76. doi: 10.1016/j.ydbio.2015.02.005. Epub 2015 Feb 20.

Identifying targets of multiple co-regulating transcription factors from expression time-series by Bayesian model comparison.

BMC Syst Biol. 2012 May 30;6:53. doi: 10.1186/1752-0509-6-53.

A biophysical model for analysis of transcription factor interaction and binding site arrangement from genome-wide binding data.

PLoS One. 2009 Dec 1;4(12):e8155. doi: 10.1371/journal.pone.0008155.

Identifying biologically interpretable transcription factor knockout targets by jointly analyzing the transcription factor knockout microarray and the ChIP-chip data.

BMC Syst Biol. 2012 Aug 16;6:102. doi: 10.1186/1752-0509-6-102.

Mapping Dmef2-binding regulatory modules by using a ChIP-enriched in silico targets approach.

Proc Natl Acad Sci U S A. 2005 Dec 20;102(51):18479-84. doi: 10.1073/pnas.0507030102. Epub 2005 Dec 9.

Whole-genome ChIP-chip analysis of Dorsal, Twist, and Snail suggests integration of diverse patterning processes in the Drosophila embryo.

Genes Dev. 2007 Feb 15;21(4):385-90. doi: 10.1101/gad.1509607.

Bayesian inference based modelling for gene transcriptional dynamics by integrating multiple source of knowledge.

BMC Syst Biol. 2012;6 Suppl 1(Suppl 1):S3. doi: 10.1186/1752-0509-6-S1-S3. Epub 2012 Jul 16.

Integrated analyses to reconstruct microRNA-mediated regulatory networks in mouse liver using high-throughput profiling.

BMC Genomics. 2015;16 Suppl 2(Suppl 2):S12. doi: 10.1186/1471-2164-16-S2-S12. Epub 2015 Jan 21.

Cited by

A Bayesian noisy logic model for inference of transcription factor activity from single cell and bulk transcriptomic data.

NAR Genom Bioinform. 2023 Dec 13;5(4):lqad106. doi: 10.1093/nargab/lqad106. eCollection 2023 Dec.

Nonlinear expression patterns and multiple shifts in gene network interactions underlie robust phenotypic change in Drosophila melanogaster selected for night sleep duration.

PLoS Comput Biol. 2023 Aug 10;19(8):e1011389. doi: 10.1371/journal.pcbi.1011389. eCollection 2023 Aug.

Inferencing Bulk Tumor and Single-Cell Multi-Omics Regulatory Networks for Discovery of Biomarkers and Therapeutic Targets.

Cells. 2022 Dec 26;12(1):101. doi: 10.3390/cells12010101.

Accurate determination of causalities in gene regulatory networks by dissecting downstream target genes.

Front Genet. 2022 Dec 7;13:923339. doi: 10.3389/fgene.2022.923339. eCollection 2022.

Semi-Supervised Non-Parametric Bayesian Modelling of Spatial Proteomics.

Ann Appl Stat. 2022 Dec 1;16(4). doi: 10.1214/22-AOAS1603.

Predicting the targets of IRF8 and NFATc1 during osteoclast differentiation using the machine learning method framework cTAP.

BMC Genomics. 2022 Jan 7;23(1):14. doi: 10.1186/s12864-021-08159-z.

Inferring Gene Regulatory Networks Using the Improved Markov Blanket Discovery Algorithm.

Interdiscip Sci. 2022 Mar;14(1):168-181. doi: 10.1007/s12539-021-00478-9. Epub 2021 Sep 8.

Machine Learning Reduced Gene/Non-Coding RNA Features That Classify Schizophrenia Patients Accurately and Highlight Insightful Gene Clusters.

Int J Mol Sci. 2021 Mar 25;22(7):3364. doi: 10.3390/ijms22073364.

Kinetic modeling of stem cell transcriptome dynamics to identify regulatory modules of normal and disturbed neuroectodermal differentiation.

Nucleic Acids Res. 2020 Dec 16;48(22):12577-12592. doi: 10.1093/nar/gkaa1089.

RWRNET: A Gene Regulatory Network Inference Algorithm Using Random Walk With Restart.

Front Genet. 2020 Sep 25;11:591461. doi: 10.3389/fgene.2020.591461. eCollection 2020.

References

Combinatorial binding predicts spatio-temporal cis-regulatory activity.

Nature. 2009 Nov 5;462(7269):65-70. doi: 10.1038/nature08531.

Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions.

Genome Biol. 2009;10(7):R80. doi: 10.1186/gb-2009-10-7-r80. Epub 2009 Jul 23.

puma: a Bioconductor package for propagating uncertainty in microarray analysis.

BMC Bioinformatics. 2009 Jul 9;10:211. doi: 10.1186/1471-2105-10-211.

Backup in gene regulatory networks explains differences between binding and knockout results.

Mol Syst Biol. 2009;5:276. doi: 10.1038/msb.2009.33. Epub 2009 Jun 16.

rHVDM: an R package to predict the activity and targets of a transcription factor.

Bioinformatics. 2009 Feb 1;25(3):419-20. doi: 10.1093/bioinformatics/btn639. Epub 2008 Dec 15.

Gaussian process modelling of latent chemical species: applications to inferring transcription factor activities.

Bioinformatics. 2008 Aug 15;24(16):i70-5. doi: 10.1093/bioinformatics/btn278.

Direct targets of the TRP63 transcription factor revealed by a combination of gene expression profiling and reverse engineering.

Genome Res. 2008 Jun;18(6):939-48. doi: 10.1101/gr.073601.107. Epub 2008 Apr 25.

Transcription factors bind thousands of active and inactive regions in the Drosophila blastoderm.

PLoS Biol. 2008 Feb;6(2):e27. doi: 10.1371/journal.pbio.0060027.

Dialogue on reverse-engineering assessment and methods: the DREAM of high-throughput pathway inference.

Ann N Y Acad Sci. 2007 Dec;1115:1-22. doi: 10.1196/annals.1407.021. Epub 2007 Oct 9.

A core transcriptional network for early mesoderm development in Drosophila melanogaster.

Genes Dev. 2007 Feb 15;21(4):436-49. doi: 10.1101/gad.1509007.
