Multi-study inference of regulatory networks for more accurate models of gene regulation.

Affiliations

New York University, New York, NY 10003, USA.

Center for Computational Biology, Flatiron Institute, New York, NY 10010, USA.

Publication information

PLoS Comput Biol. 2019 Jan 24;15(1):e1006591. doi: 10.1371/journal.pcbi.1006591. eCollection 2019 Jan.

DOI:10.1371/journal.pcbi.1006591

PMID:30677040

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6363223/

Abstract

Gene regulatory networks are composed of sub-networks that are often shared across biological processes, cell-types, and organisms. Leveraging multiple sources of information, such as publicly available gene expression datasets, could therefore be helpful when learning a network of interest. Integrating data across different studies, however, raises numerous technical concerns. Hence, a common approach in network inference, and broadly in genomics research, is to separately learn models from each dataset and combine the results. Individual models, however, often suffer from under-sampling, poor generalization and limited network recovery. In this study, we explore previous integration strategies, such as batch-correction and model ensembles, and introduce a new multitask learning approach for joint network inference across several datasets. Our method initially estimates the activities of transcription factors, and subsequently, infers the relevant network topology. As regulatory interactions are context-dependent, we estimate model coefficients as a combination of both dataset-specific and conserved components. In addition, adaptive penalties may be used to favor models that include interactions derived from multiple sources of prior knowledge including orthogonal genomics experiments. We evaluate generalization and network recovery using examples from Bacillus subtilis and Saccharomyces cerevisiae, and show that sharing information across models improves network reconstruction. Finally, we demonstrate robustness to both false positives in the prior information and heterogeneity among datasets.
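The two-stage idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes that transcription factor activities are estimated by least squares from a known gene-by-TF prior matrix, and that each dataset's coefficients are the sum of a conserved vector and a dataset-specific vector, both under an l1 penalty fitted by proximal gradient descent. The function names, the penalty form, the `weights` argument (standing in for adaptive penalties), and the synthetic data are all hypothetical.

```python
import numpy as np


def estimate_tf_activity(expr, prior):
    """Least-squares TF activities A solving prior @ A ~= expr.

    expr: genes x samples; prior: genes x TFs (assumed known targets).
    """
    return np.linalg.pinv(prior) @ expr


def soft_threshold(v, t):
    """Proximal operator of the (weighted) l1 penalty."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def fit_shared_plus_specific(X_list, y_list, lam_shared, lam_specific,
                             weights=None, n_iter=5000):
    """Proximal gradient descent for one target gene across K datasets.

    Dataset k uses coefficients shared + specific[k]. `weights` scales the
    penalty per TF; values below 1 favor interactions supported by a prior.
    """
    K, p = len(X_list), X_list[0].shape[1]
    weights = np.ones(p) if weights is None else np.asarray(weights, float)
    shared, specific = np.zeros(p), np.zeros((K, p))
    # Conservative step size: 1 / (2 * sum of per-dataset Lipschitz constants).
    lipschitz = 2.0 * sum(np.linalg.norm(X, 2) ** 2 / X.shape[0] for X in X_list)
    step = 1.0 / lipschitz
    for _ in range(n_iter):
        # Per-dataset gradient of the mean squared error (row k = dataset k).
        grads = np.stack([
            X.T @ (X @ (shared + specific[k]) - y) / X.shape[0]
            for k, (X, y) in enumerate(zip(X_list, y_list))
        ])
        # The shared part sees the summed gradient; each specific part its own.
        shared = soft_threshold(shared - step * grads.sum(axis=0),
                                step * lam_shared * weights)
        specific = soft_threshold(specific - step * grads,
                                  step * lam_specific * weights)
    return shared, specific


# Synthetic example (made-up data): TF 0 regulates the gene in both datasets,
# TF 1 only in the first one. Rows of X are samples, columns are TF activities.
rng = np.random.default_rng(0)
true_shared = np.array([2.0, 0.0, 0.0, 0.0, 0.0])
true_specific = np.array([[0.0, 1.5, 0.0, 0.0, 0.0],
                          [0.0, 0.0, 0.0, 0.0, 0.0]])
X_list = [rng.normal(size=(60, 5)) for _ in range(2)]
y_list = [X @ (true_shared + true_specific[k]) + 0.1 * rng.normal(size=60)
          for k, X in enumerate(X_list)]
shared, specific = fit_shared_plus_specific(X_list, y_list, 0.05, 0.05)
print(np.round(shared + specific, 2))  # one row of coefficients per dataset
```

In this toy setup, an interaction present in every dataset is cheaper to encode once in the shared vector than separately in each specific vector, which is how information is pooled across datasets. Lowering an entry of `weights` for a TF would make its edges cheaper to include, a simple way to mimic a prior-informed adaptive penalty.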


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f39f/6363223/a1b15a5fff22/pcbi.1006591.g001.jpg
