BiGSM：通过稀疏建模进行基因调控网络的贝叶斯推断

BiGSM: Bayesian inference of gene regulatory network via sparse modelling.

作者信息

Qin Hang, Garbulowski Mateusz, Sonnhammer Erik L L, Chatterjee Saikat

机构信息

Digital Futures, and School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm 11428, Sweden.

Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Solna 17121, Sweden.

出版信息

Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf318.

DOI:10.1093/bioinformatics/btaf318

PMID:40484997

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12151459/

Abstract

MOTIVATION

Inference of gene regulatory network (GRN) is challenging due to the inherent sparsity of the GRN matrix and noisy expression data, often leading to a high possibility of false positive or negative predictions. To address this, it is essential to leverage the sparsity of the GRN matrix and develop a robust method capable of handling varying levels of noise in the data. Moreover, most existing GRN inference methods produce only fixed point estimates, which lack the flexibility and informativeness for comprehensive network analysis. In contrast, a Bayesian approach that yields closed-form posterior distributions allows probabilistic link selection, offering insights into the statistical confidence of each possible link. Consequently, it is important to engineer a Bayesian GRN inference method and rigorously execute a benchmark evaluation compared to state-of-the-art methods.

RESULTS

We propose a method-Bayesian inference of GRN via Sparse Modelling (BiGSM). BiGSM effectively exploits the sparsity of the GRN matrix and infers the posterior distributions of GRN links from noisy expression data by using the maximum likelihood based learning. We thoroughly benchmarked BiGSM using biological and simulated datasets including GeneNetWeaver, GeneSPIDER, and GRNbenchmark. The benchmark test evaluates its accuracy and robustness across varying noise levels and data models. Using point-estimate based performance measures, BiGSM provides an overall best performance in comparison with several state-of-the-art methods including GENIE3, LASSO, LSCON, and Zscore. Additionally, BiGSM is the only method in the set of competing methods that provides posteriors for the GRN weights, helping to decipher confidence across predictions.

AVAILABILITY AND IMPLEMENTATION

Code implemented via MATLAB and Python are available at Github: https://github.com/SachLab/BiGSM and archived at zenodo.

摘要

动机

基因调控网络（GRN）的推断具有挑战性，因为GRN矩阵固有的稀疏性以及有噪声的表达数据，这常常导致假阳性或假阴性预测的可能性很高。为了解决这个问题，利用GRN矩阵的稀疏性并开发一种能够处理数据中不同噪声水平的稳健方法至关重要。此外，大多数现有的GRN推断方法仅产生固定点估计，这对于全面的网络分析缺乏灵活性和信息性。相比之下，产生封闭形式后验分布的贝叶斯方法允许进行概率性链接选择，从而深入了解每个可能链接的统计置信度。因此，设计一种贝叶斯GRN推断方法并与最先进的方法进行严格的基准评估很重要。

结果

我们提出了一种方法——通过稀疏建模的GRN贝叶斯推断（BiGSM）。BiGSM有效地利用了GRN矩阵的稀疏性，并通过基于最大似然的学习从有噪声的表达数据中推断GRN链接的后验分布。我们使用包括GeneNetWeaver、GeneSPIDER和GRNbenchmark在内的生物和模拟数据集对BiGSM进行了全面的基准测试。该基准测试评估了其在不同噪声水平和数据模型下的准确性和稳健性。使用基于点估计的性能指标，与包括GENIE3、LASSO、LSCON和Zscore在内的几种最先进方法相比，BiGSM提供了总体最佳性能。此外，BiGSM是竞争方法集中唯一一种提供GRN权重后验的方法，有助于解读预测的置信度。

可用性和实现

通过MATLAB和Python实现的代码可在Github上获取：https://github.com/SachLab/BiGSM，并已存档于zenodo。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/814f/12151459/ab1dbcbf1ec5/btaf318f1.jpg

相似文献

BiGSM: Bayesian inference of gene regulatory network via sparse modelling.

Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf318.

Multi-objective context-guided consensus of a massive array of techniques for the inference of Gene Regulatory Networks.

Comput Biol Med. 2024 Sep;179:108850. doi: 10.1016/j.compbiomed.2024.108850. Epub 2024 Jul 15.

MEFFGRN: Matrix enhancement and feature fusion-based method for reconstructing the gene regulatory network of epithelioma papulosum cyprini cells by spring viremia of carp virus infection.

Comput Biol Med. 2024 Sep;179:108835. doi: 10.1016/j.compbiomed.2024.108835. Epub 2024 Jul 11.

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

From Noise to Knowledge: Diffusion Probabilistic Model-Based Neural Inference of Gene Regulatory Networks.

J Comput Biol. 2024 Nov;31(11):1087-1103. doi: 10.1089/cmb.2024.0607. Epub 2024 Oct 10.

Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.

Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.

Incentives for preventing smoking in children and adolescents.

Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.

Interventions for central serous chorioretinopathy: a network meta-analysis.

Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Doppler trans-thoracic echocardiography for detection of pulmonary hypertension in adults.

Cochrane Database Syst Rev. 2022 May 9;5(5):CD012809. doi: 10.1002/14651858.CD012809.pub2.

本文引用的文献

GeneSPIDER2: large scale GRN simulation and benchmarking with perturbed single-cell data.

NAR Genom Bioinform. 2024 Sep 18;6(3):lqae121. doi: 10.1093/nargab/lqae121. eCollection 2024 Sep.

Modeling gene regulatory networks using neural network architectures.

Nat Comput Sci. 2021 Jul;1(7):491-501. doi: 10.1038/s43588-021-00099-8. Epub 2021 Jul 22.

Mechanism-centric regulatory network identifies NME2 and MYC programs as markers of Enzalutamide resistance in CRPC.

Nat Commun. 2024 Jan 8;15(1):352. doi: 10.1038/s41467-024-44686-5.

The Network Zoo: a multilingual package for the inference and analysis of gene regulatory networks.

Genome Biol. 2023 Mar 9;24(1):45. doi: 10.1186/s13059-023-02877-1.

Knowledge of the perturbation design is essential for accurate gene regulatory network inference.

Sci Rep. 2022 Oct 3;12(1):16531. doi: 10.1038/s41598-022-19005-x.

GRNbenchmark - a web server for benchmarking directed gene regulatory network inference methods.

Nucleic Acids Res. 2022 Jul 5;50(W1):W398-W404. doi: 10.1093/nar/gkac377.

Fast and accurate gene regulatory network inference by normalized least squares regression.

Bioinformatics. 2022 Apr 12;38(8):2263-2268. doi: 10.1093/bioinformatics/btac103.

An order independent algorithm for inferring gene regulatory network using quantile value for conditional independence tests.

Sci Rep. 2021 Apr 7;11(1):7605. doi: 10.1038/s41598-021-87074-5.

circRNA-miRNA-mRNA regulatory network in human lung cancer: an update.

Cancer Cell Int. 2020 May 19;20:173. doi: 10.1186/s12935-020-01245-4. eCollection 2020.

Combinatorial single-cell CRISPR screens by direct guide RNA capture and targeted sequencing.

Nat Biotechnol. 2020 Aug;38(8):954-961. doi: 10.1038/s41587-020-0470-y. Epub 2020 Mar 30.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

BiGSM：通过稀疏建模进行基因调控网络的贝叶斯推断

BiGSM: Bayesian inference of gene regulatory network via sparse modelling.

作者信息

Qin Hang, Garbulowski Mateusz, Sonnhammer Erik L L, Chatterjee Saikat

机构信息

Digital Futures, and School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm 11428, Sweden.

Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Solna 17121, Sweden.