一种使用高斯图形模型和蒙特卡罗方法从全基因组基因表达数据学习遗传网络的综合方法。

An Integrated Approach of Learning Genetic Networks From Genome-Wide Gene Expression Data Using Gaussian Graphical Model and Monte Carlo Method.

作者信息

Zhao Haitao, Datta Sujay, Duan Zhong-Hui

机构信息

Department of Mathematics and Computer Science, The University of North Carolina at Pembroke, Pembroke, NC, USA.

Department of Statistics, The University of Akron, Akron, OH, USA.

出版信息

Bioinform Biol Insights. 2023 Feb 27;17:11779322231152972. doi: 10.1177/11779322231152972. eCollection 2023.

DOI:10.1177/11779322231152972

PMID:36865982

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9972065/

Abstract

Global genetic networks provide additional information for the analysis of human diseases, beyond the traditional analysis that focuses on single genes or local networks. The Gaussian graphical model (GGM) is widely applied to learn genetic networks because it defines an undirected graph decoding the conditional dependence between genes. Many algorithms based on the GGM have been proposed for learning genetic network structures. Because the number of gene variables is typically far more than the number of samples collected, and a real genetic network is typically sparse, the graphical lasso implementation of GGM becomes a popular tool for inferring the conditional interdependence among genes. However, graphical lasso, although showing good performance in low dimensional data sets, is computationally expensive and inefficient or even unable to work directly on genome-wide gene expression data sets. In this study, the method of Monte Carlo Gaussian graphical model (MCGGM) was proposed to learn global genetic networks of genes. This method uses a Monte Carlo approach to sample subnetworks from genome-wide gene expression data and graphical lasso to learn the structures of the subnetworks. The learned subnetworks are then integrated to approximate a global genetic network. The proposed method was evaluated with a relatively small real data set of RNA-seq expression levels. The results indicate the proposed method shows a strong ability of decoding the interactions with high conditional dependences among genes. The method was then applied to genome-wide data sets of RNA-seq expression levels. The gene interactions with high interdependence from the estimated global networks show that most of the predicted gene-gene interactions have been reported in the literatures playing important roles in different human cancers. Also, the results validate the ability and reliability of the proposed method to identify high conditional dependences among genes in large-scale data sets.

摘要

全球遗传网络为人类疾病分析提供了额外信息，超越了传统的聚焦于单个基因或局部网络的分析方法。高斯图形模型（GGM）被广泛应用于学习遗传网络，因为它定义了一个无向图来解码基因之间的条件依赖性。许多基于GGM的算法已被提出用于学习遗传网络结构。由于基因变量的数量通常远远超过所收集样本的数量，并且实际的遗传网络通常是稀疏的，GGM的图形套索实现成为推断基因间条件相互依赖性的流行工具。然而，图形套索虽然在低维数据集中表现良好，但计算成本高且效率低下，甚至无法直接处理全基因组范围的基因表达数据集。在本研究中，提出了蒙特卡罗高斯图形模型（MCGGM）方法来学习基因的全球遗传网络。该方法使用蒙特卡罗方法从全基因组范围的基因表达数据中采样子网，并使用图形套索来学习子网的结构。然后将学习到的子网整合起来以近似一个全球遗传网络。所提出的方法用一个相对较小的RNA-seq表达水平真实数据集进行了评估。结果表明所提出的方法具有很强的解码基因间高条件依赖性相互作用的能力。然后该方法被应用于RNA-seq表达水平的全基因组数据集。从估计的全球网络中具有高相互依赖性的基因相互作用表明，大多数预测的基因-基因相互作用已在文献中报道，它们在不同人类癌症中发挥重要作用。此外，结果验证了所提出的方法在大规模数据集中识别基因间高条件依赖性的能力和可靠性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84f6/9972065/d49d5dda8c1a/10.1177_11779322231152972-fig1.jpg

相似文献

An Integrated Approach of Learning Genetic Networks From Genome-Wide Gene Expression Data Using Gaussian Graphical Model and Monte Carlo Method.一种使用高斯图形模型和蒙特卡罗方法从全基因组基因表达数据学习遗传网络的综合方法。

Bioinform Biol Insights. 2023 Feb 27;17:11779322231152972. doi: 10.1177/11779322231152972. eCollection 2023.

Cancer Genetic Network Inference Using Gaussian Graphical Models.使用高斯图形模型进行癌症遗传网络推断

Bioinform Biol Insights. 2019 Apr 8;13:1177932219839402. doi: 10.1177/1177932219839402. eCollection 2019.

An Augmented High-Dimensional Graphical Lasso Method to Incorporate Prior Biological Knowledge for Global Network Learning.一种用于整合先验生物学知识以进行全局网络学习的增强型高维图形套索方法。

Front Genet. 2022 Jan 27;12:760299. doi: 10.3389/fgene.2021.760299. eCollection 2021.

A linear programming approach for estimating the structure of a sparse linear genetic network from transcript profiling data.一种用于从转录谱数据估计稀疏线性遗传网络结构的线性规划方法。

Algorithms Mol Biol. 2009 Feb 24;4:5. doi: 10.1186/1748-7188-4-5.

Pathway Graphical Lasso.通路图形套索法

Proc AAAI Conf Artif Intell. 2015 Jan;2015:2617-2623.

FastGGM: An Efficient Algorithm for the Inference of Gaussian Graphical Model in Biological Networks.FastGGM：一种用于生物网络中高斯图形模型推断的高效算法。

PLoS Comput Biol. 2016 Feb 12;12(2):e1004755. doi: 10.1371/journal.pcbi.1004755. eCollection 2016 Feb.

Tailored graphical lasso for data integration in gene network reconstruction.针对基因网络重构中数据集成的定制图形套索。

BMC Bioinformatics. 2021 Oct 15;22(1):498. doi: 10.1186/s12859-021-04413-z.

Regularized estimation of large-scale gene association networks using graphical Gaussian models.基于图式高斯模型的大规模基因关联网络正则化估计

BMC Bioinformatics. 2009 Nov 24;10:384. doi: 10.1186/1471-2105-10-384.

Weighted lasso in graphical Gaussian modeling for large gene network estimation based on microarray data.基于微阵列数据的大型基因网络估计的图形高斯建模中的加权套索法

Genome Inform. 2007;19:142-53.

Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO.利用差异加权图形套索法，将先验生物学知识纳入基于网络的差异基因表达分析。

BMC Bioinformatics. 2017 Feb 10;18(1):99. doi: 10.1186/s12859-017-1515-1.

引用本文的文献

Utilizing systems genetics to enhance understanding into molecular targets of skin cancer.利用系统遗传学增进对皮肤癌分子靶点的理解。

Exp Dermatol. 2024 Mar;33(3):e15043. doi: 10.1111/exd.15043.

Computational methods in glaucoma research: Current status and future outlook.青光眼研究中的计算方法：现状与展望。

Mol Aspects Med. 2023 Dec;94:101222. doi: 10.1016/j.mam.2023.101222. Epub 2023 Nov 3.

本文引用的文献

Overexpression of in Lung Adenocarcinoma Is a New Independent Prognostic Marker of Poor Survival.肺腺癌中过表达是新的独立预后不良的生存标志物。

Dis Markers. 2019 Dec 7;2019:6019637. doi: 10.1155/2019/6019637. eCollection 2019.

Systematically profiling the expression of eIF3 subunits in glioma reveals the expression of eIF3i has prognostic value in IDH-mutant lower grade glioma.系统性分析胶质瘤中eIF3亚基的表达情况发现，eIF3i的表达在异柠檬酸脱氢酶（IDH）突变的低级别胶质瘤中具有预后价值。

Cancer Cell Int. 2019 Jun 4;19:155. doi: 10.1186/s12935-019-0867-1. eCollection 2019.

EIF3D promotes gallbladder cancer development by stabilizing GRK2 kinase and activating PI3K-AKT signaling pathway.EIF3D通过稳定GRK2激酶并激活PI3K-AKT信号通路促进胆囊癌发展。

Cell Death Dis. 2017 Jun 8;8(6):e2868. doi: 10.1038/cddis.2017.263.

Bottom-up GGM algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways or processes.用于构建调控生物途径或过程的多层层次基因调控网络的自下而上的GGM算法。

BMC Bioinformatics. 2016 Mar 18;17:132. doi: 10.1186/s12859-016-0981-1.

Bayesian Inference for General Gaussian Graphical Models With Application to Multivariate Lattice Data.具有多元格点数据应用的一般高斯图形模型的贝叶斯推断。

J Am Stat Assoc. 2011;106(496):1418-1433. doi: 10.1198/jasa.2011.tm10465. Epub 2012 Dec 24.

Targeting the translation machinery in cancer.靶向肿瘤翻译机制。

Nat Rev Drug Discov. 2015 Apr;14(4):261-78. doi: 10.1038/nrd4505. Epub 2015 Mar 6.

The role of eIF3 and its individual subunits in cancer.真核生物翻译起始因子3（eIF3）及其各个亚基在癌症中的作用。

Biochim Biophys Acta. 2015 Jul;1849(7):792-800. doi: 10.1016/j.bbagrm.2014.10.005. Epub 2014 Nov 1.

Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models.高维图形模型的正则化选择稳定性方法（StARS）

Adv Neural Inf Process Syst. 2010 Dec 31;24(2):1432-1440.

The joint graphical lasso for inverse covariance estimation across multiple classes.用于跨多个类别的逆协方差估计的联合图形套索法。

J R Stat Soc Series B Stat Methodol. 2014 Mar;76(2):373-397. doi: 10.1111/rssb.12033.

Learning gene networks under SNP perturbations using eQTL datasets.利用eQTL数据集在SNP扰动下学习基因网络。

PLoS Comput Biol. 2014 Feb 27;10(2):e1003420. doi: 10.1371/journal.pcbi.1003420. eCollection 2014 Feb.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种使用高斯图形模型和蒙特卡罗方法从全基因组基因表达数据学习遗传网络的综合方法。

An Integrated Approach of Learning Genetic Networks From Genome-Wide Gene Expression Data Using Gaussian Graphical Model and Monte Carlo Method.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献