• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合先验生物学知识进行基因网络重建

Gene Network Reconstruction by Integration of Prior Biological Knowledge.

作者信息

Li Yupeng, Jackson Scott A

机构信息

Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia Institute of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, Georgia Department of Statistics, University of Georgia, Athens, Georgia 30602.

Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia Institute of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, Georgia

出版信息

G3 (Bethesda). 2015 Mar 30;5(6):1075-9. doi: 10.1534/g3.115.018127.

DOI:10.1534/g3.115.018127
PMID:25823587
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4478538/
Abstract

With the development of high-throughput genomic technologies, large, genome-wide datasets have been collected, and the integration of these datasets should provide large-scale, multidimensional, and insightful views of biological systems. We developed a method for gene association network construction based on gene expression data that integrate a variety of biological resources. Assuming gene expression data are from a multivariate Gaussian distribution, a graphical lasso (glasso) algorithm is able to estimate the sparse inverse covariance matrix by a lasso (L1) penalty. The inverse covariance matrix can be seen as direct correlation between gene pairs in the gene association network. In our work, instead of using a single penalty, different penalty values were applied for gene pairs based on a priori knowledge as to whether the two genes should be connected. The a priori information can be calculated or retrieved from other biological data, e.g., Gene Ontology similarity, protein-protein interaction, gene regulatory network. By incorporating prior knowledge, the weighted graphical lasso (wglasso) outperforms the original glasso both on simulations and on data from Arabidopsis. Simulation studies show that even when some prior knowledge is not correct, the overall quality of the wglasso network was still greater than when not incorporating that information, e.g., glasso.

摘要

随着高通量基因组技术的发展,已收集了大量全基因组数据集,这些数据集的整合应能提供生物系统的大规模、多维度且有深刻见解的视图。我们基于整合了多种生物资源的基因表达数据开发了一种基因关联网络构建方法。假设基因表达数据来自多元高斯分布,图形拉索(glasso)算法能够通过拉索(L1)惩罚来估计稀疏逆协方差矩阵。逆协方差矩阵可被视为基因关联网络中基因对之间的直接相关性。在我们的工作中,不是使用单一惩罚,而是基于两个基因是否应相连的先验知识对基因对应用不同的惩罚值。先验信息可从其他生物数据中计算或检索得到,例如基因本体相似性、蛋白质 - 蛋白质相互作用、基因调控网络。通过纳入先验知识,加权图形拉索(wglasso)在模拟和拟南芥数据上均优于原始的glasso。模拟研究表明,即使某些先验知识不正确,wglasso网络的整体质量仍高于不纳入该信息时的情况,例如glasso。

相似文献

1
Gene Network Reconstruction by Integration of Prior Biological Knowledge.通过整合先验生物学知识进行基因网络重建
G3 (Bethesda). 2015 Mar 30;5(6):1075-9. doi: 10.1534/g3.115.018127.
2
Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO.利用差异加权图形套索法,将先验生物学知识纳入基于网络的差异基因表达分析。
BMC Bioinformatics. 2017 Feb 10;18(1):99. doi: 10.1186/s12859-017-1515-1.
3
Tailored graphical lasso for data integration in gene network reconstruction.针对基因网络重构中数据集成的定制图形套索。
BMC Bioinformatics. 2021 Oct 15;22(1):498. doi: 10.1186/s12859-021-04413-z.
4
Weighted lasso in graphical Gaussian modeling for large gene network estimation based on microarray data.基于微阵列数据的大型基因网络估计的图形高斯建模中的加权套索法
Genome Inform. 2007;19:142-53.
5
An Augmented High-Dimensional Graphical Lasso Method to Incorporate Prior Biological Knowledge for Global Network Learning.一种用于整合先验生物学知识以进行全局网络学习的增强型高维图形套索方法。
Front Genet. 2022 Jan 27;12:760299. doi: 10.3389/fgene.2021.760299. eCollection 2021.
6
Inferring gene networks from discrete expression data.从离散表达数据中推断基因网络。
Biostatistics. 2013 Sep;14(4):708-22. doi: 10.1093/biostatistics/kxt021. Epub 2013 Jul 18.
7
The graphical lasso: New insights and alternatives.图形套索:新见解与替代方法。
Electron J Stat. 2012 Nov 9;6:2125-2149. doi: 10.1214/12-EJS740.
8
Weighted Fused Pathway Graphical Lasso for Joint Estimation of Multiple Gene Networks.用于多基因网络联合估计的加权融合路径图模型选择法
Front Genet. 2019 Jul 22;10:623. doi: 10.3389/fgene.2019.00623. eCollection 2019.
9
Sparse time series chain graphical models for reconstructing genetic networks.稀疏时间序列链图模型在遗传网络重建中的应用。
Biostatistics. 2013 Jul;14(3):586-99. doi: 10.1093/biostatistics/kxt005. Epub 2013 Mar 5.
10
Sparse factor model for co-expression networks with an application using prior biological knowledge.具有利用先验生物学知识应用的共表达网络的稀疏因子模型。
Stat Appl Genet Mol Biol. 2016 Jun 1;15(3):253-72. doi: 10.1515/sagmb-2015-0002.

引用本文的文献

1
NetREm: Network Regression Embeddings reveal cell-type transcription factor coordination for gene regulation.NetREm:网络回归嵌入揭示细胞类型转录因子在基因调控中的协同作用。
Bioinform Adv. 2024 Dec 20;5(1):vbae206. doi: 10.1093/bioadv/vbae206. eCollection 2025.
2
Bio-primed machine learning to enhance discovery of relevant biomarkers.生物引发的机器学习以增强相关生物标志物的发现。
NPJ Precis Oncol. 2025 Feb 6;9(1):39. doi: 10.1038/s41698-025-00825-9.
3
CoVar: A generalizable machine learning approach to identify the coordinated regulators driving variational gene expression.

本文引用的文献

1
The huge Package for High-dimensional Undirected Graph Estimation in R.R语言中用于高维无向图估计的庞大软件包。
J Mach Learn Res. 2012 Apr;13:1059-1062.
2
Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models.高维图形模型的正则化选择稳定性方法(StARS)
Adv Neural Inf Process Syst. 2010 Dec 31;24(2):1432-1440.
3
Augmenting microarray data with literature-based knowledge to enhance gene regulatory network inference.利用基于文献的知识增强微阵列数据,以增强基因调控网络推断。
CoVar:一种可推广的机器学习方法,用于识别驱动变异基因表达的协调调控因子。
PLoS Comput Biol. 2024 Apr 17;20(4):e1012016. doi: 10.1371/journal.pcbi.1012016. eCollection 2024 Apr.
4
Optimal entropic properties of SARS-CoV-2 RNA sequences.新型冠状病毒2(SARS-CoV-2)RNA序列的最佳熵特性
R Soc Open Sci. 2024 Jan 31;11(1):231369. doi: 10.1098/rsos.231369. eCollection 2024 Jan.
5
StabJGL: a stability approach to sparsity and similarity selection in multiple-network reconstruction.StabJGL:一种用于多网络重建中稀疏性和相似性选择的稳定性方法。
Bioinform Adv. 2023 Dec 19;3(1):vbad185. doi: 10.1093/bioadv/vbad185. eCollection 2023.
6
Multi-omics regulatory network inference in the presence of missing data.存在缺失数据时的多组学调控网络推断。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad309.
7
Genetic architecture of inter-specific and -generic grass hybrids by network analysis on multi-omics data.基于多组学数据的网络分析研究种间和属间杂交草的遗传结构。
BMC Genomics. 2023 Apr 25;24(1):213. doi: 10.1186/s12864-023-09292-7.
8
Estimating Gaussian Copulas with Missing Data with and without Expert Knowledge.在有和没有专家知识的情况下对存在缺失数据的高斯Copula进行估计。
Entropy (Basel). 2022 Dec 19;24(12):1849. doi: 10.3390/e24121849.
9
Network reconstruction for trans acting genetic loci using multi-omics data and prior information.利用多组学数据和先验信息进行转录遗传基因座的网络重建。
Genome Med. 2022 Nov 7;14(1):125. doi: 10.1186/s13073-022-01124-9.
10
Learning complex dependency structure of gene regulatory networks from high dimensional microarray data with Gaussian Bayesian networks.用高斯贝叶斯网络从高维微阵列数据中学习基因调控网络的复杂依赖结构。
Sci Rep. 2022 Nov 4;12(1):18704. doi: 10.1038/s41598-022-21957-z.
PLoS Comput Biol. 2014 Jun 12;10(6):e1003666. doi: 10.1371/journal.pcbi.1003666. eCollection 2014 Jun.
4
Incorporating prior knowledge into Gene Network Study.将先验知识纳入基因网络研究。
Bioinformatics. 2013 Oct 15;29(20):2633-40. doi: 10.1093/bioinformatics/btt443. Epub 2013 Aug 16.
5
Wisdom of crowds for robust gene network inference.群体智慧在稳健基因网络推断中的应用。
Nat Methods. 2012 Jul 15;9(8):796-804. doi: 10.1038/nmeth.2016.
6
Next-generation genomics: an integrative approach.下一代基因组学:综合方法。
Nat Rev Genet. 2010 Jul;11(7):476-86. doi: 10.1038/nrg2795.
7
Weighted-LASSO for structured network inference from time course data.用于从时间序列数据进行结构化网络推断的加权套索算法
Stat Appl Genet Mol Biol. 2010;9:Article 15. doi: 10.2202/1544-6115.1519. Epub 2010 Feb 1.
8
GOSemSim: an R package for measuring semantic similarity among GO terms and gene products.GO 语义相似度分析:用于测量 GO 术语和基因产物之间语义相似性的 R 包。
Bioinformatics. 2010 Apr 1;26(7):976-8. doi: 10.1093/bioinformatics/btq064. Epub 2010 Feb 23.
9
Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana.利用拟南芥全基因组基因网络合理关联基因与性状。
Nat Biotechnol. 2010 Feb;28(2):149-56. doi: 10.1038/nbt.1603. Epub 2010 Jan 31.
10
Incorporating existing network information into gene network inference.将现有网络信息纳入基因网络推断中。
PLoS One. 2009 Aug 27;4(8):e6799. doi: 10.1371/journal.pone.0006799.