• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

netReg:用于生物关联研究的网络正则化线性模型。

netReg: network-regularized linear models for biological association studies.

机构信息

Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland.

Institute of Computational Biology, Helmholtz Zentrum München, 85764 Neuherberg, Germany.

出版信息

Bioinformatics. 2018 Mar 1;34(5):896-898. doi: 10.1093/bioinformatics/btx677.

DOI:10.1093/bioinformatics/btx677
PMID:29077797
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6030897/
Abstract

SUMMARY

Modelling biological associations or dependencies using linear regression is often complicated when the analyzed data-sets are high-dimensional and less observations than variables are available (n ≪ p). For genomic data-sets penalized regression methods have been applied settling this issue. Recently proposed regression models utilize prior knowledge on dependencies, e.g. in the form of graphs, arguing that this information will lead to more reliable estimates for regression coefficients. However, none of the proposed models for multivariate genomic response variables have been implemented as a computationally efficient, freely available library. In this paper we propose netReg, a package for graph-penalized regression models that use large networks and thousands of variables. netReg incorporates a priori generated biological graph information into linear models yielding sparse or smooth solutions for regression coefficients.

AVAILABILITY AND IMPLEMENTATION

netReg is implemented as both R-package and C ++ commandline tool. The main computations are done in C ++, where we use Armadillo for fast matrix calculations and Dlib for optimization. The R package is freely available on Bioconductorhttps://bioconductor.org/packages/netReg. The command line tool can be installed using the conda channel Bioconda. Installation details, issue reports, development versions, documentation and tutorials for the R and C ++ versions and the R package vignette can be found on GitHub https://dirmeier.github.io/netReg/. The GitHub page also contains code for benchmarking and example datasets used in this paper.

CONTACT

simon.dirmeier@bsse.ethz.ch.

摘要

摘要

当分析的数据集是高维的,并且可用的观测值少于变量时(n ≪ p),使用线性回归来模拟生物关联或依赖性通常会变得复杂。为此,已经应用了惩罚回归方法来处理基因组数据集。最近提出的回归模型利用了依赖性的先验知识,例如以图形的形式,认为这些信息将为回归系数提供更可靠的估计。然而,用于多变量基因组响应变量的提出的模型中,没有一个被实现为计算效率高、免费可用的库。在本文中,我们提出了 netReg,这是一个用于图形惩罚回归模型的软件包,该模型使用大型网络和数千个变量。netReg 将先验生成的生物图形信息纳入线性模型中,为回归系数生成稀疏或平滑的解决方案。

可用性和实现

netReg 既作为 R 包,也作为 C++命令行工具实现。主要计算在 C++中完成,我们在其中使用 Armadillo 进行快速矩阵计算和 Dlib 进行优化。R 包可在 Bioconductorhttps://bioconductor.org/packages/netReg 上免费获得。命令行工具可以使用 conda 频道 Bioconda 安装。有关 R 和 C++版本以及 R 包 vignette 的安装详细信息、问题报告、开发版本、文档和教程都可以在 GitHub https://dirmeier.github.io/netReg/ 上找到。GitHub 页面还包含本文中使用的基准测试和示例数据集的代码。

联系方式

simon.dirmeier@bsse.ethz.ch。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/420b/6030897/3fa3caa64ade/btx677f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/420b/6030897/f84a27e9a5a9/btx677f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/420b/6030897/3fa3caa64ade/btx677f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/420b/6030897/f84a27e9a5a9/btx677f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/420b/6030897/3fa3caa64ade/btx677f2.jpg

相似文献

1
netReg: network-regularized linear models for biological association studies.netReg:用于生物关联研究的网络正则化线性模型。
Bioinformatics. 2018 Mar 1;34(5):896-898. doi: 10.1093/bioinformatics/btx677.
2
graphkernels: R and Python packages for graph comparison.图核:用于图比较的 R 和 Python 包。
Bioinformatics. 2018 Feb 1;34(3):530-532. doi: 10.1093/bioinformatics/btx602.
3
ndexr-an R package to interface with the network data exchange.ndexr 是一个用于与网络数据交换接口的 R 包。
Bioinformatics. 2018 Feb 15;34(4):716-717. doi: 10.1093/bioinformatics/btx683.
4
diffuStats: an R package to compute diffusion-based scores on biological networks.diffuStats:一个用于计算生物网络基于扩散的分数的 R 包。
Bioinformatics. 2018 Feb 1;34(3):533-534. doi: 10.1093/bioinformatics/btx632.
5
Interactive network visualization in Jupyter notebooks: visJS2jupyter.交互式网络可视化在 Jupyter 笔记本:visJS2jupyter。
Bioinformatics. 2018 Jan 1;34(1):126-128. doi: 10.1093/bioinformatics/btx581.
6
EGAD: ultra-fast functional analysis of gene networks.EGAD:基因网络的超快速功能分析
Bioinformatics. 2017 Feb 15;33(4):612-614. doi: 10.1093/bioinformatics/btw695.
7
Modeling and simulating networks of interdependent protein interactions.相互依赖的蛋白质相互作用网络的建模与模拟
Integr Biol (Camb). 2018 May 21;10(5):290-305. doi: 10.1039/c8ib00012c.
8
biospear: an R package for biomarker selection in penalized Cox regression.biospear:用于惩罚 Cox 回归中生物标志物选择的 R 包。
Bioinformatics. 2018 Jan 1;34(1):112-113. doi: 10.1093/bioinformatics/btx560.
9
KEGGgraph: a graph approach to KEGG PATHWAY in R and bioconductor.KEGGgraph:R语言和生物导体中KEGG通路的图形化方法
Bioinformatics. 2009 Jun 1;25(11):1470-1. doi: 10.1093/bioinformatics/btp167. Epub 2009 Mar 23.
10
eNetXplorer: an R package for the quantitative exploration of elastic net families for generalized linear models.eNetXplorer:用于广义线性模型中弹性网络家族的定量探索的 R 包。
BMC Bioinformatics. 2019 Apr 16;20(1):189. doi: 10.1186/s12859-019-2778-5.

引用本文的文献

1
NetREm: Network Regression Embeddings reveal cell-type transcription factor coordination for gene regulation.NetREm:网络回归嵌入揭示细胞类型转录因子在基因调控中的协同作用。
Bioinform Adv. 2024 Dec 20;5(1):vbae206. doi: 10.1093/bioadv/vbae206. eCollection 2025.
2
SCEMENT: scalable and memory efficient integration of large-scale single-cell RNA-sequencing data.SCEMENT:大规模单细胞RNA测序数据的可扩展且内存高效的整合
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf057.
3
Coherent pathway enrichment estimation by modeling inter-pathway dependencies using regularized regression.

本文引用的文献

1
DegreeCox - a network-based regularization method for survival analysis.DegreeCox——一种用于生存分析的基于网络的正则化方法。
BMC Bioinformatics. 2016 Dec 13;17(Suppl 16):449. doi: 10.1186/s12859-016-1310-4.
2
Graph-regularized dual Lasso for robust eQTL mapping.基于图正则化的双拉索方法在稳健的 eQTL 映射中的应用。
Bioinformatics. 2014 Jun 15;30(12):i139-48. doi: 10.1093/bioinformatics/btu293.
3
Network-based penalized regression with application to genomic data.基于网络的惩罚回归及其在基因组数据中的应用。
使用正则化回归建模通路间的依赖关系进行连贯通路富集估计。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad522.
4
Transcriptome analysis of lncRNA expression patterns in human congenital lung malformations.人类先天性肺畸形中长链非编码 RNA 表达模式的转录组分析。
BMC Genomics. 2021 Nov 29;22(1):861. doi: 10.1186/s12864-021-08204-x.
5
Incorporating biological structure into machine learning models in biomedicine.将生物结构纳入生物医学中的机器学习模型。
Curr Opin Biotechnol. 2020 Jun;63:126-134. doi: 10.1016/j.copbio.2019.12.021. Epub 2020 Jan 18.
6
Prediction of response to anti-cancer drugs becomes robust via network integration of molecular data.通过分子数据的网络集成,对抗癌药物的反应预测变得更加可靠。
Sci Rep. 2019 Feb 20;9(1):2379. doi: 10.1038/s41598-019-39019-2.
Biometrics. 2013 Sep;69(3):582-93. doi: 10.1111/biom.12035. Epub 2013 Jul 3.
4
Network enrichment analysis: extension of gene-set enrichment analysis to gene networks.网络富集分析:基因集富集分析向基因网络的扩展。
BMC Bioinformatics. 2012 Sep 11;13:226. doi: 10.1186/1471-2105-13-226.
5
VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS.具有基因组学应用的图结构协变量的变量选择与回归分析
Ann Appl Stat. 2010 Sep 1;4(3):1498-1516. doi: 10.1214/10-AOAS332.
6
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
J Stat Softw. 2010;33(1):1-22.
7
WGCNA: an R package for weighted correlation network analysis.WGCNA:一个用于加权相关网络分析的R软件包。
BMC Bioinformatics. 2008 Dec 29;9:559. doi: 10.1186/1471-2105-9-559.
8
Network-constrained regularization and variable selection for analysis of genomic data.用于基因组数据分析的网络约束正则化和变量选择
Bioinformatics. 2008 May 1;24(9):1175-82. doi: 10.1093/bioinformatics/btn081. Epub 2008 Mar 1.
9
Genetic interactions between polymorphisms that affect gene expression in yeast.影响酵母基因表达的多态性之间的遗传相互作用。
Nature. 2005 Aug 4;436(7051):701-3. doi: 10.1038/nature03865.
10
Multiple locus linkage analysis of genomewide expression in yeast.酵母全基因组表达的多位点连锁分析
PLoS Biol. 2005 Aug;3(8):e267. doi: 10.1371/journal.pbio.0030267. Epub 2005 Jul 26.