• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于归一化最小二乘回归的快速准确基因调控网络推断。

Fast and accurate gene regulatory network inference by normalized least squares regression.

机构信息

Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, 17121 Solna, Sweden.

Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, 75185 Uppsala, Sweden.

出版信息

Bioinformatics. 2022 Apr 12;38(8):2263-2268. doi: 10.1093/bioinformatics/btac103.

DOI:10.1093/bioinformatics/btac103
PMID:35176145
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9004640/
Abstract

MOTIVATION

Inferring an accurate gene regulatory network (GRN) has long been a key goal in the field of systems biology. To do this, it is important to find a suitable balance between the maximum number of true positive and the minimum number of false-positive interactions. Another key feature is that the inference method can handle the large size of modern experimental data, meaning the method needs to be both fast and accurate. The Least Squares Cut-Off (LSCO) method can fulfill both these criteria, however as it is based on least squares it is vulnerable to known issues of amplifying extreme values, small or large. In GRN this manifests itself with genes that are erroneously hyper-connected to a large fraction of all genes due to extremely low value fold changes.

RESULTS

We developed a GRN inference method called Least Squares Cut-Off with Normalization (LSCON) that tackles this problem. LSCON extends the LSCO algorithm by regularization to avoid hyper-connected genes and thereby reduce false positives. The regularization used is based on normalization, which removes effects of extreme values on the fit. We benchmarked LSCON and compared it to Genie3, LASSO, LSCO and Ridge regression, in terms of accuracy, speed and tendency to predict hyper-connected genes. The results show that LSCON achieves better or equal accuracy compared to LASSO, the best existing method, especially for data with extreme values. Thanks to the speed of least squares regression, LSCON does this an order of magnitude faster than LASSO.

AVAILABILITY AND IMPLEMENTATION

Data: https://bitbucket.org/sonnhammergrni/lscon; Code: https://bitbucket.org/sonnhammergrni/genespider.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

长期以来,推断准确的基因调控网络(GRN)一直是系统生物学领域的一个关键目标。为此,在最大数量的真阳性和最小数量的假阳性相互作用之间找到一个合适的平衡是很重要的。另一个关键特征是,推断方法可以处理现代实验数据的大规模,这意味着该方法需要既快速又准确。最小二乘截止(LSCO)方法可以满足这两个标准,但是由于它基于最小二乘法,因此容易受到放大极值、小或大的已知问题的影响。在 GRN 中,由于值变化极小,这些基因错误地与大量基因高度连接,这表现为基因错误地与大量基因高度连接。

结果

我们开发了一种名为 Least Squares Cut-Off with Normalization(LSCON)的 GRN 推断方法,该方法可以解决这个问题。LSCON 通过正则化扩展 LSCO 算法,避免基因过度连接,从而减少假阳性。所使用的正则化基于归一化,它消除了极值对拟合的影响。我们对 LSCON 进行了基准测试,并在准确性、速度和预测过度连接基因的倾向方面将其与 Genie3、LASSO、LSCO 和 Ridge 回归进行了比较。结果表明,与 LASSO(现有最好的方法)相比,LSCON 具有更好或相等的准确性,特别是对于具有极值的数据。由于最小二乘回归的速度,LSCON 比 LASSO 快一个数量级。

可用性和实现

数据:https://bitbucket.org/sonnhammergrni/lscon;代码:https://bitbucket.org/sonnhammergrni/genespider。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/de2c05512104/btac103f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/26a4efc0fd8b/btac103f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/13c034f6d18b/btac103f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/7e3bb9723bec/btac103f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/e6bc6338efee/btac103f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/de2c05512104/btac103f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/26a4efc0fd8b/btac103f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/13c034f6d18b/btac103f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/7e3bb9723bec/btac103f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/e6bc6338efee/btac103f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c4c/9004640/de2c05512104/btac103f5.jpg

相似文献

1
Fast and accurate gene regulatory network inference by normalized least squares regression.基于归一化最小二乘回归的快速准确基因调控网络推断。
Bioinformatics. 2022 Apr 12;38(8):2263-2268. doi: 10.1093/bioinformatics/btac103.
2
Topology-based metrics for finding the optimal sparsity in gene regulatory network inference.基于拓扑结构的指标在基因调控网络推断中寻找最优稀疏性
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf120.
3
A generalized framework for controlling FDR in gene regulatory network inference.一种用于控制基因调控网络推断中 FDR 的通用框架。
Bioinformatics. 2019 Mar 15;35(6):1026-1032. doi: 10.1093/bioinformatics/bty764.
4
Inferring the experimental design for accurate gene regulatory network inference.推断实验设计以进行准确的基因调控网络推断。
Bioinformatics. 2021 Oct 25;37(20):3553-3559. doi: 10.1093/bioinformatics/btab367.
5
PoLoBag: Polynomial Lasso Bagging for signed gene regulatory network inference from expression data.PoLoBag:基于表达数据的有符号基因调控网络推断的多项式套索装袋
Bioinformatics. 2021 Jan 29;36(21):5187-5193. doi: 10.1093/bioinformatics/btaa651.
6
Optimal Sparsity Selection Based on an Information Criterion for Accurate Gene Regulatory Network Inference.基于信息准则的最优稀疏性选择用于准确的基因调控网络推断
Front Genet. 2022 Jul 13;13:855770. doi: 10.3389/fgene.2022.855770. eCollection 2022.
7
Inferring gene targets of drugs and chemical compounds from gene expression profiles.从基因表达谱推断药物和化合物的基因靶点。
Bioinformatics. 2016 Jul 15;32(14):2120-7. doi: 10.1093/bioinformatics/btw148. Epub 2016 Mar 18.
8
SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference.SRGS:基于稀疏偏最小二乘的递归基因选择用于基因调控网络推断。
BMC Genomics. 2022 Nov 30;23(1):782. doi: 10.1186/s12864-022-09020-7.
9
Fast and accurate inference of gene regulatory networks through robust precision matrix estimation.通过稳健的精度矩阵估计实现基因调控网络的快速准确推断。
Bioinformatics. 2022 May 13;38(10):2802-2809. doi: 10.1093/bioinformatics/btac178.
10
Optimal design of gene knockout experiments for gene regulatory network inference.用于基因调控网络推断的基因敲除实验的优化设计。
Bioinformatics. 2016 Mar 15;32(6):875-83. doi: 10.1093/bioinformatics/btv672. Epub 2015 Nov 14.

引用本文的文献

1
BiGSM: Bayesian inference of gene regulatory network via sparse modelling.BiGSM:通过稀疏建模进行基因调控网络的贝叶斯推断
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf318.
2
Topology-based metrics for finding the optimal sparsity in gene regulatory network inference.基于拓扑结构的指标在基因调控网络推断中寻找最优稀疏性
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf120.
3
Gene regulatory network analysis identifies MYL1, MDH2, GLS, and TRIM28 as the principal proteins in the response of mesenchymal stem cells to Mg ions.
基因调控网络分析确定MYL1、MDH2、GLS和TRIM28为间充质干细胞对镁离子反应中的主要蛋白质。
Comput Struct Biotechnol J. 2024 Apr 14;23:1773-1785. doi: 10.1016/j.csbj.2024.04.033. eCollection 2024 Dec.
4
Topological benchmarking of algorithms to infer gene regulatory networks from single-cell RNA-seq data.从单细胞 RNA-seq 数据中推断基因调控网络的算法的拓扑基准测试。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae267.
5
Reversing pathological cell states: the road less travelled can extend the therapeutic horizon.逆转病理性细胞状态:少有人走的路可以拓宽治疗视野。
Trends Cell Biol. 2023 Nov;33(11):913-923. doi: 10.1016/j.tcb.2023.04.004. Epub 2023 May 30.