• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解决网络推理中的错误发现问题。

Addressing false discoveries in network inference.

作者信息

Petri Tobias, Altmann Stefan, Geistlinger Ludwig, Zimmer Ralf, Küffner Robert

机构信息

Ludwig-Maximilians-Universität München, Institut für Informatik, Munich, Germany and.

Ludwig-Maximilians-Universität München, Institut für Informatik, Munich, Germany and Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.

出版信息

Bioinformatics. 2015 Sep 1;31(17):2836-43. doi: 10.1093/bioinformatics/btv215. Epub 2015 Apr 24.

DOI:10.1093/bioinformatics/btv215
PMID:25910697
Abstract

MOTIVATION

Experimentally determined gene regulatory networks can be enriched by computational inference from high-throughput expression profiles. However, the prediction of regulatory interactions is severely impaired by indirect and spurious effects, particularly for eukaryotes. Recently, published methods report improved predictions by exploiting the a priori known targets of a regulator (its local topology) in addition to expression profiles.

RESULTS

We find that methods exploiting known targets show an unexpectedly high rate of false discoveries. This leads to inflated performance estimates and the prediction of an excessive number of new interactions for regulators with many known targets. These issues are hidden from common evaluation and cross-validation setups, which is due to Simpson's paradox. We suggest a confidence score recalibration method (CoRe) that reduces the false discovery rate and enables a reliable performance estimation.

CONCLUSIONS

CoRe considerably improves the results of network inference methods that exploit known targets. Predictions then display the biological process specificity of regulators more correctly and enable the inference of accurate genome-wide regulatory networks in eukaryotes. For yeast, we propose a network with more than 22 000 confident interactions. We point out that machine learning approaches outside of the area of network inference may be affected as well.

AVAILABILITY AND IMPLEMENTATION

Results, executable code and networks are available via our website http://www.bio.ifi.lmu.de/forschung/CoRe.

CONTACT

robert.kueffner@helmholtz-muenchen.de

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

通过高通量表达谱的计算推断可以丰富实验确定的基因调控网络。然而,调控相互作用的预测受到间接和虚假效应的严重影响,尤其是对于真核生物。最近,已发表的方法报告称,除了表达谱之外,通过利用调节因子的先验已知靶点(其局部拓扑结构),预测得到了改进。

结果

我们发现,利用已知靶点的方法显示出意外高的错误发现率。这导致性能估计虚高,并为具有许多已知靶点的调节因子预测了过多的新相互作用。由于辛普森悖论,这些问题在常见的评估和交叉验证设置中被隐藏了起来。我们提出了一种置信度评分重新校准方法(CoRe),该方法可降低错误发现率,并实现可靠的性能估计。

结论

CoRe显著改善了利用已知靶点的网络推断方法的结果。这样的预测能够更准确地展示调节因子的生物学过程特异性,并能够推断出真核生物中准确的全基因组调控网络。对于酵母,我们提出了一个具有超过22000个可信相互作用的网络。我们指出,网络推断领域之外的机器学习方法也可能受到影响。

可用性和实现方式

结果、可执行代码和网络可通过我们的网站http://www.bio.ifi.lmu.de/forschung/CoRe获得。

联系方式

robert.kueffner@helmholtz-muenchen.de

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Addressing false discoveries in network inference.解决网络推理中的错误发现问题。
Bioinformatics. 2015 Sep 1;31(17):2836-43. doi: 10.1093/bioinformatics/btv215. Epub 2015 Apr 24.
2
Biological Network Inference and analysis using SEBINI and CABIN.使用SEBINI和CABIN进行生物网络推断与分析。
Methods Mol Biol. 2009;541:551-76. doi: 10.1007/978-1-59745-243-4_24.
3
Large-scale dynamic gene regulatory network inference combining differential equation models with local dynamic Bayesian network analysis.大规模动态基因调控网络推断:结合微分方程模型与局部动态贝叶斯网络分析。
Bioinformatics. 2011 Oct 1;27(19):2686-91. doi: 10.1093/bioinformatics/btr454. Epub 2011 Aug 4.
4
RedNemo: topology-based PPI network reconstruction via repeated diffusion with neighborhood modifications.RedNemo:通过带邻域修改的重复扩散进行基于拓扑的蛋白质-蛋白质相互作用网络重建。
Bioinformatics. 2017 Feb 15;33(4):537-544. doi: 10.1093/bioinformatics/btw655.
5
Advances to Bayesian network inference for generating causal networks from observational biological data.贝叶斯网络推理在从观测生物数据生成因果网络方面的进展。
Bioinformatics. 2004 Dec 12;20(18):3594-603. doi: 10.1093/bioinformatics/bth448. Epub 2004 Jul 29.
6
NAIL, a software toolset for inferring, analyzing and visualizing regulatory networks.NAIL,一种用于推断、分析和可视化调控网络的软件工具集。
Bioinformatics. 2015 Jan 15;31(2):277-8. doi: 10.1093/bioinformatics/btu612. Epub 2014 Sep 21.
7
AVID: an integrative framework for discovering functional relationships among proteins.AVID:一个用于发现蛋白质间功能关系的综合框架。
BMC Bioinformatics. 2005 Jun 1;6:136. doi: 10.1186/1471-2105-6-136.
8
A novel procedure for statistical inference and verification of gene regulatory subnetwork.一种用于基因调控子网统计推断和验证的新方法。
BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S7. doi: 10.1186/1471-2105-16-S7-S7. Epub 2015 Apr 23.
9
Quantitative characterization of the transcriptional regulatory network in the yeast cell cycle.酵母细胞周期中转录调控网络的定量表征
Bioinformatics. 2004 Aug 12;20(12):1914-27. doi: 10.1093/bioinformatics/bth178. Epub 2004 Mar 25.
10
CompareSVM: supervised, Support Vector Machine (SVM) inference of gene regularity networks.CompareSVM:监督式支持向量机(SVM)对基因调控网络的推断
BMC Bioinformatics. 2014 Nov 30;15(1):395. doi: 10.1186/s12859-014-0395-x.

引用本文的文献

1
Leveraging dynamic stability to infer regulation in protein-protein interaction networks: A study of infectious vulnerability in COPD.利用动态稳定性推断蛋白质-蛋白质相互作用网络中的调控:慢性阻塞性肺疾病感染易感性研究
PLoS One. 2025 Sep 5;20(9):e0326062. doi: 10.1371/journal.pone.0326062. eCollection 2025.
2
The probability of edge existence due to node degree: a baseline for network-based predictions.节点度导致边缘存在的概率:基于网络的预测的基线。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae001.
3
The probability of edge existence due to node degree: a baseline for network-based predictions.
基于节点度的边存在概率:基于网络预测的基线
bioRxiv. 2023 Jan 6:2023.01.05.522939. doi: 10.1101/2023.01.05.522939.
4
Experimental guidance for discovering genetic networks through hypothesis reduction on time series.通过时间序列假说缩减发现遗传网络的实验指导
PLoS Comput Biol. 2022 Oct 10;18(10):e1010145. doi: 10.1371/journal.pcbi.1010145. eCollection 2022 Oct.
5
A systematic evaluation of the computational tools for ligand-receptor-based cell-cell interaction inference.基于配体-受体的细胞-细胞相互作用推断的计算工具的系统评价。
Brief Funct Genomics. 2022 Sep 16;21(5):339-356. doi: 10.1093/bfgp/elac019.
6
Weighted Gene Correlation Network Meta-Analysis Reveals Functional Candidate Genes Associated with High- and Sub-Fertile Reproductive Performance in Beef Cattle.加权基因相关网络荟萃分析揭示了与肉牛高繁殖力和低繁殖力相关的功能候选基因。
Genes (Basel). 2020 May 12;11(5):543. doi: 10.3390/genes11050543.
7
Embracing Systems Toxicology at Single-Cell Resolution.以单细胞分辨率拥抱系统毒理学。
Curr Opin Toxicol. 2019 Aug;16:49-57. doi: 10.1016/j.cotox.2019.04.003. Epub 2019 Apr 19.
8
Combining multi-OMICs information to identify key-regulator genes for pleiotropic effect on fertility and production traits in beef cattle.结合多组学信息鉴定对肉牛多产性状和繁殖性能具有多效影响的关键调控基因。
PLoS One. 2018 Oct 18;13(10):e0205295. doi: 10.1371/journal.pone.0205295. eCollection 2018.
9
Semi-supervised network inference using simulated gene expression dynamics.基于模拟基因表达动力学的半监督网络推断。
Bioinformatics. 2018 Apr 1;34(7):1148-1156. doi: 10.1093/bioinformatics/btx748.
10
A prior-based integrative framework for functional transcriptional regulatory network inference.基于先验的功能转录调控网络推断综合框架。
Nucleic Acids Res. 2017 Feb 28;45(4):e21. doi: 10.1093/nar/gkw963.