Suppr超能文献

基于模拟基因表达动力学的半监督网络推断。

Semi-supervised network inference using simulated gene expression dynamics.

机构信息

Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, IL 60208, USA.

Biostatistics Division, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA.

出版信息

Bioinformatics. 2018 Apr 1;34(7):1148-1156. doi: 10.1093/bioinformatics/btx748.

Abstract

MOTIVATION

Inferring the structure of gene regulatory networks from high-throughput datasets remains an important and unsolved problem. Current methods are hampered by problems such as noise, low sample size, and incomplete characterizations of regulatory dynamics, leading to networks with missing and anomalous links. Integration of prior network information (e.g. from pathway databases) has the potential to improve reconstructions.

RESULTS

We developed a semi-supervised network reconstruction algorithm that enables the synthesis of information from partially known networks with time course gene expression data. We adapted partial least square-variable importance in projection (VIP) for time course data and used reference networks to simulate expression data from which null distributions of VIP scores are generated and used to estimate edge probabilities for input expression data. By using simulated dynamics to generate reference distributions, this approach incorporates previously known regulatory relationships and links the network to the dynamics to form a semi-supervised approach that discovers novel and anomalous connections. We applied this approach to data from a sleep deprivation study with KEGG pathways treated as prior networks, as well as to synthetic data from several DREAM challenges, and find that it is able to recover many of the true edges and identify errors in these networks, suggesting its ability to derive posterior networks that accurately reflect gene expression dynamics.

AVAILABILITY AND IMPLEMENTATION

R code is available at https://github.com/pn51/postPLSR.

CONTACT

rbraun@northwestern.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

从高通量数据集推断基因调控网络的结构仍然是一个重要且未解决的问题。当前的方法受到噪声、样本量小以及调控动态不完全描述等问题的阻碍,导致网络中存在缺失和异常的链接。整合先前的网络信息(例如来自途径数据库的信息)有可能改善重建。

结果

我们开发了一种半监督网络重建算法,能够将部分已知网络的信息与时间过程基因表达数据相结合。我们为时间过程数据改编了偏最小二乘变量重要性投影(VIP),并使用参考网络模拟表达数据,从这些数据中生成 VIP 得分的零分布,并用于估计输入表达数据的边缘概率。通过使用模拟动态生成参考分布,该方法整合了先前已知的调控关系,并将网络与动态联系起来,形成一种半监督方法,从而发现新的和异常的连接。我们将这种方法应用于一项睡眠剥夺研究的数据,其中 KEGG 途径被视为先验网络,以及来自几个 DREAM 挑战的合成数据,发现它能够恢复许多真实的边缘,并识别这些网络中的错误,这表明它能够得出准确反映基因表达动态的后验网络。

可用性和实现

R 代码可在 https://github.com/pn51/postPLSR 上获得。

联系方式

rbraun@northwestern.edu

补充信息

补充数据可在《生物信息学》在线获得。

相似文献

3
Time-lagged Ordered Lasso for network inference.时滞有序套索在网络推断中的应用。
BMC Bioinformatics. 2018 Dec 29;19(1):545. doi: 10.1186/s12859-018-2558-7.
6

引用本文的文献

1
Network-based approaches for modeling disease regulation and progression.基于网络的疾病调控与进展建模方法。
Comput Struct Biotechnol J. 2022 Dec 16;21:780-795. doi: 10.1016/j.csbj.2022.12.022. eCollection 2023.
3
..
J Biosci. 2022;47(2). doi: 10.1007/s12038-022-00253-y.
4
Time-lagged Ordered Lasso for network inference.时滞有序套索在网络推断中的应用。
BMC Bioinformatics. 2018 Dec 29;19(1):545. doi: 10.1186/s12859-018-2558-7.

本文引用的文献

2
DTW-MIC Coexpression Networks from Time-Course Data.来自时间进程数据的动态时间规整-互信息共表达网络
PLoS One. 2016 Mar 31;11(3):e0152648. doi: 10.1371/journal.pone.0152648. eCollection 2016.
3
Comparative analysis of gene regulatory networks: from network reconstruction to evolution.比较基因调控网络分析:从网络重构到进化。
Annu Rev Cell Dev Biol. 2015;31:399-428. doi: 10.1146/annurev-cellbio-100913-012908. Epub 2015 Sep 3.
4
Integrative random forest for gene regulatory network inference.用于基因调控网络推断的集成随机森林
Bioinformatics. 2015 Jun 15;31(12):i197-205. doi: 10.1093/bioinformatics/btv268.
6
Addressing false discoveries in network inference.解决网络推理中的错误发现问题。
Bioinformatics. 2015 Sep 1;31(17):2836-43. doi: 10.1093/bioinformatics/btv215. Epub 2015 Apr 24.
8
Data, information, knowledge and principle: back to metabolism in KEGG.数据、信息、知识和原理:回到 KEGG 的代谢途径中。
Nucleic Acids Res. 2014 Jan;42(Database issue):D199-205. doi: 10.1093/nar/gkt1076. Epub 2013 Nov 7.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验