• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对不规则采样时间序列进行格兰杰因果检验及其在拟南芥氮信号传导中的应用。

Granger-causal testing for irregularly sampled time series with application to nitrogen signalling in Arabidopsis.

作者信息

Heerah Sachin, Molinari Roberto, Guerrier Stéphane, Marshall-Colon Amy

机构信息

Department of Plant Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA.

Department of Mathematics and Statistics, Auburn University, Auburn, AL 36849, USA.

出版信息

Bioinformatics. 2021 Aug 25;37(16):2450-2460. doi: 10.1093/bioinformatics/btab126.

DOI:10.1093/bioinformatics/btab126
PMID:33693548
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8388030/
Abstract

MOTIVATION

Identification of system-wide causal relationships can contribute to our understanding of long-distance, intercellular signalling in biological organisms. Dynamic transcriptome analysis holds great potential to uncover coordinated biological processes between organs. However, many existing dynamic transcriptome studies are characterized by sparse and often unevenly spaced time points that make the identification of causal relationships across organs analytically challenging. Application of existing statistical models, designed for regular time series with abundant time points, to sparse data may fail to reveal biologically significant, causal relationships. With increasing research interest in biological time series data, there is a need for new statistical methods that are able to determine causality within and between time series data sets. Here, a statistical framework was developed to identify (Granger) causal gene-gene relationships of unevenly spaced, multivariate time series data from two different tissues of Arabidopsis thaliana in response to a nitrogen signal.

RESULTS

This work delivers a statistical approach for modelling irregularly sampled bivariate signals which embeds functions from the domain of engineering that allow to adapt the model's dependence structure to the specific sampling time. Using maximum-likelihood to estimate the parameters of this model for each bivariate time series, it is then possible to use bootstrap procedures for small samples (or asymptotics for large samples) in order to test for Granger-Causality. When applied to the A.thaliana data, the proposed approach produced 3078 significant interactions, in which 2012 interactions have root causal genes and 1066 interactions have shoot causal genes. Many of the predicted causal and target genes are known players in local and long-distance nitrogen signalling, including genes encoding transcription factors, hormones and signalling peptides. Of the 1007 total causal genes (either organ), 384 are either known or predicted mobile transcripts, suggesting that the identified causal genes may be directly involved in long-distance nitrogen signalling through intercellular interactions. The model predictions and subsequent network analysis identified nitrogen-responsive genes that can be further tested for their specific roles in long-distance nitrogen signalling.

AVAILABILITY AND IMPLEMENTATION

The method was developed with the R statistical software and is made available through the R package 'irg' hosted on the GitHub repository https://github.com/SMAC-Group/irg where also a running example vignette can be found (https://smac-group.github.io/irg/articles/vignette.html). A few signals from the original data set are made available in the package as an example to apply the method and the complete A.thaliana data can be found at: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE97500.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5401/8388030/795e7b71f0b7/btab126f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5401/8388030/2e3ab97502b5/btab126f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5401/8388030/5a7fb3f77bb4/btab126f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5401/8388030/795e7b71f0b7/btab126f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5401/8388030/2e3ab97502b5/btab126f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5401/8388030/5a7fb3f77bb4/btab126f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5401/8388030/795e7b71f0b7/btab126f3.jpg
摘要

动机

识别全系统的因果关系有助于我们理解生物体内的长距离细胞间信号传导。动态转录组分析在揭示器官间协调的生物学过程方面具有巨大潜力。然而,许多现有的动态转录组研究的特点是时间点稀疏且间隔往往不均匀,这使得跨器官识别因果关系在分析上具有挑战性。将为具有丰富时间点的规则时间序列设计的现有统计模型应用于稀疏数据,可能无法揭示生物学上显著的因果关系。随着对生物时间序列数据的研究兴趣日益增加,需要新的统计方法来确定时间序列数据集内部和之间的因果关系。在此,开发了一个统计框架,以识别拟南芥两种不同组织响应氮信号的不均匀间隔多变量时间序列数据中的(格兰杰)因果基因-基因关系。

结果

这项工作提供了一种对不规则采样的双变量信号进行建模的统计方法,该方法嵌入了工程领域的函数,允许使模型的依赖结构适应特定的采样时间。通过最大似然估计每个双变量时间序列的该模型参数,然后可以使用小样本的自助程序(或大样本的渐近方法)来检验格兰杰因果关系。当应用于拟南芥数据时,所提出的方法产生了3078个显著的相互作用,其中2012个相互作用有根因果基因,1066个相互作用有地上部因果基因。许多预测的因果基因和靶基因是局部和长距离氮信号传导中的已知参与者,包括编码转录因子、激素和信号肽的基因。在总共1007个因果基因(任一器官)中,384个是已知或预测的移动转录本,这表明所识别的因果基因可能通过细胞间相互作用直接参与长距离氮信号传导。模型预测和随后的网络分析确定了可进一步测试其在长距离氮信号传导中特定作用的氮响应基因。

可用性和实现

该方法是用R统计软件开发的,可通过托管在GitHub仓库https://github.com/SMAC-Group/irg上 的R包“irg”获得,在该仓库中还可以找到一个运行示例小插图(https://smac-group.github.io/irg/articles/vignette.html)。该包中提供了原始数据集中的一些信号作为应用该方法的示例,完整的拟南芥数据可在以下网址找到:https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE97500。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Granger-causal testing for irregularly sampled time series with application to nitrogen signalling in Arabidopsis.对不规则采样时间序列进行格兰杰因果检验及其在拟南芥氮信号传导中的应用。
Bioinformatics. 2021 Aug 25;37(16):2450-2460. doi: 10.1093/bioinformatics/btab126.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Causal network inference from gene transcriptional time-series response to glucocorticoids.从基因转录时间序列对糖皮质激素的响应中推断因果关系网络。
PLoS Comput Biol. 2021 Jan 29;17(1):e1008223. doi: 10.1371/journal.pcbi.1008223. eCollection 2021 Jan.
4
teff: estimation of Treatment EFFects on transcriptomic data using causal random forest.teff:使用因果随机森林估计转录组数据的治疗效果。
Bioinformatics. 2022 May 26;38(11):3124-3125. doi: 10.1093/bioinformatics/btac269.
5
BELMM: Bayesian model selection and random walk smoothing in time-series clustering.BELMM:时间序列聚类中的贝叶斯模型选择和随机游走平滑。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad686.
6
Using multivariate cross correlations, Granger causality and graphical models to quantify spatiotemporal synchronization and causality between pest populations.运用多元交叉相关性、格兰杰因果关系和图形模型来量化害虫种群之间的时空同步性和因果关系。
BMC Ecol. 2016 Aug 5;16:33. doi: 10.1186/s12898-016-0087-7.
7
Detectability of Granger causality for subsampled continuous-time neurophysiological processes.对下采样连续时间神经生理过程的格兰杰因果关系的可检测性。
J Neurosci Methods. 2017 Jan 1;275:93-121. doi: 10.1016/j.jneumeth.2016.10.016. Epub 2016 Nov 5.
8
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
9
Granger causality for compressively sensed sparse signals.压缩感知稀疏信号的格兰杰因果关系。
Phys Rev E. 2023 Mar;107(3-1):034308. doi: 10.1103/PhysRevE.107.034308.
10
CausalTAB: the PSI-MITAB 2.8 updated format for signalling data representation and dissemination.因果 TAB:PSI-MITAB 2.8 更新的信号数据表示和传播格式。
Bioinformatics. 2019 Oct 1;35(19):3779-3785. doi: 10.1093/bioinformatics/btz132.

引用本文的文献

1
Model-to-crop conserved NUE Regulons enhance machine learning predictions of nitrogen use efficiency.模型到作物保守的氮利用效率调控子增强了机器学习对氮利用效率的预测。
Plant Cell. 2025 May 9;37(5). doi: 10.1093/plcell/koaf093.
2
CAT Bridge: an efficient toolkit for gene-metabolite association mining from multiomics data.CAT Bridge:一种从多组学数据中挖掘基因-代谢物关联的高效工具包。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae083.
3
Bipartite networks represent causality better than simple networks: evidence, algorithms, and applications.
二分网络比简单网络更能体现因果关系:证据、算法及应用。
Front Genet. 2024 May 9;15:1371607. doi: 10.3389/fgene.2024.1371607. eCollection 2024.
4
Nitrogen sensing and regulatory networks: it's about time and space.氮感应和调控网络:论时间与空间。
Plant Cell. 2024 May 1;36(5):1482-1503. doi: 10.1093/plcell/koae038.
5
Histidine kinase inhibitors impair shoot regeneration in cytokinin signaling and SAM patterning determinants.组氨酸激酶抑制剂会损害细胞分裂素信号传导和茎尖分生组织模式决定因素中的芽再生。
Front Plant Sci. 2022 Sep 8;13:894208. doi: 10.3389/fpls.2022.894208. eCollection 2022.
6
A review of causal discovery methods for molecular network analysis.分子网络分析的因果发现方法综述。
Mol Genet Genomic Med. 2022 Oct;10(10):e2055. doi: 10.1002/mgg3.2055. Epub 2022 Sep 10.
7
Network inference with Granger causality ensembles on single-cell transcriptomics.基于单细胞转录组学的格兰杰因果关系集成的网络推断。
Cell Rep. 2022 Feb 8;38(6):110333. doi: 10.1016/j.celrep.2022.110333.
8
Time-Based Systems Biology Approaches to Capture and Model Dynamic Gene Regulatory Networks.基于时间的系统生物学方法捕获和建模动态基因调控网络。
Annu Rev Plant Biol. 2021 Jun 17;72:105-131. doi: 10.1146/annurev-arplant-081320-090914. Epub 2021 Mar 5.