• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有超前-滞后效应的时间序列的高斯过程及其在生物学数据中的应用

Gaussian processes for time series with lead-lag effects with applications to biology data.

作者信息

Mu Wancen, Chen Jiawen, Davis Eric S, Reed Kathleen, Phanstiel Douglas, Love Michael I, Li Didong

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States.

Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States.

出版信息

Biometrics. 2025 Jan 7;81(1). doi: 10.1093/biomtc/ujae156.

DOI:10.1093/biomtc/ujae156
PMID:39775854
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11704948/
Abstract

Investigating the relationship, particularly the lead-lag effect, between time series is a common question across various disciplines, especially when uncovering biological processes. However, analyzing time series presents several challenges. Firstly, due to technical reasons, the time points at which observations are made are not at uniform intervals. Secondly, some lead-lag effects are transient, necessitating time-lag estimation based on a limited number of time points. Thirdly, external factors also impact these time series, requiring a similarity metric to assess the lead-lag relationship. To counter these issues, we introduce a model grounded in the Gaussian process, affording the flexibility to estimate lead-lag effects for irregular time series. In addition, our method outputs dissimilarity scores, thereby broadening its applications to include tasks such as ranking or clustering multiple pairwise time series when considering their strength of lead-lag effects with external factors. Crucially, we offer a series of theoretical proofs to substantiate the validity of our proposed kernels and the identifiability of kernel parameters. Our model demonstrates advances in various simulations and real-world applications, particularly in the study of dynamic chromatin interactions, compared to other leading methods.

摘要

研究时间序列之间的关系,尤其是超前-滞后效应,是各个学科中常见的问题,在揭示生物过程时尤为如此。然而,分析时间序列存在若干挑战。首先,由于技术原因,进行观测的时间点并非等间隔的。其次,一些超前-滞后效应是短暂的,这就需要基于有限数量的时间点来估计时间滞后。第三,外部因素也会影响这些时间序列,需要一种相似性度量来评估超前-滞后关系。为应对这些问题,我们引入了一个基于高斯过程的模型,它能够灵活地估计不规则时间序列的超前-滞后效应。此外,我们的方法输出差异分数,从而拓宽了其应用范围,包括在考虑多个成对时间序列与外部因素的超前-滞后效应强度时进行排序或聚类等任务。至关重要的是,我们提供了一系列理论证明,以证实我们提出的核的有效性以及核参数的可识别性。与其他领先方法相比,我们的模型在各种模拟和实际应用中都有进展,特别是在动态染色质相互作用的研究中。

相似文献

1
Gaussian processes for time series with lead-lag effects with applications to biology data.具有超前-滞后效应的时间序列的高斯过程及其在生物学数据中的应用
Biometrics. 2025 Jan 7;81(1). doi: 10.1093/biomtc/ujae156.
2
Identifying temporal pathways using biomarkers in the presence of latent non-Gaussian components.利用存在潜在非高斯成分的生物标志物识别时间途径。
Biometrics. 2024 Mar 27;80(2). doi: 10.1093/biomtc/ujae033.
3
Bayesian inference for multivariate probit model with latent envelope.具有潜在包络的多元概率比例模型的贝叶斯推断。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae059.
4
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
5
Variable selection for clustering with Gaussian mixture models.用于高斯混合模型聚类的变量选择
Biometrics. 2009 Sep;65(3):701-9. doi: 10.1111/j.1541-0420.2008.01160.x. Epub 2009 Feb 4.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Large scale maximum average power multiple inference on time-course count data with application to RNA-seq analysis.大规模最大平均功率多重推断在时间序列计数数据中的应用及在 RNA-seq 分析中的应用。
Biometrics. 2020 Mar;76(1):9-22. doi: 10.1111/biom.13144. Epub 2019 Nov 14.
8
Gaussian process functional regression modeling for batch data.用于批量数据的高斯过程函数回归建模
Biometrics. 2007 Sep;63(3):714-23. doi: 10.1111/j.1541-0420.2007.00758.x.
9
Global identifiability of latent class models with applications to diagnostic test accuracy studies: A Gröbner basis approach.具有潜在类别模型全局可识别性的诊断测试准确性研究:一种 Grobner 基方法。
Biometrics. 2020 Mar;76(1):98-108. doi: 10.1111/biom.13133. Epub 2019 Nov 6.
10
Gaussian process based bayesian semiparametric quantitative trait Loci interval mapping.基于高斯过程的贝叶斯半参数数量性状基因座区间定位
Biometrics. 2010 Mar;66(1):222-32. doi: 10.1111/j.1541-0420.2009.01268.x. Epub 2009 May 12.

本文引用的文献

1
Inference for Gaussian Processes with Matérn Covariogram on Compact Riemannian Manifolds.紧致黎曼流形上具有马特恩协方差函数的高斯过程的推断
J Mach Learn Res. 2023 Mar;24.
2
matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling.matchRanges:通过协变量匹配抽样生成零假设基因组范围。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad197.
3
Temporal analysis suggests a reciprocal relationship between 3D chromatin structure and transcription.时间分析表明 3D 染色质结构和转录之间存在相互关系。
Cell Rep. 2022 Nov 1;41(5):111567. doi: 10.1016/j.celrep.2022.111567.
4
Evolution of disease transmission during the COVID-19 pandemic: patterns and determinants.新冠疫情期间疾病传播的演变:模式和决定因素。
Sci Rep. 2021 May 26;11(1):11029. doi: 10.1038/s41598-021-90347-8.
5
Causal network inference from gene transcriptional time-series response to glucocorticoids.从基因转录时间序列对糖皮质激素的响应中推断因果关系网络。
PLoS Comput Biol. 2021 Jan 29;17(1):e1008223. doi: 10.1371/journal.pcbi.1008223. eCollection 2021 Jan.
6
Plasma Hsp90 levels in patients with systemic sclerosis and relation to lung and skin involvement: a cross-sectional and longitudinal study.硬皮病患者血浆 Hsp90 水平与肺和皮肤受累的关系:一项横断面和纵向研究。
Sci Rep. 2021 Jan 7;11(1):1. doi: 10.1038/s41598-020-79139-8.
7
Expanded encyclopaedias of DNA elements in the human and mouse genomes.人类和小鼠基因组中 DNA 元件的扩展百科全书。
Nature. 2020 Jul;583(7818):699-710. doi: 10.1038/s41586-020-2493-4. Epub 2020 Jul 29.
8
Detecting and quantifying causal associations in large nonlinear time series datasets.检测和量化大型非线性时间序列数据集的因果关系。
Sci Adv. 2019 Nov 27;5(11):eaau4996. doi: 10.1126/sciadv.aau4996. eCollection 2019 Nov.
9
Activity-by-contact model of enhancer-promoter regulation from thousands of CRISPR perturbations.基于数千个 CRISPR 干扰的增强子-启动子调控的活性-接触模型。
Nat Genet. 2019 Dec;51(12):1664-1669. doi: 10.1038/s41588-019-0538-0. Epub 2019 Nov 29.
10
Dynamic genetic regulation of gene expression during cellular differentiation.细胞分化过程中基因表达的动态遗传调控。
Science. 2019 Jun 28;364(6447):1287-1290. doi: 10.1126/science.aaw0040.