Suppr超能文献

具有长程分形相关性的时间序列分割

Segmentation of time series with long-range fractal correlations.

作者信息

Bernaola-Galván P, Oliver J L, Hackenberg M, Coronado A V, Ivanov P Ch, Carpena P

机构信息

Dpto. de Física Aplicada II, Universidad de Málaga, 29071 Málaga, Spain.

出版信息

Eur Phys J B. 2012 Jun 1;85(6). doi: 10.1140/epjb/e2012-20969-5.

Abstract

Segmentation is a standard method of data analysis to identify change-points dividing a nonstationary time series into homogeneous segments. However, for long-range fractal correlated series, most of the segmentation techniques detect spurious change-points which are simply due to the heterogeneities induced by the correlations and not to real nonstationarities. To avoid this oversegmentation, we present a segmentation algorithm which takes as a reference for homogeneity, instead of a random i.i.d. series, a correlated series modeled by a fractional noise with the same degree of correlations as the series to be segmented. We apply our algorithm to artificial series with long-range correlations and show that it systematically detects only the change-points produced by real nonstationarities and not those created by the correlations of the signal. Further, we apply the method to the sequence of the long arm of human chromosome 21, which is known to have long-range fractal correlations. We obtain only three segments that clearly correspond to the three regions of different G + C composition revealed by means of a multi-scale wavelet plot. Similar results have been obtained when segmenting all human chromosome sequences, showing the existence of previously unknown huge compositional superstructures in the human genome.

摘要

分割是一种数据分析的标准方法,用于识别将非平稳时间序列划分为同质段的变化点。然而,对于长程分形相关序列,大多数分割技术会检测到虚假的变化点,这些变化点仅仅是由相关性引起的异质性导致的,而不是真正的非平稳性。为了避免这种过度分割,我们提出了一种分割算法,该算法以一个相关序列作为同质性的参考,而不是一个随机独立同分布序列,该相关序列由与待分割序列具有相同相关程度的分数噪声建模。我们将我们的算法应用于具有长程相关性的人工序列,并表明它系统地只检测到由真正的非平稳性产生的变化点,而不是由信号相关性产生的变化点。此外,我们将该方法应用于人类21号染色体长臂的序列,已知该序列具有长程分形相关性。我们只得到了三个段,它们清楚地对应于通过多尺度小波图揭示的不同G + C组成的三个区域。在分割所有人类染色体序列时也得到了类似的结果,这表明人类基因组中存在以前未知的巨大组成超结构。

相似文献

2
Effect of nonstationarities on detrended fluctuation analysis.非平稳性对去趋势波动分析的影响。
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 Apr;65(4 Pt 1):041107. doi: 10.1103/PhysRevE.65.041107. Epub 2002 Apr 8.
3
High-level organization of isochores into gigantic superstructures in the human genome.人类基因组中等位基因的高级组织形成巨大的超结构。
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 1):031908. doi: 10.1103/PhysRevE.83.031908. Epub 2011 Mar 15.
4
Heuristic segmentation of a nonstationary time series.非平稳时间序列的启发式分割
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Feb;69(2 Pt 1):021108. doi: 10.1103/PhysRevE.69.021108. Epub 2004 Feb 25.
5
Discovering isochores by least-squares optimal segmentation.通过最小二乘最优分割发现等容线。
Gene. 2007 Jun 1;394(1-2):53-60. doi: 10.1016/j.gene.2007.01.028. Epub 2007 Feb 16.
8
Latent space unsupervised semantic segmentation.潜在空间无监督语义分割
Front Physiol. 2023 Apr 25;14:1151312. doi: 10.3389/fphys.2023.1151312. eCollection 2023.

引用本文的文献

2
Compositional Structure of the Genome: A Review.基因组的组成结构:综述
Biology (Basel). 2023 Jun 13;12(6):849. doi: 10.3390/biology12060849.
6
NGSmethDB 2017: enhanced methylomes and differential methylation.NGSmethDB 2017:增强的甲基化组与差异甲基化
Nucleic Acids Res. 2017 Jan 4;45(D1):D97-D103. doi: 10.1093/nar/gkw996. Epub 2016 Oct 27.

本文引用的文献

2
High-level organization of isochores into gigantic superstructures in the human genome.人类基因组中等位基因的高级组织形成巨大的超结构。
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 1):031908. doi: 10.1103/PhysRevE.83.031908. Epub 2011 Mar 15.
3
Level statistics of words: finding keywords in literary texts and symbolic sequences.词汇的层级统计:在文学文本和符号序列中寻找关键词
Phys Rev E Stat Nonlin Soft Matter Phys. 2009 Mar;79(3 Pt 2):035102. doi: 10.1103/PhysRevE.79.035102. Epub 2009 Mar 10.
7
Identifying characteristic scales in the human genome.识别人类基因组中的特征尺度。
Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Mar;75(3 Pt 1):032903. doi: 10.1103/PhysRevE.75.032903. Epub 2007 Mar 16.
8
Markov models of genome segmentation.基因组分割的马尔可夫模型。
Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Jan;75(1 Pt 1):011915. doi: 10.1103/PhysRevE.75.011915. Epub 2007 Jan 17.
10
How not to search for isochores: a reply to Cohen et Al.如何避免寻找等染色体区域:对科恩等人的回应
Mol Biol Evol. 2005 Dec;22(12):2315-7. doi: 10.1093/molbev/msi231. Epub 2005 Aug 10.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验