Suppr超能文献

具有长程分形相关性的时间序列分割

Segmentation of time series with long-range fractal correlations.

作者信息

Bernaola-Galván P, Oliver J L, Hackenberg M, Coronado A V, Ivanov P Ch, Carpena P

机构信息

Dpto. de Física Aplicada II, Universidad de Málaga, 29071 Málaga, Spain.

出版信息

Eur Phys J B. 2012 Jun 1;85(6). doi: 10.1140/epjb/e2012-20969-5.

Abstract

Segmentation is a standard method of data analysis to identify change-points dividing a nonstationary time series into homogeneous segments. However, for long-range fractal correlated series, most of the segmentation techniques detect spurious change-points which are simply due to the heterogeneities induced by the correlations and not to real nonstationarities. To avoid this oversegmentation, we present a segmentation algorithm which takes as a reference for homogeneity, instead of a random i.i.d. series, a correlated series modeled by a fractional noise with the same degree of correlations as the series to be segmented. We apply our algorithm to artificial series with long-range correlations and show that it systematically detects only the change-points produced by real nonstationarities and not those created by the correlations of the signal. Further, we apply the method to the sequence of the long arm of human chromosome 21, which is known to have long-range fractal correlations. We obtain only three segments that clearly correspond to the three regions of different G + C composition revealed by means of a multi-scale wavelet plot. Similar results have been obtained when segmenting all human chromosome sequences, showing the existence of previously unknown huge compositional superstructures in the human genome.

摘要

分割是一种数据分析的标准方法,用于识别将非平稳时间序列划分为同质段的变化点。然而,对于长程分形相关序列,大多数分割技术会检测到虚假的变化点,这些变化点仅仅是由相关性引起的异质性导致的,而不是真正的非平稳性。为了避免这种过度分割,我们提出了一种分割算法,该算法以一个相关序列作为同质性的参考,而不是一个随机独立同分布序列,该相关序列由与待分割序列具有相同相关程度的分数噪声建模。我们将我们的算法应用于具有长程相关性的人工序列,并表明它系统地只检测到由真正的非平稳性产生的变化点,而不是由信号相关性产生的变化点。此外,我们将该方法应用于人类21号染色体长臂的序列,已知该序列具有长程分形相关性。我们只得到了三个段,它们清楚地对应于通过多尺度小波图揭示的不同G + C组成的三个区域。在分割所有人类染色体序列时也得到了类似的结果,这表明人类基因组中存在以前未知的巨大组成超结构。

相似文献

1
Segmentation of time series with long-range fractal correlations.
Eur Phys J B. 2012 Jun 1;85(6). doi: 10.1140/epjb/e2012-20969-5.
2
Effect of nonstationarities on detrended fluctuation analysis.
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 Apr;65(4 Pt 1):041107. doi: 10.1103/PhysRevE.65.041107. Epub 2002 Apr 8.
3
High-level organization of isochores into gigantic superstructures in the human genome.
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 1):031908. doi: 10.1103/PhysRevE.83.031908. Epub 2011 Mar 15.
4
Heuristic segmentation of a nonstationary time series.
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Feb;69(2 Pt 1):021108. doi: 10.1103/PhysRevE.69.021108. Epub 2004 Feb 25.
5
Discovering isochores by least-squares optimal segmentation.
Gene. 2007 Jun 1;394(1-2):53-60. doi: 10.1016/j.gene.2007.01.028. Epub 2007 Feb 16.
8
Latent space unsupervised semantic segmentation.
Front Physiol. 2023 Apr 25;14:1151312. doi: 10.3389/fphys.2023.1151312. eCollection 2023.
9
Segmenting eukaryotic genomes with the Generalized Gibbs Sampler.
J Comput Biol. 2006 Sep;13(7):1369-83. doi: 10.1089/cmb.2006.13.1369.
10
Evaluation of the dispersional analysis method for fractal time series.
Ann Biomed Eng. 1995 Jul-Aug;23(4):491-505. doi: 10.1007/BF02584449.

引用本文的文献

2
Compositional Structure of the Genome: A Review.
Biology (Basel). 2023 Jun 13;12(6):849. doi: 10.3390/biology12060849.
3
The Fractal Tapestry of Life: II Entailment of Fractional Oncology by Physiology Networks.
Front Netw Physiol. 2022 Mar 24;2:845495. doi: 10.3389/fnetp.2022.845495. eCollection 2022.
4
On the Validity of Detrended Fluctuation Analysis at Short Scales.
Entropy (Basel). 2021 Dec 29;24(1):61. doi: 10.3390/e24010061.
5
Driven progressive evolution of genome sequence complexity in Cyanobacteria.
Sci Rep. 2020 Nov 4;10(1):19073. doi: 10.1038/s41598-020-76014-4.
6
NGSmethDB 2017: enhanced methylomes and differential methylation.
Nucleic Acids Res. 2017 Jan 4;45(D1):D97-D103. doi: 10.1093/nar/gkw996. Epub 2016 Oct 27.
7
Magnitude and sign of long-range correlated time series: Decomposition and surrogate signal generation.
Phys Rev E. 2016 Apr;93:042201. doi: 10.1103/PhysRevE.93.042201. Epub 2016 Apr 4.

本文引用的文献

1
Effects of coarse-graining on the scaling behavior of long-range correlated and anti-correlated signals.
Physica A. 2011 Nov 1;390(23-24):4057-4072. doi: 10.1016/j.physa.2011.05.015.
2
High-level organization of isochores into gigantic superstructures in the human genome.
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 1):031908. doi: 10.1103/PhysRevE.83.031908. Epub 2011 Mar 15.
3
Level statistics of words: finding keywords in literary texts and symbolic sequences.
Phys Rev E Stat Nonlin Soft Matter Phys. 2009 Mar;79(3 Pt 2):035102. doi: 10.1103/PhysRevE.79.035102. Epub 2009 Mar 10.
4
Stratification pattern of static and scale-invariant dynamic measures of heartbeat fluctuations across sleep stages in young and elderly.
IEEE Trans Biomed Eng. 2009 May;56(5):1564-73. doi: 10.1109/TBME.2009.2014819. Epub 2009 Feb 6.
5
Scale-invariant aspects of cardiac dynamics. Observing sleep stages and circadian phases.
IEEE Eng Med Biol Mag. 2007 Nov-Dec;26(6):33-7. doi: 10.1109/emb.2007.907093.
6
Comparing segmentations by applying randomization techniques.
BMC Bioinformatics. 2007 May 23;8:171. doi: 10.1186/1471-2105-8-171.
7
Identifying characteristic scales in the human genome.
Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Mar;75(3 Pt 1):032903. doi: 10.1103/PhysRevE.75.032903. Epub 2007 Mar 16.
8
Markov models of genome segmentation.
Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Jan;75(1 Pt 1):011915. doi: 10.1103/PhysRevE.75.011915. Epub 2007 Jan 17.
9
CpGcluster: a distance-based algorithm for CpG-island detection.
BMC Bioinformatics. 2006 Oct 12;7:446. doi: 10.1186/1471-2105-7-446.
10
How not to search for isochores: a reply to Cohen et Al.
Mol Biol Evol. 2005 Dec;22(12):2315-7. doi: 10.1093/molbev/msi231. Epub 2005 Aug 10.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验