

Segmentation of time series with long-range fractal correlations.

Authors

Bernaola-Galván P, Oliver J L, Hackenberg M, Coronado A V, Ivanov P Ch, Carpena P

Affiliation

Dpto. de Física Aplicada II, Universidad de Málaga, 29071 Málaga, Spain.

Publication

Eur Phys J B. 2012 Jun 1;85(6). doi: 10.1140/epjb/e2012-20969-5.

DOI: 10.1140/epjb/e2012-20969-5
PMID: 23645997
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3643524/
Abstract

Segmentation is a standard method of data analysis to identify change-points dividing a nonstationary time series into homogeneous segments. However, for long-range fractal correlated series, most of the segmentation techniques detect spurious change-points which are simply due to the heterogeneities induced by the correlations and not to real nonstationarities. To avoid this oversegmentation, we present a segmentation algorithm which takes as a reference for homogeneity, instead of a random i.i.d. series, a correlated series modeled by a fractional noise with the same degree of correlations as the series to be segmented. We apply our algorithm to artificial series with long-range correlations and show that it systematically detects only the change-points produced by real nonstationarities and not those created by the correlations of the signal. Further, we apply the method to the sequence of the long arm of human chromosome 21, which is known to have long-range fractal correlations. We obtain only three segments that clearly correspond to the three regions of different G + C composition revealed by means of a multi-scale wavelet plot. Similar results have been obtained when segmenting all human chromosome sequences, showing the existence of previously unknown huge compositional superstructures in the human genome.
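
The idea in the abstract, judging a candidate change-point against correlated fractional noise rather than an i.i.d. series, can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the split criterion (the maximum Student's t between left and right means), the Monte Carlo significance test, the Fourier-filtering generator for approximate fractional noise, and the function names `max_t_stat`, `fractional_noise`, `split_p_value`, and `segment`. The correlation exponent `hurst` is assumed to have been estimated separately for the series being segmented.

```python
import numpy as np

def max_t_stat(x, min_len=10):
    """Return (cut, t_max): the split position maximizing |t| between
    the means of x[:cut] and x[cut:]."""
    x = np.asarray(x, dtype=float)
    n = x.size
    k = np.arange(min_len, n - min_len + 1)       # candidate left lengths
    c1, c2 = np.cumsum(x), np.cumsum(x * x)
    s1, s2 = c1[k - 1], c2[k - 1]                 # sums over x[:k]
    r1, r2 = c1[-1] - s1, c2[-1] - s2             # sums over x[k:]
    nl, nr = k, n - k
    ml, mr = s1 / nl, r1 / nr
    # Pooled within-segment sum of squares (guarded against round-off).
    ss = np.maximum((s2 - nl * ml**2) + (r2 - nr * mr**2), 1e-300)
    sd = np.sqrt(ss / (n - 2) * (1.0 / nl + 1.0 / nr))
    t = np.abs(ml - mr) / sd
    i = int(np.argmax(t))
    return int(k[i]), float(t[i])

def fractional_noise(n, hurst, rng):
    """Approximate fractional Gaussian noise by Fourier filtering,
    with power spectrum S(f) ~ f^-(2*hurst - 1); unit variance, zero mean."""
    f = np.fft.rfftfreq(n)
    amp = np.zeros_like(f)
    amp[1:] = f[1:] ** (-(2.0 * hurst - 1.0) / 2.0)
    phases = rng.standard_normal(f.size) + 1j * rng.standard_normal(f.size)
    y = np.fft.irfft(amp * phases, n)
    return (y - y.mean()) / y.std()

def split_p_value(x, hurst, n_surr=200, min_len=10, seed=0):
    """Monte Carlo p-value of the best split of x, using homogeneous
    fractional noise with the same exponent as the null reference."""
    rng = np.random.default_rng(seed)
    cut, t_obs = max_t_stat(x, min_len)
    t_null = np.array([
        max_t_stat(fractional_noise(len(x), hurst, rng), min_len)[1]
        for _ in range(n_surr)
    ])
    return cut, t_obs, float(np.mean(t_null >= t_obs))

def segment(x, hurst, alpha=0.05, min_len=10, offset=0, n_surr=200):
    """Recursively split x while the best cut is significant against
    the correlated null; returns the sorted list of cut positions."""
    if len(x) < 2 * min_len:
        return []
    cut, _, p = split_p_value(x, hurst, n_surr=n_surr, min_len=min_len)
    if p >= alpha:
        return []
    left = segment(x[:cut], hurst, alpha, min_len, offset, n_surr)
    right = segment(x[cut:], hurst, alpha, min_len, offset + cut, n_surr)
    return left + [offset + cut] + right
```

Because the t statistic is invariant under shifts and rescalings of the series, the surrogates only need to match the correlation exponent, not the mean or variance. With `hurst=0.5` the null reduces to white noise, which corresponds to the conventional i.i.d. reference; larger values of `hurst` raise the threshold a split must clear, which is the mechanism that suppresses cuts caused by the correlations alone.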



