Computational Systems Biology, RPTU Kaiserslautern, 67663, Kaiserslautern, Germany.
Institute of Automatic Control, RPTU Kaiserslautern, 67663, Kaiserslautern, Germany.
BMC Bioinformatics. 2024 Jan 17;25(1):30. doi: 10.1186/s12859-024-05636-6.
Within the frame of their genetic capacity, organisms are able to modify their molecular state to cope with changing environmental conditions or induced genetic disposition. As high throughput methods are becoming increasingly affordable, time series analysis techniques are applied frequently to study the complex dynamic interplay between genes, proteins, and metabolites at the physiological and molecular level. Common analysis approaches fail to simultaneously include (i) information about the replicate variance and (ii) the limited number of responses/shapes that a biological system is typically able to take.
We present a novel approach to model and classify short time series signals, conceptually based on a classical time series analysis, where the dependency of the consecutive time points is exploited. Constrained spline regression with automated model selection separates between noise and signal under the assumption that highly frequent changes are less likely to occur, simultaneously preserving information about the detected variance. This enables a more precise representation of the measured information and improves temporal classification in order to identify biologically interpretable correlations among the data.
An open source F# implementation of the presented method and documentation of its usage is freely available in the TempClass repository, https://github.com/CSBiology/TempClass [58].
在其遗传能力的框架内,生物体能够改变其分子状态以应对不断变化的环境条件或诱导的遗传倾向。随着高通量方法变得越来越经济实惠,时间序列分析技术经常被应用于研究基因、蛋白质和代谢物在生理和分子水平上的复杂动态相互作用。常见的分析方法无法同时包括(i)关于重复方差的信息和(ii)生物系统通常能够采取的有限数量的响应/形状。
我们提出了一种新的方法来对短时间序列信号进行建模和分类,该方法在概念上基于经典的时间序列分析,利用了连续时间点之间的依赖性。受约束的样条回归与自动化模型选择相结合,在假设高频变化不太可能发生的情况下,将噪声和信号分开,同时保留有关检测到的方差的信息。这使得更精确地表示测量信息,并提高了时间分类,以识别数据之间具有生物学意义的相关性。
本文提出的方法的开源 F#实现以及其用法的文档可在 TempClass 存储库中免费获得,网址为 https://github.com/CSBiology/TempClass[58]。