Choudhury Robin A, McRoberts Neil
School of Earth, Environmental, and Marine Sciences, University of Texas, Rio Grande Valley, Edinburg, TX 78541, USA.
Quantitative Biology and Epidemiology Group, Plant Pathology Department, University of California, Davis, Davis, CA 95616, USA.
Entropy (Basel). 2020 Nov 27;22(12):1343. doi: 10.3390/e22121343.
In a previous study, air sampling using vortex air samplers combined with species-specific amplification of pathogen DNA was carried out over two years in four or five locations in the Salinas Valley of California. The resulting time series data for the abundance of pathogen DNA trapped per day displayed complex dynamics with features of both deterministic (chaotic) and stochastic uncertainty. Methods of nonlinear time series analysis developed for the reconstruction of low dimensional attractors provided new insights into the complexity of pathogen abundance data. In particular, the analyses suggested that the length of time series data that it is practical or cost-effective to collect may limit the ability to definitively classify the uncertainty in the data. Over the two years of the study, five location/year combinations were classified as having stochastic linear dynamics and four were not. Calculation of entropy values for either the number of pathogen DNA copies or for a binary string indicating whether the pathogen abundance data were increasing revealed (1) some robust differences in the dynamics between seasons that were not obvious in the time series data themselves and (2) that the series were almost all at their theoretical maximum entropy value when considered from the simple perspective of whether instantaneous change along the sequence was positive.
在之前的一项研究中,在加利福尼亚州萨利纳斯山谷的四到五个地点,使用涡旋空气采样器进行空气采样,并结合病原体DNA的物种特异性扩增,持续了两年时间。由此得到的每天捕获的病原体DNA丰度的时间序列数据呈现出复杂的动态变化,具有确定性(混沌)和随机不确定性的特征。为重建低维吸引子而开发的非线性时间序列分析方法,为病原体丰度数据的复杂性提供了新的见解。特别是,分析表明,实际可行或具有成本效益的时间序列数据收集长度,可能会限制明确分类数据中不确定性的能力。在这项研究的两年时间里,五个地点/年份组合被归类为具有随机线性动态,四个则不是。对病原体DNA拷贝数或表示病原体丰度数据是否增加的二进制字符串计算熵值,揭示了(1)不同季节之间动态变化存在一些稳健的差异,这些差异在时间序列数据本身中并不明显;(2)从序列中瞬时变化是否为正的简单角度考虑,这些序列几乎都处于其理论最大熵值。