Xu Chao, Xu Chen, Tian Wenjing, Hu Anqing, Jiang Rui
School of Mathematics and Computer Science, Wuhan Textile University, Wuhan 430200, China.
Accounting College, Wuhan Textile University, Wuhan 430200, China.
Entropy (Basel). 2019 Feb 27;21(3):229. doi: 10.3390/e21030229.
In this study, the Wikipedia page views for four selected topics, namely, education, the economy/finance, medicine, and nature/environment from 2016-2018 are collected and the sample entropies of the three years' page views are estimated and investigated using a short-time series multiscale entropy (sMSE) algorithm for a comprehensible understanding of the complexity of human website searching activities. The sample entropies of the selected topics are found to exhibit different temporal variations. In the past three years, the temporal characteristics of the sample entropies are vividly revealed, and the sample entropies of the selected topics follow the same tendencies and can be quantitatively ranked. By taking the 95% confidence interval into account, the temporal variations of sample entropies are further validated by statistical analysis (non-parametric), including the Wilcoxon signed-rank test and the Mann-Whitney -test. The results suggest that the sample entropies estimated by the sMSE algorithm are feasible for analyzing the temporal variations of complexity for certain topics, whereas the regular variations of estimated sample entropies of different selected topics can't simply be accepted as is. Potential explanations and paths in forthcoming studies are also described and discussed.
在本研究中,收集了2016 - 2018年四个选定主题(即教育、经济/金融、医学和自然/环境)的维基百科页面浏览量,并使用短时序列多尺度熵(sMSE)算法估计和研究了这三年页面浏览量的样本熵,以便全面了解人类网站搜索活动的复杂性。研究发现选定主题的样本熵呈现出不同的时间变化。在过去三年中,样本熵的时间特征得到了生动展现,选定主题的样本熵遵循相同趋势且可进行定量排序。考虑到95%置信区间,通过包括威尔科克森符号秩检验和曼 - 惠特尼检验在内的统计分析(非参数)进一步验证了样本熵的时间变化。结果表明,sMSE算法估计的样本熵对于分析某些主题复杂性的时间变化是可行的,而不同选定主题估计样本熵的值的规律变化不能简单地直接接受。还描述并讨论了后续研究中的潜在解释和路径。