• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于时间序列预测的模型选择方法:将谷歌趋势数据纳入澳大利亚宏观指标

A Model Selection Approach for Time Series Forecasting: Incorporating Google Trends Data in Australian Macro Indicators.

作者信息

Karim Ali Abdul, Pardede Eric, Mann Scott

机构信息

Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3086, Australia.

出版信息

Entropy (Basel). 2023 Jul 30;25(8):1144. doi: 10.3390/e25081144.

DOI:10.3390/e25081144
PMID:37628174
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10453655/
Abstract

This study examined whether the behaviour of Internet search users obtained from Google Trends contributes to the forecasting of two Australian macroeconomic indicators: monthly unemployment rate and monthly number of short-term visitors. We assessed the performance of traditional time series linear regression (SARIMA) against a widely used machine learning technique (support vector regression) and a deep learning technique (convolutional neural network) in forecasting both indicators across different data settings. Our study focused on the out-of-sample forecasting performance of the SARIMA, SVR, and CNN models and forecasting the two Australian indicators. We adopted a multi-step approach to compare the performance of the models built over different forecasting horizons and assessed the impact of incorporating Google Trends data in the modelling process. Our approach supports a data-driven framework, which reduces the number of features prior to selecting the best-performing model. The experiments showed that incorporating Internet search data in the forecasting models improved the forecasting accuracy and that the results were dependent on the forecasting horizon, as well as the technique. To the best of our knowledge, this study is the first to assess the usefulness of Google search data in the context of these two economic variables. An extensive comparison of the performance of traditional and machine learning techniques on different data settings was conducted to enable the selection of an efficient model, including the forecasting technique, horizon, and modelling features.

摘要

本研究考察了从谷歌趋势获取的互联网搜索用户行为是否有助于预测澳大利亚的两个宏观经济指标

月度失业率和月度短期访客数量。我们评估了传统时间序列线性回归(SARIMA)与一种广泛使用的机器学习技术(支持向量回归)和一种深度学习技术(卷积神经网络)在不同数据设置下预测这两个指标的性能。我们的研究聚焦于SARIMA、支持向量回归(SVR)和卷积神经网络(CNN)模型的样本外预测性能以及对澳大利亚这两个指标的预测。我们采用多步骤方法来比较在不同预测期构建的模型的性能,并评估在建模过程中纳入谷歌趋势数据的影响。我们的方法支持一个数据驱动的框架,该框架在选择表现最佳的模型之前减少了特征数量。实验表明,在预测模型中纳入互联网搜索数据提高了预测准确性,并且结果取决于预测期以及技术。据我们所知,本研究首次在这两个经济变量的背景下评估谷歌搜索数据的有用性。我们对传统技术和机器学习技术在不同数据设置下的性能进行了广泛比较,以便能够选择一个有效的模型,包括预测技术、预测期和建模特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/28d7cd96f17f/entropy-25-01144-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/6067c4d282b8/entropy-25-01144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/2701318315b4/entropy-25-01144-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/c28715389d86/entropy-25-01144-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/28d7cd96f17f/entropy-25-01144-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/6067c4d282b8/entropy-25-01144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/2701318315b4/entropy-25-01144-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/c28715389d86/entropy-25-01144-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25f5/10453655/28d7cd96f17f/entropy-25-01144-g004.jpg

相似文献

1
A Model Selection Approach for Time Series Forecasting: Incorporating Google Trends Data in Australian Macro Indicators.一种用于时间序列预测的模型选择方法:将谷歌趋势数据纳入澳大利亚宏观指标
Entropy (Basel). 2023 Jul 30;25(8):1144. doi: 10.3390/e25081144.
2
Time Series Forecasting of Univariate Agrometeorological Data: A Comparative Performance Evaluation via One-Step and Multi-Step Ahead Forecasting Strategies.单变量农业气象数据的时间序列预测:通过一步和多步超前预测策略的比较性能评估。
Sensors (Basel). 2021 Apr 1;21(7):2430. doi: 10.3390/s21072430.
3
Using Internet Search Trends and Historical Trading Data for Predicting Stock Markets by the Least Squares Support Vector Regression Model.利用最小二乘支持向量回归模型通过互联网搜索趋势和历史交易数据预测股票市场。
Comput Intell Neurosci. 2018 Jul 24;2018:6305246. doi: 10.1155/2018/6305246. eCollection 2018.
4
The Validity of Google Trends Search Volumes for Behavioral Forecasting of National Suicide Rates in Ireland.谷歌趋势搜索量用于预测爱尔兰全国自杀率的有效性。
Int J Environ Res Public Health. 2019 Sep 2;16(17):3201. doi: 10.3390/ijerph16173201.
5
Can Google Trends data improve forecasting of Lyme disease incidence?谷歌趋势数据能否改善莱姆病发病率的预测?
Zoonoses Public Health. 2019 Feb;66(1):101-107. doi: 10.1111/zph.12539. Epub 2018 Nov 16.
6
Comparative Analysis of Different Univariate Forecasting Methods in Modelling and Predicting the Romanian Unemployment Rate for the Period 2021-2022.2021 - 2022年罗马尼亚失业率建模与预测中不同单变量预测方法的比较分析
Entropy (Basel). 2021 Mar 9;23(3):325. doi: 10.3390/e23030325.
7
Forecasting daily emergency department arrivals using high-dimensional multivariate data: a feature selection approach.利用高维多元数据预测每日急诊科就诊人数:一种特征选择方法。
BMC Med Inform Decis Mak. 2022 May 17;22(1):134. doi: 10.1186/s12911-022-01878-7.
8
Bringing forecasting into the future: Using Google to predict visitation in U.S. national parks.将预测带入未来:利用谷歌预测美国国家公园的游客量。
J Environ Manage. 2019 Aug 1;243:88-94. doi: 10.1016/j.jenvman.2019.05.006. Epub 2019 May 10.
9
Performance evaluation of Emergency Department patient arrivals forecasting models by including meteorological and calendar information: A comparative study.纳入气象和日历信息的急诊科患者到达预测模型的性能评估:一项比较研究。
Comput Biol Med. 2021 Aug;135:104541. doi: 10.1016/j.compbiomed.2021.104541. Epub 2021 Jun 3.
10
Forecasting tuberculosis using diabetes-related google trends data.利用与糖尿病相关的谷歌趋势数据预测结核病
Pathog Glob Health. 2020 Jul;114(5):236-241. doi: 10.1080/20477724.2020.1767854. Epub 2020 May 26.

引用本文的文献

1
Spatiotemporal analysis and forecasting of public attention to China's five major religions.公众对中国五大宗教关注度的时空分析与预测
Sci Rep. 2025 Aug 8;15(1):29116. doi: 10.1038/s41598-025-15396-9.

本文引用的文献

1
Can multi-source heterogeneous data improve the forecasting performance of tourist arrivals amid COVID-19? Mixed-data sampling approach.多源异构数据能否提高新冠疫情期间游客到访量的预测性能?混合数据抽样方法。
Tour Manag. 2023 Oct;98:104759. doi: 10.1016/j.tourman.2023.104759. Epub 2023 Apr 3.
2
Nowcasting unemployment rates with Google searches: evidence from the Visegrad Group countries.利用谷歌搜索进行实时失业率预测:来自维谢格拉德集团国家的证据。
PLoS One. 2015 May 22;10(5):e0127084. doi: 10.1371/journal.pone.0127084. eCollection 2015.
3
A survey on filter techniques for feature selection in gene expression microarray analysis.
基因表达微阵列分析中特征选择的过滤技术调查。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1106-19. doi: 10.1109/TCBB.2012.33.