• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测COVID-19传播演变的社交媒体大数据混合深度学习

Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission.

作者信息

Chew Alvin Wei Ze, Pan Yue, Wang Ying, Zhang Limao

机构信息

Bentley Systems Research Office, 1 Harbourfront Pl, HarbourFront Tower One, Singapore 098633, Singapore.

Shanghai Key Laboratory for Digital Maintenance of Buildings and Infrastructure, Department of Civil Engineering, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, China.

出版信息

Knowl Based Syst. 2021 Dec 5;233:107417. doi: 10.1016/j.knosys.2021.107417. Epub 2021 Aug 24.

DOI:10.1016/j.knosys.2021.107417
PMID:34690447
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8522122/
Abstract

In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19 related Twitter data, as representative of the global community's aggregated emotional responses towards the current pandemic, to model the growth rate in the number of confirmed COVID-19 cases globally via a proposed G parameter. Overall, there were 3 key components to ODANN's development phase, namely: (i) data hydration and pre-processing were performed on COVID-19 related Twitter data ranging between 23 January 2020 and 10 May 2020, which amounted to over 100 million Tweets written in English language; (ii) multiple NLP features extraction methods were subsequently leveraged to encode the hydrated Twitter data into useful semantic word vectors for training ODANN under an optimal set of hyperparameters; and (iii) historical time-series data of defined characteristics were also assimilated into ODANN's selected hidden layer(s) to model the G parameter daily with a lead-time of 1 day. By far, our experimental results demonstrated that by adopting a rolling time-window size of 5 days, with respect to the number of historical time-series records for assimilating different data features, enabled ODANN to outperform other traditional time-series models and recent studies, in terms of the computed RMSE and MAE scores attained from the model's testing step. Overall, the summarized results from ODANN demonstrated its competitive edge in modelling and forecasting the growth rate in the number of COVID-19 cases globally.

摘要

在本研究中,构建了一种名为ODANN的混合深度学习模型,该模型基于神经网络(NN),并结合了数据同化和自然语言处理(NLP)特征提取方法,用于同时处理每日新冠肺炎时间序列记录和大量与新冠肺炎相关的推特数据,这些数据代表了全球社区对当前疫情的综合情绪反应,通过一个提议的G参数对全球新冠肺炎确诊病例数的增长率进行建模。总体而言,ODANN的开发阶段有3个关键组成部分,即:(i)对2020年1月23日至2020年5月10日期间与新冠肺炎相关的推特数据进行数据充实和预处理,这些数据包括超过1亿条用英语撰写的推文;(ii)随后利用多种NLP特征提取方法,在一组最优超参数下,将充实后的推特数据编码为有用的语义词向量,用于训练ODANN;(iii)还将具有特定特征的历史时间序列数据同化到ODANN选定的隐藏层中,以提前1天每日对G参数进行建模。到目前为止,我们的实验结果表明,通过采用5天的滚动时间窗口大小,相对于用于同化不同数据特征的历史时间序列记录数量,ODANN在模型测试步骤中获得的计算均方根误差(RMSE)和平均绝对误差(MAE)分数方面优于其他传统时间序列模型和近期研究。总体而言,ODANN的总结结果显示了其在全球新冠肺炎病例数增长率建模和预测方面的竞争优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/54a4db5e79f3/gr18_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/34175d1c7089/fx1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/8901c5066b9f/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/bcf118096440/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/f9f38f8c5592/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/5bfc82ddfef6/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/d28109b14258/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/ae32c26eeef2/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/9119dea0c1f8/gr7_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/82bf7a630794/gr8_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/de58670c208d/gr9_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/19aaa9a7d322/gr10_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/64950bf3cf24/gr11_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/f9a04e23dc25/gr12_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/373acd31f5c2/gr13_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/df711a8ba3f1/gr14_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/d0009418f232/gr15_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/b948c4a53617/gr16_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/7caa65bc9394/gr17_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/54a4db5e79f3/gr18_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/34175d1c7089/fx1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/8901c5066b9f/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/bcf118096440/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/f9f38f8c5592/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/5bfc82ddfef6/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/d28109b14258/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/ae32c26eeef2/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/9119dea0c1f8/gr7_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/82bf7a630794/gr8_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/de58670c208d/gr9_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/19aaa9a7d322/gr10_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/64950bf3cf24/gr11_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/f9a04e23dc25/gr12_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/373acd31f5c2/gr13_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/df711a8ba3f1/gr14_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/d0009418f232/gr15_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/b948c4a53617/gr16_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/7caa65bc9394/gr17_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0679/8522122/54a4db5e79f3/gr18_lrg.jpg

相似文献

1
Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission.用于预测COVID-19传播演变的社交媒体大数据混合深度学习
Knowl Based Syst. 2021 Dec 5;233:107417. doi: 10.1016/j.knosys.2021.107417. Epub 2021 Aug 24.
2
Deep learning modeling of public's sentiments towards temporal evolution of COVID-19 transmission.公众对新冠病毒传播时间演变的情绪的深度学习建模
Appl Soft Comput. 2022 Dec;131:109728. doi: 10.1016/j.asoc.2022.109728. Epub 2022 Oct 20.
3
Deep Learning-Based Prediction of Daily COVID-19 Cases Using X (Twitter) Data.基于深度学习的 X(推特)数据预测每日 COVID-19 病例。
Stud Health Technol Inform. 2024 Aug 22;316:1989-1993. doi: 10.3233/SHTI240824.
4
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
5
Big Data, Natural Language Processing, and Deep Learning to Detect and Characterize Illicit COVID-19 Product Sales: Infoveillance Study on Twitter and Instagram.大数据、自然语言处理和深度学习技术在检测和识别非法销售 COVID-19 产品中的应用:对 Twitter 和 Instagram 的 Infoveillance 研究。
JMIR Public Health Surveill. 2020 Aug 25;6(3):e20794. doi: 10.2196/20794.
6
Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study.新冠疫情期间推特上抑郁症趋势的监测:观察性研究。
JMIR Infodemiology. 2021 Jul 18;1(1):e26769. doi: 10.2196/26769. eCollection 2021 Jan-Dec.
7
A COVID-19 Pandemic Artificial Intelligence-Based System With Deep Learning Forecasting and Automatic Statistical Data Acquisition: Development and Implementation Study.一种基于人工智能的新冠肺炎大流行深度学习预测与自动统计数据采集系统:开发与实施研究
J Med Internet Res. 2021 May 20;23(5):e27806. doi: 10.2196/27806.
8
Multiclass sentiment analysis on COVID-19-related tweets using deep learning models.使用深度学习模型对与COVID-19相关推文进行多类别情感分析。
Neural Comput Appl. 2022;34(22):19615-19627. doi: 10.1007/s00521-022-07650-2. Epub 2022 Aug 6.
9
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
10
Using Tweets to Understand How COVID-19-Related Health Beliefs Are Affected in the Age of Social Media: Twitter Data Analysis Study.利用推文了解社交媒体时代 COVID-19 相关健康信念如何受到影响:推特数据分析研究。
J Med Internet Res. 2021 Feb 22;23(2):e26302. doi: 10.2196/26302.

引用本文的文献

1
An explainable GeoAI approach for the multimodal analysis of urban human dynamics: a case study for the COVID-19 pandemic in Rio de Janeiro.一种用于城市人类动态多模态分析的可解释地理人工智能方法:以里约热内卢的新冠疫情为例
Comput Urban Sci. 2025;5(1):13. doi: 10.1007/s43762-025-00172-2. Epub 2025 Mar 3.
2
Discovering Time-Varying Public Interest for COVID-19 Case Prediction in South Korea Using Search Engine Queries: Infodemiology Study.利用搜索引擎查询发现韩国新冠肺炎病例预测中的时变公众兴趣:信息流行病学研究
J Med Internet Res. 2024 Dec 16;26:e63476. doi: 10.2196/63476.
3
MGLEP: Multimodal Graph Learning for Modeling Emerging Pandemics with Big Data.

本文引用的文献

1
Forecasting virus outbreaks with social media data via neural ordinary differential equations.利用社交媒体数据通过神经常微分方程预测病毒爆发。
Sci Rep. 2023 Jul 5;13(1):10870. doi: 10.1038/s41598-023-37118-9.
2
Discovering optimal strategies for mitigating COVID-19 spread using machine learning: Experience from Asia.利用机器学习发现缓解新冠病毒传播的最佳策略:来自亚洲的经验。
Sustain Cities Soc. 2021 Dec;75:103254. doi: 10.1016/j.scs.2021.103254. Epub 2021 Aug 13.
3
Correlating dynamic climate conditions and socioeconomic-governmental factors to spatiotemporal spread of COVID-19 via semantic segmentation deep learning analysis.
MGLEP:基于大数据的多模态图学习模型在新兴传染病中的应用。
Sci Rep. 2024 Jul 16;14(1):16377. doi: 10.1038/s41598-024-67146-y.
4
A Systematic Literature Review and Future Perspectives for Handling Big Data Analytics in COVID-19 Diagnosis.COVID-19诊断中处理大数据分析的系统文献综述及未来展望
New Gener Comput. 2023;41(2):243-280. doi: 10.1007/s00354-023-00211-8. Epub 2023 Mar 16.
5
Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain's case study.通过经典机器学习模型和集成学习模型预测 COVID-19 的传播:西班牙案例研究。
Sci Rep. 2023 Apr 25;13(1):6750. doi: 10.1038/s41598-023-33795-8.
6
COVID-19 case prediction using emotion trends Twitter emoji analysis: A case study in Japan.利用情绪趋势预测 COVID-19 病例:以日本为例的 Twitter 表情符号分析
Front Public Health. 2023 Mar 14;11:1079315. doi: 10.3389/fpubh.2023.1079315. eCollection 2023.
7
Relevance of E-Participation in the state health campaign in Spain: #EstoNoEsUnJuego / #ThisIsNotAGame.电子参与在西班牙国家健康运动中的相关性:#这不是一场游戏 / #ThisIsNotAGame 。
Technol Soc. 2022 Feb;68:101877. doi: 10.1016/j.techsoc.2022.101877. Epub 2022 Jan 11.
8
Deep learning modeling of public's sentiments towards temporal evolution of COVID-19 transmission.公众对新冠病毒传播时间演变的情绪的深度学习建模
Appl Soft Comput. 2022 Dec;131:109728. doi: 10.1016/j.asoc.2022.109728. Epub 2022 Oct 20.
9
Twitter conversations predict the daily confirmed COVID-19 cases.推特对话可预测每日新增新冠肺炎确诊病例。
Appl Soft Comput. 2022 Nov;129:109603. doi: 10.1016/j.asoc.2022.109603. Epub 2022 Sep 5.
10
Deep Neural Networks for Optimal Selection of Features Related to Flu.用于流感相关特征最优选择的深度神经网络
Evid Based Complement Alternat Med. 2022 Jul 14;2022:7639875. doi: 10.1155/2022/7639875. eCollection 2022.
通过语义分割深度学习分析,将动态气候条件和社会经济-政府因素与新冠病毒病的时空传播相关联。
Sustain Cities Soc. 2021 Dec;75:103231. doi: 10.1016/j.scs.2021.103231. Epub 2021 Aug 5.
4
Prediction of COVID-19 Waves Using Social Media and Google Search: A Case Study of the US and Canada.利用社交媒体和谷歌搜索预测新冠疫情浪潮:以美国和加拿大为例
Front Public Health. 2021 Apr 16;9:656635. doi: 10.3389/fpubh.2021.656635. eCollection 2021.
5
Transfer learning for establishment of recognition of COVID-19 on CT imaging using small-sized training datasets.使用小型训练数据集进行迁移学习以建立基于CT成像的新型冠状病毒肺炎识别模型
Knowl Based Syst. 2021 Apr 22;218:106849. doi: 10.1016/j.knosys.2021.106849. Epub 2021 Feb 6.
6
Public attention about COVID-19 on social media: An investigation based on data mining and text analysis.社交媒体上公众对新冠病毒的关注:基于数据挖掘和文本分析的调查
Pers Individ Dif. 2021 Jun;175:110701. doi: 10.1016/j.paid.2021.110701. Epub 2021 Jan 28.
7
What social media told us in the time of COVID-19: a scoping review.社交媒体在 COVID-19 大流行期间告诉了我们什么:范围综述。
Lancet Digit Health. 2021 Mar;3(3):e175-e194. doi: 10.1016/S2589-7500(20)30315-0. Epub 2021 Jan 28.
8
COVID-19: Forecasting confirmed cases and deaths with a simple time series model.新冠疫情:用简单时间序列模型预测确诊病例和死亡人数
Int J Forecast. 2022 Apr-Jun;38(2):439-452. doi: 10.1016/j.ijforecast.2020.11.010. Epub 2020 Dec 4.
9
The use of Twitter by state leaders and its impact on the public during the COVID-19 pandemic.国家领导人在新冠疫情期间使用推特及其对公众的影响。
Heliyon. 2020 Nov 19;6(11):e05540. doi: 10.1016/j.heliyon.2020.e05540. eCollection 2020 Nov.
10
Spatio-temporal estimation of the daily cases of COVID-19 in worldwide using random forest machine learning algorithm.使用随机森林机器学习算法对全球范围内新冠肺炎每日病例数进行时空估计。
Chaos Solitons Fractals. 2020 Nov;140:110210. doi: 10.1016/j.chaos.2020.110210. Epub 2020 Aug 20.