• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

挖掘谷歌趋势数据以进行结直肠癌(CRC)患病率的即时预报和预测。

Mining Google Trends data for nowcasting and forecasting colorectal cancer (CRC) prevalence.

作者信息

Tudor Cristiana, Sova Robert Aurelian

机构信息

Bucharest University of Economic Studies, Bucharest, Romania.

出版信息

PeerJ Comput Sci. 2023 Oct 4;9:e1518. doi: 10.7717/peerj-cs.1518. eCollection 2023.

DOI:10.7717/peerj-cs.1518
PMID:37869464
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10588692/
Abstract

BACKGROUND

Colorectal cancer (CRC) is the third most prevalent and second most lethal form of cancer in the world. Consequently, CRC cancer prevalence projections are essential for assessing the future burden of the disease, planning resource allocation, and developing service delivery strategies, as well as for grasping the shifting environment of cancer risk factors. However, unlike cancer incidence and mortality rates, national and international agencies do not routinely issue projections for cancer prevalence. Moreover, the limited or even nonexistent cancer statistics for large portions of the world, along with the high heterogeneity among world nations, further complicate the task of producing timely and accurate CRC prevalence projections. In this situation, population interest, as shown by Internet searches, can be very important for improving cancer statistics and, in the long run, for helping cancer research.

METHODS

This study aims to model, nowcast and forecast the CRC prevalence at the global level using a three-step framework that incorporates three well-established univariate statistical and machine-learning models. First, data mining is performed to evaluate the relevancy of Google Trends (GT) data as a surrogate for the number of CRC survivors. The results demonstrate that population web-search interest in the term "colonoscopy" is the most reliable indicator to nowcast CRC disease prevalence. Then, various statistical and machine-learning models, including ARIMA, ETS, and FNNAR, are trained and tested using relevant GT time series. Finally, the updated monthly query series spanning 2004-2022 and the best forecasting model in terms of out-of-sample forecasting ability (., the neural network autoregression) are utilized to generate point forecasts up to 2025.

RESULTS

Results show that the number of people with colorectal cancer will continue to rise over the next 24 months. This in turn emphasizes the urgency for public policies aimed at reducing the population's exposure to the principal modifiable risk factors, such as lifestyle and nutrition. In addition, given the major drop in population interest in CRC during the first wave of the COVID-19 pandemic, the findings suggest that public health authorities should implement measures to increase cancer screening rates during pandemics. This in turn would deliver positive externalities, including the mitigation of the global burden and the enhancement of the quality of official statistics.

摘要

背景

结直肠癌(CRC)是全球第三大常见癌症,也是第二大致命癌症。因此,结直肠癌患病率预测对于评估该疾病未来的负担、规划资源分配、制定服务提供策略以及了解癌症风险因素不断变化的环境至关重要。然而,与癌症发病率和死亡率不同,国家和国际机构并不定期发布癌症患病率预测。此外,世界上大部分地区的癌症统计数据有限甚至不存在,再加上世界各国之间存在高度异质性,使得及时准确地进行结直肠癌患病率预测的任务更加复杂。在这种情况下,互联网搜索显示出的公众关注度对于改善癌症统计数据,从长远来看,对于帮助癌症研究可能非常重要。

方法

本研究旨在使用一个三步框架对全球范围内的结直肠癌患病率进行建模、即时预测和预测,该框架纳入了三种成熟的单变量统计和机器学习模型。首先,进行数据挖掘以评估谷歌趋势(GT)数据作为结直肠癌幸存者数量替代指标的相关性。结果表明,人群对“结肠镜检查”一词的网络搜索兴趣是即时预测结直肠癌疾病患病率最可靠的指标。然后,使用相关的GT时间序列对包括自回归积分移动平均(ARIMA)、误差项、趋势和季节性(ETS)以及前馈神经网络自回归(FNNAR)在内的各种统计和机器学习模型进行训练和测试。最后,利用涵盖2004 - 2022年的更新月度查询序列以及在样本外预测能力方面表现最佳的预测模型(即神经网络自回归)生成直至2025年的点预测。

结果

结果显示,在接下来的24个月里,结直肠癌患者数量将持续上升。这反过来强调了旨在减少人群接触主要可改变风险因素(如生活方式和营养)的公共政策的紧迫性。此外,鉴于在新冠疫情第一波期间人群对结直肠癌的关注度大幅下降,研究结果表明公共卫生当局应采取措施在疫情期间提高癌症筛查率。这反过来将产生积极的外部效应,包括减轻全球负担和提高官方统计数据的质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/95b9b4111230/peerj-cs-09-1518-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/2c3aa4a42da7/peerj-cs-09-1518-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/5460b17f49bf/peerj-cs-09-1518-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/a3b43d780aa0/peerj-cs-09-1518-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/473fd536664b/peerj-cs-09-1518-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/7f2de2b68aca/peerj-cs-09-1518-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/dd5655ff9445/peerj-cs-09-1518-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/f734762fcca2/peerj-cs-09-1518-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/e7d9d624a275/peerj-cs-09-1518-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/36201f3f01da/peerj-cs-09-1518-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/9327ceae3064/peerj-cs-09-1518-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/ec0d5c15652e/peerj-cs-09-1518-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/afa8ace595a7/peerj-cs-09-1518-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/95b9b4111230/peerj-cs-09-1518-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/2c3aa4a42da7/peerj-cs-09-1518-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/5460b17f49bf/peerj-cs-09-1518-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/a3b43d780aa0/peerj-cs-09-1518-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/473fd536664b/peerj-cs-09-1518-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/7f2de2b68aca/peerj-cs-09-1518-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/dd5655ff9445/peerj-cs-09-1518-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/f734762fcca2/peerj-cs-09-1518-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/e7d9d624a275/peerj-cs-09-1518-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/36201f3f01da/peerj-cs-09-1518-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/9327ceae3064/peerj-cs-09-1518-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/ec0d5c15652e/peerj-cs-09-1518-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/afa8ace595a7/peerj-cs-09-1518-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fafc/10588692/95b9b4111230/peerj-cs-09-1518-g013.jpg

相似文献

1
Mining Google Trends data for nowcasting and forecasting colorectal cancer (CRC) prevalence.挖掘谷歌趋势数据以进行结直肠癌(CRC)患病率的即时预报和预测。
PeerJ Comput Sci. 2023 Oct 4;9:e1518. doi: 10.7717/peerj-cs.1518. eCollection 2023.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
A Novel Approach to Modeling and Forecasting Cancer Incidence and Mortality Rates through Web Queries and Automated Forecasting Algorithms: Evidence from Romania.一种通过网络查询和自动预测算法对癌症发病率和死亡率进行建模与预测的新方法:来自罗马尼亚的证据。
Biology (Basel). 2022 Jun 3;11(6):857. doi: 10.3390/biology11060857.
4
Global fertility in 204 countries and territories, 1950-2021, with forecasts to 2100: a comprehensive demographic analysis for the Global Burden of Disease Study 2021.204 个国家和地区的全球生育率,1950-2021 年,预测至 2100 年:2021 年全球疾病负担研究的综合人口分析。
Lancet. 2024 May 18;403(10440):2057-2099. doi: 10.1016/S0140-6736(24)00550-6. Epub 2024 Mar 20.
5
Burden of disease scenarios for 204 countries and territories, 2022-2050: a forecasting analysis for the Global Burden of Disease Study 2021.204 个国家和地区 2022-2050 年疾病负担情景:2021 年全球疾病负担研究的预测分析。
Lancet. 2024 May 18;403(10440):2204-2256. doi: 10.1016/S0140-6736(24)00685-8.
6
A COVID-19 Pandemic Artificial Intelligence-Based System With Deep Learning Forecasting and Automatic Statistical Data Acquisition: Development and Implementation Study.一种基于人工智能的新冠肺炎大流行深度学习预测与自动统计数据采集系统:开发与实施研究
J Med Internet Res. 2021 May 20;23(5):e27806. doi: 10.2196/27806.
7
Forecasting the COVID-19 Epidemic by Integrating Symptom Search Behavior Into Predictive Models: Infoveillance Study.将症状搜索行为纳入预测模型预测 COVID-19 疫情:信息监测研究。
J Med Internet Res. 2021 Aug 11;23(8):e28876. doi: 10.2196/28876.
8
Colon Cancer Screening - Is It Time Yet?结肠癌筛查——是时候了吗?
J Coll Physicians Surg Pak. 2017 Jun;27(6):327-328.
9
Infodemiological study on the impact of the COVID-19 pandemic on increased headache incidences at the world level.关于 COVID-19 大流行对全球头痛发生率增加的影响的信息流行病学研究。
Sci Rep. 2022 Jun 17;12(1):10253. doi: 10.1038/s41598-022-13663-7.
10
The Validity of Google Trends Search Volumes for Behavioral Forecasting of National Suicide Rates in Ireland.谷歌趋势搜索量用于预测爱尔兰全国自杀率的有效性。
Int J Environ Res Public Health. 2019 Sep 2;16(17):3201. doi: 10.3390/ijerph16173201.

引用本文的文献

1
Efficient and Accurate Brain Tumor Classification Using Hybrid MobileNetV2-Support Vector Machine for Magnetic Resonance Imaging Diagnostics in Neoplasms.使用混合MobileNetV2-支持向量机进行高效准确的脑肿瘤分类用于肿瘤磁共振成像诊断
Brain Sci. 2024 Nov 25;14(12):1178. doi: 10.3390/brainsci14121178.
2
Usefulness and safety of new ultrasmall-diameter colonoscope for cases with difficult insertion: a retrospective study.新型超细直径结肠镜在困难插入病例中的应用及安全性:一项回顾性研究。
Sci Rep. 2024 Sep 14;14(1):21506. doi: 10.1038/s41598-024-72689-1.

本文引用的文献

1
Common Statistical Errors in Scientific Investigations: A Simple Guide to Avoid Unfounded Decisions.科学研究中常见的统计错误:避免无根据决策的简易指南。
Cureus. 2023 Jan 4;15(1):e33351. doi: 10.7759/cureus.33351. eCollection 2023 Jan.
2
Global burden of colorectal cancer in 2020 and 2040: incidence and mortality estimates from GLOBOCAN.2020年和2040年全球结直肠癌负担:来自全球癌症负担(GLOBOCAN)的发病率和死亡率估计
Gut. 2023 Feb;72(2):338-344. doi: 10.1136/gutjnl-2022-327736. Epub 2022 Sep 8.
3
Role of Artificial Intelligence and Machine Learning in Prediction, Diagnosis, and Prognosis of Cancer.
人工智能和机器学习在癌症预测、诊断及预后中的作用。
Cureus. 2022 Nov 2;14(11):e31008. doi: 10.7759/cureus.31008. eCollection 2022 Nov.
4
Metastatic colorectal cancer: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up.转移性结直肠癌:ESMO 诊断、治疗及随访临床实践指南
Ann Oncol. 2023 Jan;34(1):10-32. doi: 10.1016/j.annonc.2022.10.003. Epub 2022 Oct 25.
5
Image-based time series forecasting: A deep convolutional neural network approach.基于图像的时间序列预测:一种深度卷积神经网络方法。
Neural Netw. 2023 Jan;157:39-53. doi: 10.1016/j.neunet.2022.10.006. Epub 2022 Oct 15.
6
The global burden of cancer attributable to risk factors, 2010-19: a systematic analysis for the Global Burden of Disease Study 2019.归因于风险因素的癌症全球负担,2010-19 年:2019 年全球疾病负担研究的系统分析。
Lancet. 2022 Aug 20;400(10352):563-591. doi: 10.1016/S0140-6736(22)01438-6.
7
A Novel Approach to Modeling and Forecasting Cancer Incidence and Mortality Rates through Web Queries and Automated Forecasting Algorithms: Evidence from Romania.一种通过网络查询和自动预测算法对癌症发病率和死亡率进行建模与预测的新方法:来自罗马尼亚的证据。
Biology (Basel). 2022 Jun 3;11(6):857. doi: 10.3390/biology11060857.
8
Infodemiological study on the impact of the COVID-19 pandemic on increased headache incidences at the world level.关于 COVID-19 大流行对全球头痛发生率增加的影响的信息流行病学研究。
Sci Rep. 2022 Jun 17;12(1):10253. doi: 10.1038/s41598-022-13663-7.
9
Changes in cancer screening before and during COVID-19: findings from the Korean National Cancer Screening Survey 2019 and 2020.癌症筛查在 COVID-19 前后的变化:来自 2019 年和 2020 年韩国国家癌症筛查调查的结果。
Epidemiol Health. 2022;44:e2022051. doi: 10.4178/epih.e2022051. Epub 2022 May 30.
10
The impact of the COVID-19 pandemic on cancer care.新冠疫情对癌症护理的影响。
Nat Cancer. 2020 Jun;1(6):565-567. doi: 10.1038/s43018-020-0074-y.