Identifying Methods for Monitoring Foodborne Illness: Review of Existing Public Health Surveillance Techniques.

Authors

Oldroyd Rachel A, Morris Michelle A, Birkin Mark

Affiliations

Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.

School of Geography, University of Leeds, Leeds, United Kingdom.

Publication information

JMIR Public Health Surveill. 2018 Jun 6;4(2):e57. doi: 10.2196/publichealth.8218.

DOI:10.2196/publichealth.8218

PMID:29875090

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6010836/

Abstract

BACKGROUND

Traditional methods of monitoring foodborne illness are associated with problems of untimeliness and underreporting. In recent years, alternative data sources such as social media data have been used to monitor the incidence of disease in the population (infodemiology and infoveillance). These data sources prove timelier than traditional general practitioner data, they can help to fill the gaps in the reporting process, and they often include additional metadata that is useful for supplementary research.

OBJECTIVE

The aim of the study was to identify and formally analyze research papers using consumer-generated data, such as social media data or restaurant reviews, to quantify a disease or public health ailment. Studies of this nature are scarce within the food safety domain; therefore, identification and understanding of transferable methods in other health-related fields are of particular interest.

METHODS

Structured scoping methods were used to identify and analyze primary research papers using consumer-generated data for disease or public health surveillance. The title, abstract, and keyword fields of 5 databases were searched using predetermined search terms. A total of 5239 papers matched the search criteria, of which 145 were taken to full-text review; 62 papers were deemed relevant and were subjected to data characterization and thematic analysis.

RESULTS

The majority of studies (40/62, 65%) focused on the surveillance of influenza-like illness. Only 10 studies (16%) used consumer-generated data to monitor outbreaks of foodborne illness. Twitter data (58/62, 94%) and Yelp reviews (3/62, 5%) were the most commonly used data sources. Studies reporting high correlations against baseline statistics used advanced statistical and computational approaches to calculate the incidence of disease. These include classification and regression approaches, clustering approaches, and lexicon-based approaches. Although they are computationally intensive due to the requirement of training data, studies using classification approaches reported the best performance.
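
As a rough illustration of the lexicon-based approaches and the comparison against baseline statistics mentioned above, the following Python sketch flags messages that contain symptom terms, counts them per week, and computes Pearson's correlation with a baseline series. The lexicon, the messages, and the baseline counts are invented for illustration and are not taken from any of the reviewed studies.

```python
import math
import re

# Hypothetical symptom lexicon; a real study would curate or learn these terms.
SYMPTOM_LEXICON = {"vomiting", "diarrhea", "nausea", "food poisoning", "stomach cramps"}


def mentions_symptom(text: str) -> bool:
    """Return True if the message contains any lexicon term as a whole word or phrase."""
    lowered = text.lower()
    return any(
        re.search(r"\b" + re.escape(term) + r"\b", lowered)
        for term in SYMPTOM_LEXICON
    )


def weekly_counts(messages):
    """Count symptom-bearing messages per week; messages are (week, text) pairs."""
    counts = {}
    for week, text in messages:
        counts.setdefault(week, 0)
        if mentions_symptom(text):
            counts[week] += 1
    return [counts[week] for week in sorted(counts)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


# Invented sample data: (week number, message text).
messages = [
    (1, "Great dinner last night"),
    (1, "Nausea all morning after that buffet"),
    (2, "Food poisoning from the taco place, vomiting since 3am"),
    (2, "stomach cramps and diarrhea, never eating there again"),
    (2, "lovely brunch"),
    (3, "Vomiting again, thanks sushi bar"),
    (3, "nausea and diarrhea today"),
    (3, "Stomach cramps after lunch"),
]
baseline = [2, 5, 7]  # hypothetical weekly case counts from a traditional source

signal = weekly_counts(messages)
r = pearson(signal, baseline)
print(signal, round(r, 3))
```

With only three weeks of invented data the coefficient means little; the point is the shape of the pipeline, in which a message-level rule produces a time series that is then validated against an external baseline.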

CONCLUSIONS

By analyzing studies in digital epidemiology, computer science, and public health, this paper has identified and analyzed methods of disease monitoring that can be transferred to foodborne disease surveillance. These methods fall into 4 main categories: basic approach, classification and regression, clustering approaches, and lexicon-based approaches. Although studies using a basic approach to calculate disease incidence generally report good performance against baseline measures, they are sensitive to chatter generated by media reports. More computationally advanced approaches are required to filter spurious messages and protect predictive systems against false alarms. Research using consumer-generated data for monitoring influenza-like illness is expansive; however, research regarding the use of restaurant reviews and social media data in the context of food safety is limited. Considering the advantages reported in this review, methods using consumer-generated data for foodborne disease surveillance warrant further investment.
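
To illustrate why a basic keyword count can be inflated by media chatter, the sketch below compares a raw count with the count left after a simple rule-based filter. The keyword, the news-marker heuristics (links and a short list of news words), the first-person check, and the sample messages are all hypothetical. The hand-written rule merely stands in for the more advanced approaches the review describes and is not taken from any reviewed study.

```python
import re

KEYWORD = "food poisoning"  # hypothetical search term for the basic approach

# Hypothetical heuristics: links and news vocabulary suggest media chatter,
# while first-person pronouns suggest a personal report.
NEWS_MARKERS = re.compile(
    r"https?://|\b(outbreak|recall|reports?|officials?|news)\b", re.IGNORECASE
)
FIRST_PERSON = re.compile(r"\b(i|my|me)\b", re.IGNORECASE)


def basic_count(messages):
    """Basic approach: count every message that contains the keyword."""
    return sum(KEYWORD in m.lower() for m in messages)


def is_personal_report(message):
    """Keep keyword messages that look first-person and carry no news markers."""
    if KEYWORD not in message.lower():
        return False
    if NEWS_MARKERS.search(message):
        return False
    return bool(FIRST_PERSON.search(message))


def filtered_count(messages):
    """Count only the messages that pass the personal-report filter."""
    return sum(is_personal_report(m) for m in messages)


# Invented sample messages mixing personal reports with news-style chatter.
messages = [
    "I got food poisoning from the kebab shop last night",
    "Officials confirm food poisoning outbreak at county fair https://example.com/story",
    "News: 40 cases of food poisoning linked to salad recall",
    "Pretty sure my food poisoning came from that buffet",
    "Food poisoning reports rise after festival",
    "Had a great lunch today",
]

raw = basic_count(messages)
filtered = filtered_count(messages)
print(raw, filtered)
```

In this invented sample the basic count sees five keyword matches while the filter keeps two, which is the kind of gap that can trigger a false alarm when a news story circulates widely.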

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/568a/6010836/55a4fd574b0c/publichealth_v4i2e57_fig1.jpg
