Suppr超能文献

监测食源性疾病的识别方法:现有公共卫生监测技术综述

Identifying Methods for Monitoring Foodborne Illness: Review of Existing Public Health Surveillance Techniques.

作者信息

Oldroyd Rachel A, Morris Michelle A, Birkin Mark

机构信息

Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.

School of Geography, University of Leeds, Leeds, United Kingdom.

出版信息

JMIR Public Health Surveill. 2018 Jun 6;4(2):e57. doi: 10.2196/publichealth.8218.

Abstract

BACKGROUND

Traditional methods of monitoring foodborne illness are associated with problems of untimeliness and underreporting. In recent years, alternative data sources such as social media data have been used to monitor the incidence of disease in the population (infodemiology and infoveillance). These data sources prove timelier than traditional general practitioner data, they can help to fill the gaps in the reporting process, and they often include additional metadata that is useful for supplementary research.

OBJECTIVE

The aim of the study was to identify and formally analyze research papers using consumer-generated data, such as social media data or restaurant reviews, to quantify a disease or public health ailment. Studies of this nature are scarce within the food safety domain, therefore identification and understanding of transferrable methods in other health-related fields are of particular interest.

METHODS

Structured scoping methods were used to identify and analyze primary research papers using consumer-generated data for disease or public health surveillance. The title, abstract, and keyword fields of 5 databases were searched using predetermined search terms. A total of 5239 papers matched the search criteria, of which 145 were taken to full-text review-62 papers were deemed relevant and were subjected to data characterization and thematic analysis.

RESULTS

The majority of studies (40/62, 65%) focused on the surveillance of influenza-like illness. Only 10 studies (16%) used consumer-generated data to monitor outbreaks of foodborne illness. Twitter data (58/62, 94%) and Yelp reviews (3/62, 5%) were the most commonly used data sources. Studies reporting high correlations against baseline statistics used advanced statistical and computational approaches to calculate the incidence of disease. These include classification and regression approaches, clustering approaches, and lexicon-based approaches. Although they are computationally intensive due to the requirement of training data, studies using classification approaches reported the best performance.

CONCLUSIONS

By analyzing studies in digital epidemiology, computer science, and public health, this paper has identified and analyzed methods of disease monitoring that can be transferred to foodborne disease surveillance. These methods fall into 4 main categories: basic approach, classification and regression, clustering approaches, and lexicon-based approaches. Although studies using a basic approach to calculate disease incidence generally report good performance against baseline measures, they are sensitive to chatter generated by media reports. More computationally advanced approaches are required to filter spurious messages and protect predictive systems against false alarms. Research using consumer-generated data for monitoring influenza-like illness is expansive; however, research regarding the use of restaurant reviews and social media data in the context of food safety is limited. Considering the advantages reported in this review, methods using consumer-generated data for foodborne disease surveillance warrant further investment.

摘要

背景

传统的食源性疾病监测方法存在不及时和报告不足的问题。近年来,社交媒体数据等替代数据源已被用于监测人群中的疾病发病率(信息流行病学和信息监测)。这些数据源比传统的全科医生数据更及时,有助于填补报告过程中的空白,并且通常包含对补充研究有用的额外元数据。

目的

本研究的目的是识别并正式分析使用消费者生成的数据(如社交媒体数据或餐厅评论)来量化疾病或公共卫生问题的研究论文。在食品安全领域,这类性质的研究很少,因此识别并了解其他健康相关领域中可转移的方法尤为重要。

方法

采用结构化的范围界定方法来识别和分析使用消费者生成的数据进行疾病或公共卫生监测的原创研究论文。使用预先确定的搜索词搜索5个数据库的标题、摘要和关键词字段。共有5239篇论文符合搜索标准,其中145篇进入全文评审——62篇被认为相关,并进行了数据特征描述和主题分析。

结果

大多数研究(40/62,65%)聚焦于流感样疾病的监测。只有10项研究(16%)使用消费者生成的数据来监测食源性疾病的暴发。推特数据(58/62,94%)和Yelp评论(3/62,5%)是最常用的数据源。报告与基线统计数据高度相关的研究使用了先进的统计和计算方法来计算疾病发病率。这些方法包括分类和回归方法、聚类方法以及基于词典的方法。尽管由于需要训练数据,这些方法计算量很大,但使用分类方法的研究表现最佳。

结论

通过分析数字流行病学、计算机科学和公共卫生领域的研究,本文识别并分析了可转移到食源性疾病监测的疾病监测方法。这些方法主要分为4类:基本方法、分类和回归方法、聚类方法以及基于词典的方法。尽管使用基本方法计算疾病发病率的研究通常报告与基线测量相比表现良好,但它们对媒体报道产生的闲聊很敏感。需要更先进的计算方法来过滤虚假信息,保护预测系统免受误报影响。使用消费者生成的数据监测流感样疾病的研究很多;然而,在食品安全背景下使用餐厅评论和社交媒体数据的研究有限。考虑到本综述中报告的优势,使用消费者生成的数据进行食源性疾病监测的方法值得进一步投入研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/568a/6010836/55a4fd574b0c/publichealth_v4i2e57_fig1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验