• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

发现网络餐厅评论中的食源性疾病。

Discovering foodborne illness in online restaurant reviews.

机构信息

Computer Science Department, Data Science Institute, Columbia University, New York, NY, USA.

Bureau of Communicable Disease, New York City Department of Health and Mental Hygiene, Queens, NY, USA.

出版信息

J Am Med Inform Assoc. 2018 Dec 1;25(12):1586-1592. doi: 10.1093/jamia/ocx093.

DOI:10.1093/jamia/ocx093
PMID:29329402
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7647154/
Abstract

OBJECTIVE

We developed a system for the discovery of foodborne illness mentioned in online Yelp restaurant reviews using text classification. The system is used by the New York City Department of Health and Mental Hygiene (DOHMH) to monitor Yelp for foodborne illness complaints.

MATERIALS AND METHODS

We built classifiers for 2 tasks: (1) determining if a review indicated a person experiencing foodborne illness and (2) determining if a review indicated multiple people experiencing foodborne illness. We first developed a prototype classifier in 2012 for both tasks using a small labeled dataset. Over years of system deployment, DOHMH epidemiologists labeled 13 526 reviews selected by this classifier. We used these biased data and a sample of complementary reviews in a principled bias-adjusted training scheme to develop significantly improved classifiers. Finally, we performed an error analysis of the best resulting classifiers.

RESULTS

We found that logistic regression trained with bias-adjusted augmented data performed best for both classification tasks, with F1-scores of 87% and 66% for tasks 1 and 2, respectively.

DISCUSSION

Our error analysis revealed that the inability of our models to account for long phrases caused the most errors. Our bias-adjusted training scheme illustrates how to improve a classification system iteratively by exploiting available biased labeled data.

CONCLUSIONS

Our system has been instrumental in the identification of 10 outbreaks and 8523 complaints of foodborne illness associated with New York City restaurants since July 2012. Our evaluation has identified strong classifiers for both tasks, whose deployment will allow DOHMH epidemiologists to more effectively monitor Yelp for foodborne illness investigations.

摘要

目的

我们开发了一个使用文本分类在在线 Yelp 餐厅评论中发现食源性疾病的系统。该系统由纽约市卫生局(DOHMH)用于监测 Yelp 上的食源性疾病投诉。

材料和方法

我们为 2 个任务构建了分类器:(1)确定评论是否表明有人患有食源性疾病,(2)确定评论是否表明多人患有食源性疾病。我们首先在 2012 年使用一个小的标记数据集为这两个任务开发了一个原型分类器。在系统部署的多年中,DOHMH 流行病学家标记了这个分类器选择的 13526 条评论。我们使用这些有偏差的数据和一个补充评论的样本,在一个有原则的偏差调整训练方案中,开发了显著改进的分类器。最后,我们对最佳分类器进行了错误分析。

结果

我们发现,使用偏差调整增强数据训练的逻辑回归在两个分类任务中表现最好,分别为 87%和 66%的 F1 分数。

讨论

我们的错误分析表明,我们的模型无法解释长短语是造成错误的主要原因。我们的偏差调整训练方案说明了如何通过利用可用的有偏差的标记数据来迭代地改进分类系统。

结论

自 2012 年 7 月以来,我们的系统已经在识别与纽约市餐馆有关的 10 起暴发和 8523 起食源性疾病投诉方面发挥了重要作用。我们的评估已经确定了两个任务的强大分类器,它们的部署将使 DOHMH 流行病学家能够更有效地监测 Yelp 上的食源性疾病调查。

相似文献

1
Discovering foodborne illness in online restaurant reviews.发现网络餐厅评论中的食源性疾病。
J Am Med Inform Assoc. 2018 Dec 1;25(12):1586-1592. doi: 10.1093/jamia/ocx093.
2
Using online reviews by restaurant patrons to identify unreported cases of foodborne illness - New York City, 2012-2013.利用餐厅食客的在线评论来识别未报告的食源性疾病病例 - 纽约市,2012-2013 年。
MMWR Morb Mortal Wkly Rep. 2014 May 23;63(20):441-5.
3
Supplementing Public Health Inspection via Social Media.通过社交媒体辅助公共卫生检查
PLoS One. 2016 Mar 29;11(3):e0152117. doi: 10.1371/journal.pone.0152117. eCollection 2016.
4
Health department use of social media to identify foodborne illness - Chicago, Illinois, 2013-2014.卫生部门利用社交媒体识别食源性疾病——伊利诺伊州芝加哥,2013-2014 年。
MMWR Morb Mortal Wkly Rep. 2014 Aug 15;63(32):681-5.
5
Online restaurant reviews identify outbreaks of undetected foodborne illness.在线餐厅评论可识别未被发现的食源性疾病暴发情况。
BMJ. 2014 May 27;348:g3560. doi: 10.1136/bmj.g3560.
6
Foodborne Illness Complaint Systems Detect, and Restaurant Inspection Programs Prevent Restaurant-Associated Foodborne Illness Outbreaks.食源性疾病投诉系统可检测到食源性疾病,并可预防餐厅相关食源性疾病爆发。
Foodborne Pathog Dis. 2024 Feb;21(2):92-98. doi: 10.1089/fpd.2023.0086. Epub 2023 Nov 21.
7
Online reports of foodborne illness capture foods implicated in official foodborne outbreak reports.食源性疾病的在线报告记录了官方食源性疾病暴发报告中涉及的食物。
Prev Med. 2014 Oct;67:264-9. doi: 10.1016/j.ypmed.2014.08.003. Epub 2014 Aug 11.
8
Foodborne Outbreak Rates Associated with Restaurant Inspection Grading and Posting at the Point of Service: Evaluation Using National Foodborne Outbreak Surveillance Data.与服务点餐厅检查评级和公布相关的食源性疾病爆发率:利用国家食源性疾病监测数据进行评估。
J Food Prot. 2022 Jul 1;85(7):1000-1007. doi: 10.4315/JFP-22-007.
9
Using the National Environmental Assessment Reporting System to Enhance Foodborne Illness Outbreak Investigations in New York City Restaurants.利用国家环境评估报告系统加强纽约市餐厅食源性疾病暴发调查
J Environ Health. 2017 Apr;79(8):46-8.
10
Using Twitter to Identify and Respond to Food Poisoning: The Food Safety STL Project.利用推特识别和应对食物中毒:食品安全圣路易斯项目。
J Public Health Manag Pract. 2017 Nov/Dec;23(6):577-580. doi: 10.1097/PHH.0000000000000516.

引用本文的文献

1
Big data analytics in food industry: a state-of-the-art literature review.食品行业中的大数据分析:最新文献综述
NPJ Sci Food. 2025 Mar 21;9(1):36. doi: 10.1038/s41538-025-00394-y.
2
Foodborne Event Detection Based on Social Media Mining: A Systematic Review.基于社交媒体挖掘的食源性事件检测:系统综述
Foods. 2025 Jan 14;14(2):239. doi: 10.3390/foods14020239.
3
Internet-based surveillance to track trends in seasonal allergies across the United States.基于互联网的监测,以追踪全美国季节性过敏的趋势。
PNAS Nexus. 2024 Oct 29;3(10):pgae430. doi: 10.1093/pnasnexus/pgae430. eCollection 2024 Oct.
4
A Novel Foodborne Illness Detection and Web Application Tool Based on Social Media.一种基于社交媒体的新型食源性疾病检测与网络应用工具。
Foods. 2023 Jul 20;12(14):2769. doi: 10.3390/foods12142769.
5
Social Media Role and Its Impact on Public Health: A Narrative Review.社交媒体的作用及其对公众健康的影响:一项叙述性综述
Cureus. 2023 Jan 13;15(1):e33737. doi: 10.7759/cureus.33737. eCollection 2023 Jan.
6
Predicting Food Safety Compliance for Informed Food Outlet Inspections: A Machine Learning Approach.预测食品安全合规性以实现知情食品出口检查:一种机器学习方法。
Int J Environ Res Public Health. 2021 Nov 30;18(23):12635. doi: 10.3390/ijerph182312635.
7
Evaluation of the Membrane Damage Mechanism of Chlorogenic Acid against and and Its Application in the Preservation of Raw Pork and Skim Milk.评估绿原酸对 和 膜损伤机制及其在生猪肉和脱脂乳保鲜中的应用。
Molecules. 2021 Nov 8;26(21):6748. doi: 10.3390/molecules26216748.
8
Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media.社交媒体中潜在食源性疾病爆发相关实体的提取:众包与机器学习方法
Sci Rep. 2021 Nov 4;11(1):21678. doi: 10.1038/s41598-021-00766-w.
9
Foodborne Disease Risk Prediction Using Multigraph Structural Long Short-term Memory Networks: Algorithm Design and Validation Study.基于多重图结构长短期记忆网络的食源性疾病风险预测:算法设计与验证研究
JMIR Med Inform. 2021 Aug 2;9(8):e29433. doi: 10.2196/29433.
10
A Public Health Informatics Solution to Improving Food Safety in Restaurants: Putting the Missing Piece in the Puzzle.一种改善餐厅食品安全的公共卫生信息学解决方案:补齐拼图中的缺失部分。
Online J Public Health Inform. 2021 Apr 9;13(1):e5. doi: 10.5210/ojphi.v13i1.11087. eCollection 2021.

本文引用的文献

1
Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance.结合搜索、社交媒体和传统数据源以改善流感监测。
PLoS Comput Biol. 2015 Oct 29;11(10):e1004513. doi: 10.1371/journal.pcbi.1004513. eCollection 2015 Oct.
2
Comparing timeliness, content, and disease severity of formal and informal source outbreak reporting.比较正式和非正式来源的疫情报告的及时性、内容和疾病严重程度。
BMC Infect Dis. 2015 Mar 20;15:135. doi: 10.1186/s12879-015-0885-0.
3
Online reports of foodborne illness capture foods implicated in official foodborne outbreak reports.食源性疾病的在线报告记录了官方食源性疾病暴发报告中涉及的食物。
Prev Med. 2014 Oct;67:264-9. doi: 10.1016/j.ypmed.2014.08.003. Epub 2014 Aug 11.
4
Health department use of social media to identify foodborne illness - Chicago, Illinois, 2013-2014.卫生部门利用社交媒体识别食源性疾病——伊利诺伊州芝加哥,2013-2014 年。
MMWR Morb Mortal Wkly Rep. 2014 Aug 15;63(32):681-5.
5
Using online reviews by restaurant patrons to identify unreported cases of foodborne illness - New York City, 2012-2013.利用餐厅食客的在线评论来识别未报告的食源性疾病病例 - 纽约市,2012-2013 年。
MMWR Morb Mortal Wkly Rep. 2014 May 23;63(20):441-5.
6
Surveillance for foodborne disease outbreaks - United States, 1998-2008.食源性疾病暴发监测 - 美国,1998-2008 年。
MMWR Surveill Summ. 2013 Jun 28;62(2):1-34.
7
Foodborne illness acquired in the United States--unspecified agents.食源性疾病在美国感染--未特指病原体。
Emerg Infect Dis. 2011 Jan;17(1):16-22. doi: 10.3201/eid1701.091101p2.
8
HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports.健康地图:通过对互联网媒体报道进行自动分类和可视化来实现全球传染病监测。
J Am Med Inform Assoc. 2008 Mar-Apr;15(2):150-7. doi: 10.1197/jamia.M2544. Epub 2007 Dec 20.