• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自动化词汇发现技术在网络疫情情报地理解析中的应用。

Automated vocabulary discovery for geo-parsing online epidemic intelligence.

机构信息

Children's Hospital Informatics Program at the Harvard-MIT Division of Health Sciences and Technology, 300 Longwood Ave, Boston, MA 02115, USA.

出版信息

BMC Bioinformatics. 2009 Nov 24;10:385. doi: 10.1186/1471-2105-10-385.

DOI:10.1186/1471-2105-10-385
PMID:19930702
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2787530/
Abstract

BACKGROUND

Automated surveillance of the Internet provides a timely and sensitive method for alerting on global emerging infectious disease threats. HealthMap is part of a new generation of online systems designed to monitor and visualize, on a real-time basis, disease outbreak alerts as reported by online news media and public health sources. HealthMap is of specific interest for national and international public health organizations and international travelers. A particular task that makes such a surveillance useful is the automated discovery of the geographic references contained in the retrieved outbreak alerts. This task is sometimes referred to as "geo-parsing". A typical approach to geo-parsing would demand an expensive training corpus of alerts manually tagged by a human.

RESULTS

Given that human readers perform this kind of task by using both their lexical and contextual knowledge, we developed an approach which relies on a relatively small expert-built gazetteer, thus limiting the need of human input, but focuses on learning the context in which geographic references appear. We show in a set of experiments, that this approach exhibits a substantial capacity to discover geographic locations outside of its initial lexicon.

CONCLUSION

The results of this analysis provide a framework for future automated global surveillance efforts that reduce manual input and improve timeliness of reporting.

摘要

背景

互联网自动监测为全球新发传染病威胁的预警提供了及时而敏感的方法。HealthMap 是新一代在线系统的一部分,旨在实时监测和可视化在线新闻媒体和公共卫生资源报告的疾病暴发警报。HealthMap 对国家和国际公共卫生组织以及国际旅行者具有特殊意义。此类监测的一项特定任务是自动发现检索到的暴发警报中包含的地理参考。此任务有时称为“地理解析”。典型的地理解析方法需要昂贵的人工标记的警报训练语料库。

结果

鉴于人类读者通过使用词汇和上下文知识来执行此类任务,我们开发了一种依赖于相对较小的专家构建的地名词典的方法,从而减少了对人工输入的需求,但侧重于学习地理参考出现的上下文。我们在一系列实验中表明,这种方法具有在其初始词典之外发现地理位置的强大能力。

结论

该分析的结果为未来的自动化全球监测工作提供了一个框架,该框架减少了人工输入并提高了报告的及时性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/3822f6fa4e83/1471-2105-10-385-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/7e756a7bfda2/1471-2105-10-385-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/ad9b3eec822e/1471-2105-10-385-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/3e4775364d33/1471-2105-10-385-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/e5d1ae50bc07/1471-2105-10-385-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/bfcf84284c0a/1471-2105-10-385-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/4865c1ff9638/1471-2105-10-385-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/3822f6fa4e83/1471-2105-10-385-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/7e756a7bfda2/1471-2105-10-385-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/ad9b3eec822e/1471-2105-10-385-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/3e4775364d33/1471-2105-10-385-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/e5d1ae50bc07/1471-2105-10-385-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/bfcf84284c0a/1471-2105-10-385-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/4865c1ff9638/1471-2105-10-385-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/2787530/3822f6fa4e83/1471-2105-10-385-7.jpg

相似文献

1
Automated vocabulary discovery for geo-parsing online epidemic intelligence.自动化词汇发现技术在网络疫情情报地理解析中的应用。
BMC Bioinformatics. 2009 Nov 24;10:385. doi: 10.1186/1471-2105-10-385.
2
HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports.健康地图:通过对互联网媒体报道进行自动分类和可视化来实现全球传染病监测。
J Am Med Inform Assoc. 2008 Mar-Apr;15(2):150-7. doi: 10.1197/jamia.M2544. Epub 2007 Dec 20.
3
Use of unstructured event-based reports for global infectious disease surveillance.使用基于非结构化事件的报告进行全球传染病监测。
Emerg Infect Dis. 2009 May;15(5):689-95. doi: 10.3201/eid1505.081114.
4
HealthMap: the development of automated real-time internet surveillance for epidemic intelligence.健康地图:用于疫情情报的自动化实时互联网监测系统的开发。
Euro Surveill. 2007 Nov 29;12(11):E071129.5. doi: 10.2807/esw.12.48.03322-en.
5
Surveillance for Neisseria meningitidis Disease Activity and Transmission Using Information Technology.利用信息技术监测脑膜炎奈瑟菌疾病活动和传播情况。
PLoS One. 2015 May 20;10(5):e0127406. doi: 10.1371/journal.pone.0127406. eCollection 2015.
6
A framework for enhancing spatial and temporal granularity in report-based health surveillance systems.基于报告的健康监测系统中增强时空粒度的框架。
BMC Med Inform Decis Mak. 2010 Jan 12;10:1. doi: 10.1186/1472-6947-10-1.
7
Epidemic intelligence needs of stakeholders in the Asia-Pacific region.亚太地区利益相关者的流行病情报需求。
Western Pac Surveill Response J. 2018 Dec 18;9(4):28-36. doi: 10.5365/wpsar.2018.9.2.009. eCollection 2018 Oct-Dec.
8
ID-Viewer: a visual analytics architecture for infectious diseases surveillance and response management in Pakistan.ID-Viewer:巴基斯坦传染病监测与应对管理的可视化分析架构
Public Health. 2016 May;134:72-85. doi: 10.1016/j.puhe.2016.01.006. Epub 2016 Feb 13.
9
Comparison of web-based biosecurity intelligence systems: BioCaster, EpiSPIDER and HealthMap.基于网络的生物安保情报系统比较:BioCaster、EpiSPIDER 和 HealthMap。
Transbound Emerg Dis. 2012 Jun;59(3):223-32. doi: 10.1111/j.1865-1682.2011.01258.x. Epub 2011 Dec 20.
10
An exploratory study of a text classification framework for Internet-based surveillance of emerging epidemics.基于文本的突发传染病网络监测研究框架的探索性研究
Int J Med Inform. 2011 Jan;80(1):56-66. doi: 10.1016/j.ijmedinf.2010.10.015. Epub 2010 Dec 4.

引用本文的文献

1
Web monitoring of emerging animal infectious diseases integrated in the French Animal Health Epidemic Intelligence System.网络监测新发动物传染病,集成于法国动物卫生疫情情报系统。
PLoS One. 2018 Aug 3;13(8):e0199960. doi: 10.1371/journal.pone.0199960. eCollection 2018.
2
The Potential of Social Media and Internet-Based Data in Preventing and Fighting Infectious Diseases: From Internet to Twitter.社交媒体和基于互联网的数据在预防和抗击传染病方面的潜力:从互联网到推特
Adv Exp Med Biol. 2017;972:131-139. doi: 10.1007/5584_2016_132.
3
Characterizing Influenza surveillance systems performance: application of a Bayesian hierarchical statistical model to Hong Kong surveillance data.

本文引用的文献

1
HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports.健康地图:通过对互联网媒体报道进行自动分类和可视化来实现全球传染病监测。
J Am Med Inform Assoc. 2008 Mar-Apr;15(2):150-7. doi: 10.1197/jamia.M2544. Epub 2007 Dec 20.
2
HealthMap: the development of automated real-time internet surveillance for epidemic intelligence.健康地图:用于疫情情报的自动化实时互联网监测系统的开发。
Euro Surveill. 2007 Nov 29;12(11):E071129.5. doi: 10.2807/esw.12.48.03322-en.
3
Identifying gene and protein mentions in text using conditional random fields.
流感监测系统性能特征分析:贝叶斯分层统计模型在香港监测数据中的应用。
BMC Public Health. 2014 Aug 15;14:850. doi: 10.1186/1471-2458-14-850.
4
Social media and internet-based data in global systems for public health surveillance: a systematic review.社交媒体和基于互联网的数据在全球公共卫生监测系统中的应用:系统评价。
Milbank Q. 2014 Mar;92(1):7-33. doi: 10.1111/1468-0009.12038.
5
Uncovering text mining: a survey of current work on web-based epidemic intelligence.揭示文本挖掘:基于网络的传染病智能研究现状调查。
Glob Public Health. 2012;7(7):731-49. doi: 10.1080/17441692.2012.699975. Epub 2012 Jul 11.
6
Research agenda for mass gatherings: a call to action.大型集会研究议程:行动呼吁。
Lancet Infect Dis. 2012 Mar;12(3):231-9. doi: 10.1016/S1473-3099(11)70353-X. Epub 2012 Jan 16.
使用条件随机场识别文本中的基因和蛋白质提及。
BMC Bioinformatics. 2005;6 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2105-6-S1-S6. Epub 2005 May 24.