• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于互联网资源的疾病相关主题提取系统(DiTeX)。

DiTeX: Disease-related topic extraction system through internet-based sources.

机构信息

Department of Computer Science, Sangmyung University, Seoul, South Korea.

出版信息

PLoS One. 2018 Aug 3;13(8):e0201933. doi: 10.1371/journal.pone.0201933. eCollection 2018.

DOI:10.1371/journal.pone.0201933
PMID:30075009
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6075781/
Abstract

This paper describes the web-based automated disease-related topic extraction system, called to DiTeX, which monitors important disease-related topics and provides associated information. National disease surveillance systems require a considerable amount of time to inform people of recent outbreaks of diseases. To solve this problem, many studies have used Internet-based sources such as news and Social Network Service (SNS). However, these sources contain many intentional elements that disturb extracting important topics. To address this challenge, we employ Natural Language Processing and an effective ranking algorithm, and develop DiTeX that provides important disease-related topics. This report describes the web front-end and back-end architecture, implementation, performance of the ranking algorithm, and captured topics of DiTeX. We describe processes for collecting Internet-based data and extracting disease-related topics based on search keywords. Our system then applies a ranking algorithm to evaluate the importance of disease-related topics extracted from these data. Finally, we conduct analysis based on real-world incidents to evaluate the performance and the effectiveness of DiTeX. To evaluate DiTeX, we analyze the ranking of well-known disease-related incidents for various ranking algorithms. The topic extraction rate of our ranking algorithm is superior to those of others. We demonstrate the validity of DiTeX by summarizing the disease-related topics of each day extracted by our system. To our knowledge, DiTeX is the world's first automated web-based real-time service system that extracts and presents disease-related topics, trends and related data through web-based sources. DiTeX is now available on the web through http://epidemic.co.kr/media/topics.

摘要

本文描述了一个基于网络的自动化疾病相关主题提取系统,称为 DiTeX,它可以监测重要的疾病相关主题,并提供相关信息。国家疾病监测系统需要相当长的时间来告知人们最近的疾病爆发情况。为了解决这个问题,许多研究都使用了基于互联网的来源,如新闻和社交网络服务 (SNS)。然而,这些来源包含了许多干扰提取重要主题的故意元素。为了解决这个挑战,我们使用了自然语言处理和有效的排名算法,并开发了 DiTeX,提供了重要的疾病相关主题。本报告描述了 DiTeX 的网络前端和后端架构、实现、排名算法的性能以及捕获的主题。我们描述了基于搜索关键字从互联网数据中提取疾病相关主题的过程。我们的系统然后应用排名算法来评估从这些数据中提取的疾病相关主题的重要性。最后,我们根据真实事件进行分析,以评估 DiTeX 的性能和有效性。为了评估 DiTeX,我们分析了各种排名算法中著名疾病相关事件的排名。我们的排名算法的主题提取率优于其他算法。我们通过总结我们系统每天提取的疾病相关主题来证明 DiTeX 的有效性。据我们所知,D iTeX 是世界上第一个通过基于网络的实时服务系统,通过基于网络的来源提取和呈现疾病相关主题、趋势和相关数据。D iTeX 现在可以通过 http://epidemic.co.kr/media/topics 在网络上访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/ce8a005aea0f/pone.0201933.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/206a11facd82/pone.0201933.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/4beb7b02d38d/pone.0201933.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/b6646d90abf3/pone.0201933.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/328a321a995d/pone.0201933.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/445c4d8a2b45/pone.0201933.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/ce8a005aea0f/pone.0201933.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/206a11facd82/pone.0201933.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/4beb7b02d38d/pone.0201933.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/b6646d90abf3/pone.0201933.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/328a321a995d/pone.0201933.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/445c4d8a2b45/pone.0201933.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc03/6075781/ce8a005aea0f/pone.0201933.g006.jpg

相似文献

1
DiTeX: Disease-related topic extraction system through internet-based sources.基于互联网资源的疾病相关主题提取系统(DiTeX)。
PLoS One. 2018 Aug 3;13(8):e0201933. doi: 10.1371/journal.pone.0201933. eCollection 2018.
2
EagleEye: A Worldwide Disease-Related Topic Extraction System Using a Deep Learning Based Ranking Algorithm and Internet-Sourced Data.鹰眼:一个基于深度学习排序算法和互联网源数据的全球疾病相关主题提取系统。
Sensors (Basel). 2021 Jul 7;21(14):4665. doi: 10.3390/s21144665.
3
Twitter mining for fine-grained syndromic surveillance.用于细粒度症状监测的推特挖掘
Artif Intell Med. 2014 Jul;61(3):153-63. doi: 10.1016/j.artmed.2014.01.002. Epub 2014 Jan 31.
4
Media scanning and verification system as a supplemental tool to disease outbreak detection & reporting at National Centre for Disease Control, Delhi.作为德里国家疾病控制中心疾病爆发检测与报告补充工具的媒体扫描与核实系统。
J Commun Dis. 2012 Mar;44(1):9-14.
5
Social media and internet-based data in global systems for public health surveillance: a systematic review.社交媒体和基于互联网的数据在全球公共卫生监测系统中的应用:系统评价。
Milbank Q. 2014 Mar;92(1):7-33. doi: 10.1111/1468-0009.12038.
6
Avian Influenza Risk Surveillance in North America with Online Media.利用在线媒体对北美的禽流感风险进行监测。
PLoS One. 2016 Nov 23;11(11):e0165688. doi: 10.1371/journal.pone.0165688. eCollection 2016.
7
The Application of Internet-Based Sources for Public Health Surveillance (Infoveillance): Systematic Review.基于互联网的公共卫生监测资源应用(信息监测):系统评价
J Med Internet Res. 2020 Mar 13;22(3):e13680. doi: 10.2196/13680.
8
Interfacing a biosurveillance portal and an international network of institutional analysts to detect biological threats.连接生物监测门户与机构分析师国际网络以检测生物威胁。
Biosecur Bioterror. 2014 Nov-Dec;12(6):325-36. doi: 10.1089/bsp.2014.0031.
9
HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports.健康地图:通过对互联网媒体报道进行自动分类和可视化来实现全球传染病监测。
J Am Med Inform Assoc. 2008 Mar-Apr;15(2):150-7. doi: 10.1197/jamia.M2544. Epub 2007 Dec 20.
10
Characterizing Twitter Discussions About HPV Vaccines Using Topic Modeling and Community Detection.使用主题建模和社区检测来刻画关于HPV疫苗的推特讨论。
J Med Internet Res. 2016 Aug 29;18(8):e232. doi: 10.2196/jmir.6045.

引用本文的文献

1
A Semantic-Based Approach for Managing Healthcare Big Data: A Survey.基于语义的医疗保健大数据管理方法:调查。
J Healthc Eng. 2020 Nov 23;2020:8865808. doi: 10.1155/2020/8865808. eCollection 2020.
2
PEACOCK: A Map-Based Multitype Infectious Disease Outbreak Information System.孔雀:一个基于地图的多类型传染病爆发信息系统。
IEEE Access. 2019 Jun 21;7:82956-82969. doi: 10.1109/ACCESS.2019.2924189. eCollection 2019.
3
DOVE: An Infectious Disease Outbreak Statistics Visualization System.DOVE:一种传染病爆发统计可视化系统。

本文引用的文献

1
Towards Large-scale Twitter Mining for Drug-related Adverse Events.迈向用于药物相关不良事件的大规模推特挖掘
SHB12 (2012). 2012 Oct 29;2012:25-32. doi: 10.1145/2389707.2389713.
2
Forecasting influenza in Hong Kong with Google search queries and statistical model fusion.利用谷歌搜索查询和统计模型融合预测香港的流感情况。
PLoS One. 2017 May 2;12(5):e0176690. doi: 10.1371/journal.pone.0176690. eCollection 2017.
3
Forecasting Influenza Outbreaks in Boroughs and Neighborhoods of New York City.预测纽约市行政区及社区的流感疫情
IEEE Access. 2018 Aug 24;6:47206-47216. doi: 10.1109/ACCESS.2018.2867030. eCollection 2018.
4
Word2vec convolutional neural networks for classification of news articles and tweets.基于词向量卷积神经网络的新闻文章和推文分类。
PLoS One. 2019 Aug 22;14(8):e0220976. doi: 10.1371/journal.pone.0220976. eCollection 2019.
PLoS Comput Biol. 2016 Nov 17;12(11):e1005201. doi: 10.1371/journal.pcbi.1005201. eCollection 2016 Nov.
4
Cloud-based Electronic Health Records for Real-time, Region-specific Influenza Surveillance.基于云的电子健康记录用于实时、特定地区的流感监测。
Sci Rep. 2016 May 11;6:25732. doi: 10.1038/srep25732.
5
Who tweets? Deriving the demographic characteristics of age, occupation and social class from twitter user meta-data.谁会发推文?从推特用户元数据中推导年龄、职业和社会阶层的人口统计学特征。
PLoS One. 2015 Mar 2;10(3):e0115545. doi: 10.1371/journal.pone.0115545. eCollection 2015.
6
Monitoring influenza epidemics in china with search query from baidu.利用百度搜索查询监测中国的流感疫情。
PLoS One. 2013 May 30;8(5):e64323. doi: 10.1371/journal.pone.0064323. Print 2013.
7
The geography of happiness: connecting twitter sentiment and expression, demographics, and objective characteristics of place.幸福地理学:连接推特情绪和表达、人口统计学以及地点的客观特征。
PLoS One. 2013 May 29;8(5):e64417. doi: 10.1371/journal.pone.0064417. Print 2013.
8
Health-related hot topic detection in online communities using text clustering.利用文本聚类技术在在线社区中发现与健康相关的热门话题。
PLoS One. 2013;8(2):e56221. doi: 10.1371/journal.pone.0056221. Epub 2013 Feb 15.
9
BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics.BEAGLE:一个用于统计系统发生学的应用程序编程接口和高性能计算库。
Syst Biol. 2012 Jan;61(1):170-3. doi: 10.1093/sysbio/syr100. Epub 2011 Oct 1.
10
Public health surveillance of dental pain via Twitter.通过 Twitter 进行口腔疼痛的公共卫生监测。
J Dent Res. 2011 Sep;90(9):1047-51. doi: 10.1177/0022034511415273. Epub 2011 Jul 18.