• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

城市应急:一个基于公民传感器的应急事件数据集,用于监测厄瓜多尔的城市场景。

UrbangEnCy: An emergency events dataset based on citizen sensors for monitoring urban scenarios in Ecuador.

作者信息

Parraga-Alava Jorge, Alcivar-Cevallos Roberth, Vaca-Cardenas Leticia, Meza Jaime

机构信息

Facultad de Ciencias Informáticas, Universidad Técnica de Manabí, Avenida Jose María Urbina, Portoviejo 130104, Ecuador.

Departamento de Ingeniería Informática, Universidad de Santiago de Chile, Av. Ecuador 3659, Santiago 9160000, Chile.

出版信息

Data Brief. 2020 Dec 24;34:106693. doi: 10.1016/j.dib.2020.106693. eCollection 2021 Feb.

DOI:10.1016/j.dib.2020.106693
PMID:33490324
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7804594/
Abstract

Recently, the use of the citizen-sensors (people generating and sharing real data by social media) for detecting and disseminating emergency events in real-time have shown a considerable increase because people at the place of the event, as well as elsewhere, can quickly post relevant information on this type of alerts. Here, we present an emergency events dataset called . The dataset contains over 25500 texts in Spanish posted on Twitter from January 19th to August 19th, 2020, with emergencies and non-emergencies related content in Ecuador. We obtained, cleaned and, filtered these tweets and, then we selected the location and temporal data as well as tweet content. Besides, the data set includes annotations regarding the type of tweet (emergency / non-emergency) as well as additional nomenclature used to describe emergencies in the Center for immediate response service to emergencies (ECU 911) of Ecuador and international emergency services agencies (ESAs). dataset facilitates evaluating data science performance, machine learning, and natural language processing algorithms used with supervised and unsupervised problems re- related to text mining and pattern recognition. The dataset is freely and publicly available at https://doi.org/10.17632/4x37zz82k8.

摘要

最近,公民传感器(即通过社交媒体生成并分享真实数据的人)用于实时检测和传播紧急事件的情况显著增加,因为事件发生地以及其他地方的人们能够迅速发布有关此类警报的相关信息。在此,我们展示一个名为 的紧急事件数据集。该数据集包含2020年1月19日至8月19日期间在推特上发布的超过25500条西班牙语推文,内容涉及厄瓜多尔的紧急情况和非紧急情况。我们获取、清理并筛选了这些推文,然后选取了位置和时间数据以及推文内容。此外,该数据集包括有关推文类型(紧急/非紧急)的注释,以及用于描述厄瓜多尔紧急情况即时响应服务中心(ECU 911)和国际紧急服务机构(ESA)中紧急情况的其他术语。该数据集便于评估与文本挖掘和模式识别相关的监督和无监督问题中所使用的数据科学性能、机器学习和自然语言处理算法。该数据集可在https://doi.org/10.17632/4x37zz82k8上免费公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ec8/7804594/d6166214889e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ec8/7804594/e814e324e299/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ec8/7804594/d6166214889e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ec8/7804594/e814e324e299/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ec8/7804594/d6166214889e/gr2.jpg

相似文献

1
UrbangEnCy: An emergency events dataset based on citizen sensors for monitoring urban scenarios in Ecuador.城市应急:一个基于公民传感器的应急事件数据集,用于监测厄瓜多尔的城市场景。
Data Brief. 2020 Dec 24;34:106693. doi: 10.1016/j.dib.2020.106693. eCollection 2021 Feb.
2
Machine Learning and Natural Language Processing for Geolocation-Centric Monitoring and Characterization of Opioid-Related Social Media Chatter.基于机器学习和自然语言处理的地理定位中心监测和特征描述阿片类药物相关社交媒体聊天。
JAMA Netw Open. 2019 Nov 1;2(11):e1914672. doi: 10.1001/jamanetworkopen.2019.14672.
3
Online Influence and Sentiment of Fitness Tweets: Analysis of Two Million Fitness Tweets.健身推文的网络影响力与情感分析:两百万条健身推文剖析
JMIR Public Health Surveill. 2017 Oct 31;3(4):e82. doi: 10.2196/publichealth.8507.
4
Evaluating Behavioral and Linguistic Changes During Drug Treatment for Depression Using Tweets in Spanish: Pairwise Comparison Study.使用西班牙语推文评估抑郁症药物治疗期间的行为和语言变化:成对比较研究。
J Med Internet Res. 2020 Dec 18;22(12):e20920. doi: 10.2196/20920.
5
Detection of Hate Speech in COVID-19-Related Tweets in the Arab Region: Deep Learning and Topic Modeling Approach.检测阿拉伯地区与 COVID-19 相关推文的仇恨言论:深度学习和主题建模方法。
J Med Internet Res. 2020 Dec 8;22(12):e22609. doi: 10.2196/22609.
6
Tweet Now, See You In the ED Later? Examining the Association Between Alcohol-related Tweets and Emergency Care Visits.现在发推文,稍后在急诊室见?探究与酒精相关的推文和急诊就诊之间的关联。
Acad Emerg Med. 2016 Jul;23(7):831-4. doi: 10.1111/acem.12983. Epub 2016 Jun 20.
7
Classification of Twitter Users Who Tweet About E-Cigarettes.发布关于电子烟推文的推特用户分类。
JMIR Public Health Surveill. 2017 Sep 26;3(3):e63. doi: 10.2196/publichealth.8060.
8
RoCoLe: A coffee leaf images dataset for evaluation of machine learning based methods in plant diseases recognition.RoCoLe:一个用于评估基于机器学习的植物病害识别方法的咖啡叶图像数据集。
Data Brief. 2019 Aug 19;25:104414. doi: 10.1016/j.dib.2019.104414. eCollection 2019 Aug.
9
Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter.应用多种数据收集工具量化推特上的人乳头瘤病毒疫苗传播情况
J Med Internet Res. 2016 Dec 5;18(12):e318. doi: 10.2196/jmir.6670.
10
Emotion dataset from Indonesian public opinion.来自印尼民意的情感数据集。
Data Brief. 2022 Jul 12;43:108465. doi: 10.1016/j.dib.2022.108465. eCollection 2022 Aug.

本文引用的文献

1
Assessment of supervised classifiers for the task of detecting messages with suicidal ideation.用于检测具有自杀意念信息任务的监督分类器评估。
Heliyon. 2020 Aug 3;6(8):e04412. doi: 10.1016/j.heliyon.2020.e04412. eCollection 2020 Aug.
2
Interrater reliability: the kappa statistic.组内一致性:kappa 统计量。
Biochem Med (Zagreb). 2012;22(3):276-82.