Rudra Koustav, Sharma Ashish, Ganguly Niloy, Imran Muhammad
1IIT Kharagpur, Kharagpur, India.
2Qatar Computing Research Institute, HBKU, Doha, Qatar.
Inf Syst Front. 2018;20(5):933-948. doi: 10.1007/s10796-018-9844-9. Epub 2018 Mar 20.
During a new disease outbreak, frustration and uncertainties among affected and vulnerable population increase. Affected communities look for known symptoms, prevention measures, and treatment strategies. On the other hand, health organizations try to get situational updates to assess the severity of the outbreak, known affected cases, and other details. Recent emergence of social media platforms such as Twitter provide convenient ways and fast access to disseminate and consume information to/from a wider audience. Research studies have shown potential of this online information to address information needs of concerned authorities during outbreaks, epidemics, and pandemics. In this work, we target three types of end-users (i) vulnerable population-people who are not yet affected and are looking for prevention related information (ii) affected population-people who are affected and looking for treatment related information, and (iii) health organizations-like WHO, who are interested in gaining situational awareness to make timely decisions. We use Twitter data from two recent outbreaks (Ebola and MERS) to build an automatic classification approach useful to categorize tweets into different disease related categories. Moreover, the classified messages are used to generate different kinds of summaries useful for affected and vulnerable communities as well as health organizations. Results obtained from extensive experimentation show the effectiveness of the proposed approach.
在新疾病爆发期间,受影响人群和弱势群体中的挫败感和不确定性会增加。受影响的社区会寻找已知症状、预防措施和治疗策略。另一方面,卫生组织试图获取疫情最新情况,以评估疫情的严重程度、已知的受影响病例及其他细节。诸如推特等社交媒体平台的近期出现,为向更广泛受众传播和获取信息提供了便捷途径和快速渠道。研究表明,这种在线信息在疫情、流行病和大流行期间满足相关当局信息需求方面具有潜力。在这项工作中,我们针对三类终端用户:(i)弱势群体——尚未受到影响且在寻找预防相关信息的人群;(ii)受影响人群——已受影响且在寻找治疗相关信息的人群;以及(iii)卫生组织,如世界卫生组织,它们希望了解疫情态势以便及时做出决策。我们使用来自最近两次疫情(埃博拉和中东呼吸综合征)的推特数据,构建一种自动分类方法,用于将推文分类到不同的疾病相关类别。此外,分类后的信息用于生成对受影响和弱势群体以及卫生组织有用的各类摘要。广泛实验获得的结果表明了所提方法的有效性。