Madichetty Sreenivasulu, M Sridevi
Department of Computer Science and Engineering, National Institute of Technology, Tiruchirappalli, India.
Multimed Tools Appl. 2021;80(3):3927-3949. doi: 10.1007/s11042-020-09873-8. Epub 2020 Sep 25.
Social media platform like Twitter is one of the primary sources for sharing real-time information at the time of events such as disasters, political events, etc. Detecting the resource tweets during a disaster is an essential task because tweets contain different types of information such as infrastructure damage, resources, opinions and sympathies of disaster events, etc. Tweets are posted related to Need and Availability of Resources (NAR) by humanitarian organizations and victims. Hence, reliable methodologies are required for detecting the NAR tweets during a disaster. The existing works don't focus well on NAR tweets detection and also had poor performance. Hence, this paper focus on detection of NAR tweets during a disaster. Existing works often use features and appropriate machine learning algorithms on several Natural Language Processing (NLP) tasks. Recently, there is a wide use of Convolutional Neural Networks (CNN) in text classification problems. However, it requires a large amount of manual labeled data. There is no such large labeled data is available for NAR tweets during a disaster. To overcome this problem, stacking of Convolutional Neural Networks with traditional feature based classifiers is proposed for detecting the NAR tweets. In our approach, we propose several informative features such as aid, need, food, packets, earthquake, etc. are used in the classifier and CNN. The learned features (output of CNN and classifier with informative features) are utilized in another classifier (meta-classifier) for detection of NAR tweets. The classifiers such as SVM, KNN, Decision tree, and Naive Bayes are used in the proposed model. From the experiments, we found that the usage of KNN (base classifier) and SVM (meta classifier) with the combination of CNN in the proposed model outperform the other algorithms. This paper uses 2015 and 2016 Nepal and Italy earthquake datasets for experimentation. The experimental results proved that the proposed model achieves the best accuracy compared to baseline methods.
像推特这样的社交媒体平台是在灾难、政治事件等发生时分享实时信息的主要来源之一。在灾难期间检测资源推文是一项重要任务,因为推文中包含不同类型的信息,如基础设施损坏情况、资源、对灾难事件的看法和同情等。人道主义组织和受害者会发布与资源需求和可用性(NAR)相关的推文。因此,需要可靠的方法来在灾难期间检测NAR推文。现有工作对NAR推文检测的关注不够,性能也较差。因此,本文专注于灾难期间NAR推文的检测。现有工作在多个自然语言处理(NLP)任务中经常使用特征和适当的机器学习算法。最近,卷积神经网络(CNN)在文本分类问题中得到了广泛应用。然而,它需要大量的人工标注数据。在灾难期间,没有如此大量的标注数据可用于NAR推文。为了克服这个问题,提出了将卷积神经网络与传统基于特征的分类器堆叠起来用于检测NAR推文。在我们的方法中,我们提出了几个信息丰富的特征,如援助、需求、食物、包裹、地震等,并将其用于分类器和CNN中。学到的特征(CNN和带有信息丰富特征的分类器的输出)被用于另一个分类器(元分类器)来检测NAR推文。所提出的模型中使用了支持向量机、K近邻、决策树和朴素贝叶斯等分类器。通过实验,我们发现所提出的模型中使用K近邻(基础分类器)和支持向量机(元分类器)并结合CNN的方法优于其他算法。本文使用2015年和2016年尼泊尔和意大利地震数据集进行实验。实验结果证明,与基线方法相比,所提出的模型具有最高的准确率。