Malik Muhammad Shahid Iqbal, Younas Muhammad Zeeshan, Jamjoom Mona Mamdouh, Ignatov Dmitry I
Department of Computer Science, National Research University Higher School of Economics, Moscow, Russia.
Department of Computer Science, Capital University of Science and Technology, Islamabad, Pakistan.
PeerJ Comput Sci. 2024 Feb 16;10:e1859. doi: 10.7717/peerj-cs.1859. eCollection 2024.
Identification of infrastructure and human damage assessment tweets is beneficial to disaster management organizations as well as victims during a disaster. Most of the prior works focused on the detection of informative/situational tweets, and infrastructure damage, only one focused on human damage. This study presents a novel approach for detecting damage assessment tweets involving infrastructure and human damages. We investigated the potential of the Bidirectional Encoder Representations from Transformer (BERT) model to learn universal contextualized representations targeting to demonstrate its effectiveness for binary and multi-class classification of disaster damage assessment tweets. The objective is to exploit a pre-trained BERT as a transfer learning mechanism after fine-tuning important hyper-parameters on the CrisisMMD dataset containing seven disasters. The effectiveness of fine-tuned BERT is compared with five benchmarks and nine comparable models by conducting exhaustive experiments. The findings show that the fine-tuned BERT outperformed all benchmarks and comparable models and achieved state-of-the-art performance by demonstrating up to 95.12% macro-f1-score, and 88% macro-f1-score for binary and multi-class classification. Specifically, the improvement in the classification of human damage is promising.
识别有关基础设施和人员损害评估的推文,对灾害管理组织以及灾难中的受害者都有益处。大多数先前的工作都集中在检测信息性/情境性推文以及基础设施损害方面,只有一项研究关注人员损害。本研究提出了一种新颖的方法来检测涉及基础设施和人员损害的损害评估推文。我们研究了来自Transformer的双向编码器表征(BERT)模型学习通用上下文表征的潜力,旨在证明其在灾害损害评估推文的二分类和多分类中的有效性。目标是在包含七种灾害的CrisisMMD数据集上微调重要超参数后,将预训练的BERT用作迁移学习机制。通过进行详尽的实验,将微调后的BERT的有效性与五个基准和九个可比模型进行了比较。研究结果表明,微调后的BERT优于所有基准和可比模型,并通过在二分类和多分类中分别展示高达95.12%的宏F1分数和88%的宏F1分数,取得了当前最优的性能。具体而言,在人员损害分类方面的改进很有前景。