Peng Zifan, Li Mingchen, Wang Yue, Ho George T S
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
Khoury College of Computer Sciences, Northeastern University, Boston, USA.
Expert Syst Appl. 2023 Nov 1;229:120501. doi: 10.1016/j.eswa.2023.120501. Epub 2023 May 18.
The COVID-19 pandemic has been accompanied by a proliferation of online misinformation and disinformation about the virus. Combating this 'infodemic' has been identified as one of the top priorities of the World Health Organization, because false and misleading information can lead to a range of negative consequences, including the spread of false remedies, conspiracy theories, and xenophobia. This paper aims to combat the COVID-19 infodemic on multiple fronts, including determining the credibility of information, identifying its potential harm to society, and the necessity of intervention by relevant organizations. We present a prompt-based curriculum learning method to achieve this goal. The proposed method could overcome the challenges of data sparsity and class imbalance issues. Using online social media texts as input, the proposed model can verify content from multiple perspectives by answering a series of questions concerning the text's reliability. Experiments revealed the effectiveness of prompt tuning and curriculum learning in assessing the reliability of COVID-19-related text. The proposed method outperforms typical text classification methods, including fastText and BERT. In addition, the proposed method is robust to the hyperparameter settings, making it more applicable with limited infrastructure resources.
新冠疫情期间,关于该病毒的网上错误信息和虚假信息大量涌现。对抗这种“信息疫情”已被确定为世界卫生组织的首要任务之一,因为虚假和误导性信息会导致一系列负面后果,包括传播错误疗法、阴谋论和仇外心理。本文旨在从多个方面对抗新冠“信息疫情”,包括确定信息的可信度、识别其对社会的潜在危害以及相关组织进行干预的必要性。我们提出了一种基于提示的课程学习方法来实现这一目标。该方法可以克服数据稀疏和类别不平衡问题带来的挑战。以在线社交媒体文本为输入,该模型可以通过回答一系列关于文本可靠性的问题,从多个角度验证内容。实验表明,提示调整和课程学习在评估新冠相关文本的可靠性方面是有效的。该方法优于典型的文本分类方法,包括fastText和BERT。此外,该方法对超参数设置具有鲁棒性,使其在基础设施资源有限的情况下更适用。