Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, 90007, USA.
School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
Sci Rep. 2021 Mar 12;11(1):5861. doi: 10.1038/s41598-021-84993-1.
Social media have emerged as increasingly popular means and environments for information gathering and propagation. This vigorous growth of social media contributed not only to a pandemic (fast-spreading and far-reaching) of rumors and misinformation, but also to an urgent need for text-based rumor detection strategies. To speed up the detection of misinformation, traditional rumor detection methods based on hand-crafted feature selection need to be replaced by automatic artificial intelligence (AI) approaches. AI decision making systems require to provide explanations in order to assure users of their trustworthiness. Inspired by the thriving development of generative adversarial networks (GANs) on text applications, we propose a GAN-based layered model for rumor detection with explanations. To demonstrate the universality of the proposed approach, we demonstrate its benefits on a gene classification with mutation detection case study. Similarly to the rumor detection, the gene classification can also be formulated as a text-based classification problem. Unlike fake news detection that needs a previously collected verified news database, our model provides explanations in rumor detection based on tweet-level texts only without referring to a verified news database. The layered structure of both generative and discriminative models contributes to the outstanding performance. The layered generators produce rumors by intelligently inserting controversial information in non-rumors, and force the layered discriminators to detect detailed glitches and deduce exactly which parts in the sentence are problematic. On average, in the rumor detection task, our proposed model outperforms state-of-the-art baselines on PHEME dataset by [Formula: see text] in terms of macro-f1. The excellent performance of our model for textural sequences is also demonstrated by the gene mutation case study on which it achieves [Formula: see text] macro-f1 score.
社交媒体已成为信息收集和传播日益流行的手段和环境。社交媒体的蓬勃发展不仅促成了谣言和错误信息的快速传播和广泛传播的大流行,也迫切需要基于文本的谣言检测策略。为了加快错误信息的检测,传统的基于手工特征选择的谣言检测方法需要被自动人工智能(AI)方法所取代。AI 决策系统需要提供解释,以确保用户对其可信度的信任。受文本应用中生成对抗网络(GAN)蓬勃发展的启发,我们提出了一种基于 GAN 的分层模型,用于具有解释的谣言检测。为了证明所提出方法的通用性,我们在基因突变检测案例研究中展示了其在基因分类中的优势。与需要预先收集的已验证新闻数据库的假新闻检测不同,我们的模型在谣言检测中仅基于推文级别的文本提供解释,而无需参考已验证的新闻数据库。生成和判别模型的分层结构有助于出色的性能。分层生成器通过在非谣言中智能地插入有争议的信息来制造谣言,并迫使分层判别器检测到详细的故障,并准确推断出句子中的哪些部分有问题。平均而言,在谣言检测任务中,与 PHEME 数据集上的最先进基线相比,我们提出的模型在宏观 f1 方面的表现提高了[Formula: see text]。我们的模型在文本序列方面的出色性能也通过基因突变案例研究得到了证明,它在基因突变案例研究中实现了[Formula: see text]的宏观 f1 得分。