Varshney Deepika, Vishwakarma Dinesh Kumar
Biometric Research Laboratory, Department of Information Technology, Delhi Technological University, Delhi 110042, India.
Data Knowl Eng. 2023 Jan;143:102103. doi: 10.1016/j.datak.2022.102103. Epub 2022 Nov 11.
The spreading of misleading information on social web platforms has fuelled massive panic and confusion among the public regarding the Corona disease, the detection of which is of paramount importance. Previous studies mainly relied on a specific web platform to collect crucial evidence to detect fake content. The analysis identifies that retrieving clues from two or more different sources/web platforms gives more reliable prediction and confidence concerning a specific claim. This study proposed a novel multi-web platform voting framework that incorporates 4 sets of novel features: content, linguistic, similarity, and sentiments. The features have been gathered from each web-platforms to validate the news. To validate the fact/claim, a unique source platform is designed to collect relevant clues/headlines from two web platforms (YouTube, Google) based on specific queries and extracted features concerning each clue/headline. The proposed idea is to incorporate a unique platform to assist researchers in gathering relevant and vital evidence from diverse web platforms. After evaluation and validation, it has been identified that the built model is quite intelligent, gives promising results, and effectively predicts misleading information. The model correctly detected about 98% of the COVID misinformation on the constraint Covid-19 fake news dataset. Furthermore, it is observed that it is efficient to gather clues from multiple web platforms for more reliable predictions to validate the news. The suggested work depicts numerous practical applications for health policy-makers and practitioners that could be useful in safeguarding and implicating awareness among society from misleading information dissemination during this pandemic.
社交网络平台上误导性信息的传播引发了公众对新冠疾病的大规模恐慌和困惑,而新冠疾病的检测至关重要。以往的研究主要依赖于特定的网络平台来收集关键证据以检测虚假内容。分析发现,从两个或更多不同来源/网络平台检索线索,对于特定的说法能给出更可靠的预测和更高的可信度。本研究提出了一种新颖的多网络平台投票框架,该框架纳入了4组新颖的特征:内容、语言、相似度和情感。这些特征是从每个网络平台收集而来,用于验证新闻。为了验证事实/说法,设计了一个独特的源平台,根据特定查询从两个网络平台(YouTube、谷歌)收集相关线索/标题,并提取与每个线索/标题相关的特征。所提出的想法是纳入一个独特的平台,以协助研究人员从不同的网络平台收集相关且重要的证据。经过评估和验证,已确定所构建的模型相当智能,能给出有前景的结果,并能有效预测误导性信息。该模型在受限的新冠-19假新闻数据集上正确检测出了约98%的新冠错误信息。此外,还观察到从多个网络平台收集线索以进行更可靠的预测来验证新闻是有效的。所建议的工作为卫生政策制定者和从业者描绘了众多实际应用,这在这场疫情期间保护社会并提高人们对误导性信息传播的认识方面可能会很有用。