Abdalsalam Mohammed, Li Chunlin, Dahou Abdelghani, Kryvinska Natalia
School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, Hubei, China.
School of Computer Science and Information Technology, Sudan University of Science and Technology, Khartoum, Khartoum, Sudan.
PeerJ Comput Sci. 2024 Sep 20;10:e2252. doi: 10.7717/peerj-cs.2252. eCollection 2024.
The world faces the ongoing challenge of terrorism and extremism, which threaten the stability of nations, the security of their citizens, and the integrity of political, economic, and social systems. Given the complexity and multifaceted nature of this phenomenon, combating it requires a collective effort, with tailored methods to address its various aspects. Identifying the terrorist organization responsible for an attack is a critical step in combating terrorism. Historical data plays a pivotal role in this process, providing insights that can inform prevention and response strategies. With advancements in technology and artificial intelligence (AI), particularly in military applications, there is growing interest in utilizing these developments to enhance national and regional security against terrorism. Central to this effort are terrorism databases, which serve as rich resources for data on armed organizations, extremist entities, and terrorist incidents. The Global Terrorism Database (GTD) stands out as one of the most widely used and accessible resources for researchers. Recent progress in machine learning (ML), deep learning (DL), and natural language processing (NLP) offers promising avenues for improving the identification and classification of terrorist organizations. This study introduces a framework designed to classify and predict terrorist groups using bidirectional recurrent units and self-attention mechanisms, referred to as BiGRU-SA. This approach utilizes the comprehensive data in the GTD by integrating textual features extracted by DistilBERT with features that show a high correlation with terrorist organizations. Additionally, the Synthetic Minority Over-sampling Technique with Tomek links (SMOTE-T) was employed to address data imbalance and enhance the robustness of our predictions. The BiGRU-SA model captures temporal dependencies and contextual information within the data. By processing data sequences in both forward and reverse directions, BiGRU-SA offers a comprehensive view of the temporal dynamics, significantly enhancing classification accuracy. To evaluate the effectiveness of our framework, we compared ten models, including six traditional ML models and four DL algorithms. The proposed BiGRU-SA framework demonstrated outstanding performance in classifying 36 terrorist organizations responsible for terrorist attacks, achieving an accuracy of 98.68%, precision of 96.06%, sensitivity of 96.83%, specificity of 99.50%, and a Matthews correlation coefficient of 97.50%. Compared to state-of-the-art methods, the proposed model outperformed others, confirming its effectiveness and accuracy in the classification and prediction of terrorist organizations.
世界面临着恐怖主义和极端主义这一持续的挑战,它们威胁着国家的稳定、公民的安全以及政治、经济和社会体系的完整性。鉴于这一现象的复杂性和多面性,打击它需要集体努力,并采用量身定制的方法来应对其各个方面。确定对袭击负责的恐怖组织是打击恐怖主义的关键一步。历史数据在这一过程中起着关键作用,提供的见解可为预防和应对策略提供参考。随着技术和人工智能(AI)的进步,特别是在军事应用方面,人们越来越有兴趣利用这些发展来加强国家和地区反恐安全。这项工作的核心是恐怖主义数据库,它是有关武装组织、极端主义实体和恐怖事件数据的丰富资源。全球恐怖主义数据库(GTD)是研究人员使用最广泛且最易获取的资源之一。机器学习(ML)、深度学习(DL)和自然语言处理(NLP)方面的最新进展为改进恐怖组织的识别和分类提供了有前景的途径。本研究引入了一个旨在使用双向循环单元和自注意力机制对恐怖组织进行分类和预测的框架,称为BiGRU-SA。这种方法通过将DistilBERT提取的文本特征与与恐怖组织高度相关的特征相结合,利用了GTD中的全面数据。此外,采用带托梅克链接的合成少数过采样技术(SMOTE-T)来解决数据不平衡问题,并提高我们预测的稳健性。BiGRU-SA模型捕捉数据中的时间依赖性和上下文信息。通过向前和向后处理数据序列,BiGRU-SA提供了时间动态的全面视图,显著提高了分类准确率。为了评估我们框架的有效性,我们比较了十个模型,包括六个传统ML模型和四个DL算法。所提出的BiGRU-SA框架在对36个对恐怖袭击负责的恐怖组织进行分类时表现出色,准确率达到98.68%,精确率为96.06%,灵敏度为96.83%,特异性为99.50%,马修斯相关系数为97.50%。与现有最先进的方法相比,所提出的模型表现优于其他模型,证实了其在恐怖组织分类和预测中的有效性和准确性。