基于生成式预训练变换器解决基于机器阅读理解的问答中的文本歧义与语义模糊问题。

On solving textual ambiguities and semantic vagueness in MRC based question answering using generative pre-trained transformers.

作者信息

Ahmed Muzamil, Khan Hikmat, Iqbal Tassawar, Khaled Alarfaj Fawaz, Alomair Abdullah, Almusallam Naif

机构信息

Department of Computer Science, COMSATS University Islamabad, Wah Campus, Wah Cantt, Pakistan.

Department of Management Information Systems, School of Business, King Faisal University, Hofuf, Saudi Arabia.

出版信息

PeerJ Comput Sci. 2023 Jul 24;9:e1422. doi: 10.7717/peerj-cs.1422. eCollection 2023.

DOI:10.7717/peerj-cs.1422

PMID:37547420

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10403182/

Abstract

Machine reading comprehension (MRC) is one of the most challenging tasks and active fields in natural language processing (NLP). MRC systems aim to enable a machine to understand a given context in natural language and to answer a series of questions about it. With the advent of bi-directional deep learning algorithms and large-scale datasets, MRC achieved improved results. However, these models are still suffering from two research issues: textual ambiguities and semantic vagueness to comprehend the long passages and generate answers for abstractive MRC systems. To address these issues, this paper proposes a novel Extended Generative Pretrained Transformers-based Question Answering (ExtGPT-QA) model to generate precise and relevant answers to questions about a given context. The proposed architecture comprises two modified forms of encoder and decoder as compared to GPT. The encoder uses a positional encoder to assign a unique representation with each word in the sentence for reference to address the textual ambiguities. Subsequently, the decoder module involves a multi-head attention mechanism along with affine and aggregation layers to mitigate semantic vagueness with MRC systems. Additionally, we applied syntax and semantic feature engineering techniques to enhance the effectiveness of the proposed model. To validate the proposed model's effectiveness, a comprehensive empirical analysis is carried out using three benchmark datasets including SQuAD, Wiki-QA, and News-QA. The results of the proposed ExtGPT-QA outperformed state of art MRC techniques and obtained 93.25% and 90.52% F1-score and exact match, respectively. The results confirm the effectiveness of the ExtGPT-QA model to address textual ambiguities and semantic vagueness issues in MRC systems.

摘要

机器阅读理解（MRC）是自然语言处理（NLP）中最具挑战性的任务和活跃领域之一。MRC系统旨在使机器能够理解自然语言中的给定上下文，并回答一系列关于该上下文的问题。随着双向深度学习算法和大规模数据集的出现，MRC取得了更好的结果。然而，这些模型仍面临两个研究问题：文本歧义性和语义模糊性，难以理解长段落并为抽象MRC系统生成答案。为了解决这些问题，本文提出了一种基于扩展生成预训练变换器的新型问答（ExtGPT-QA）模型，以生成关于给定上下文问题的精确且相关的答案。与GPT相比，所提出的架构包括两种修改形式的编码器和解码器。编码器使用位置编码器为句子中的每个单词分配唯一表示以供参考，以解决文本歧义性。随后，解码器模块涉及多头注意力机制以及仿射和聚合层，以减轻MRC系统中的语义模糊性。此外，我们应用了句法和语义特征工程技术来提高所提出模型的有效性。为了验证所提出模型的有效性，使用包括SQuAD、Wiki-QA和News-QA在内的三个基准数据集进行了全面的实证分析。所提出的ExtGPT-QA的结果优于现有MRC技术，F1分数和精确匹配率分别达到93.25%和90.52%。结果证实了ExtGPT-QA模型在解决MRC系统中的文本歧义性和语义模糊性问题方面的有效性。