School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China.
Comput Intell Neurosci. 2022 Sep 30;2022:7178818. doi: 10.1155/2022/7178818. eCollection 2022.
Question classification is an important component of the question answering system (QA system), which is designed to restrict the answer types and accurately locate the answers. Therefore, the classification results of the questions affect the quality and performance of the QA system. Most question classification methods in the past have relied on a large amount of manually labeled training data. However, in real situations, especially in new domains, it is very difficult to obtain a large amount of labeled data. Transfer learning is an effective approach to solve the problem with the scarcity of annotated data in new domains. We compare the effects of different deep transfer learning methods on cross-domain question classification. On the basis of the ALBERT fine-tuning model, we extract the category labels of the source domain, the question text, and the predicted category labels of the target domain as input to extract the category labels. Additionally, the semantic information of the category labels is extracted to achieve cross-domain question classification. Furthermore, WordNet is used to expand the question, which further improves the classification accuracy of the target domain. Experimental results show that the above methods can further improve the classification accuracy in new domains based on deep transfer learning.
问题分类是问答系统(QA 系统)的重要组成部分,旨在限制答案类型并准确定位答案。因此,问题的分类结果会影响 QA 系统的质量和性能。过去的大多数问题分类方法都依赖于大量的人工标注训练数据。然而,在实际情况下,尤其是在新领域,获取大量标注数据非常困难。迁移学习是解决新领域中注释数据稀缺问题的有效方法。我们比较了不同深度迁移学习方法对跨领域问题分类的效果。在 ALBERT 微调模型的基础上,我们提取源域的类别标签、问题文本和目标域的预测类别标签作为输入,提取类别标签。此外,还提取类别标签的语义信息,以实现跨领域问题分类。进一步地,使用 WordNet 扩展问题,进一步提高了目标域的分类准确性。实验结果表明,上述方法可以在深度迁移学习的基础上进一步提高新领域的分类准确性。