Ma Ting-Huai, Yu Xin, Rong Huan
School of Computer Science, Nanjing University of Information Science & Technology, Nanjing 210044, China.
School of Artificial Intelligence (School of Future Technology), Nanjing University of Information Science & Technology, Nanjing 210044, China.
Math Biosci Eng. 2023 Jan;20(1):1195-1128. doi: 10.3934/mbe.2023055. Epub 2022 Oct 26.
Most current deep learning-based news headline generation models only target domain-specific news data. When a new news domain appears, it is usually costly to obtain a large amount of data with reference truth on the new domain for model training, so text generation models trained by traditional supervised approaches often do not generalize well on the new domain-inspired by the idea of transfer learning, this paper designs a cross-domain transfer text generation method based on domain data distribution alignment, intermediate domain redistribution, and zero-shot learning semantic prototype transduction, focusing on the data problem with no reference truth in the target domain. Eventually, the model can be guided by the most relevant source domain data to generate headlines from the target domain news text through the semantic correlation between source and target domain data during the training process of generating headlines for the target domain news, even without any reference truth of the news headlines in the target domain, which improves the usability of the text generation model in real scenarios. The experimental results show that the proposed transfer text generation method has a good domain transfer effect and outperforms other existing transfer text generation methods in various text generation evaluation indexes, proving the proposed method's effectiveness in this paper.
当前大多数基于深度学习的新闻标题生成模型仅针对特定领域的新闻数据。当出现新的新闻领域时,为模型训练获取大量具有新领域参考真值的数据通常成本很高,因此通过传统监督方法训练的文本生成模型在新领域上往往泛化能力不佳。受迁移学习思想的启发,本文设计了一种基于领域数据分布对齐、中间领域重新分布和零样本学习语义原型转换的跨领域迁移文本生成方法,重点解决目标领域中没有参考真值的数据问题。最终,在为目标领域新闻生成标题的训练过程中,模型可以通过源域和目标域数据之间的语义相关性,由最相关的源域数据引导,从目标域新闻文本中生成标题,即使目标域中没有新闻标题的任何参考真值也能如此,这提高了文本生成模型在实际场景中的可用性。实验结果表明,所提出的迁移文本生成方法具有良好的领域迁移效果,在各种文本生成评估指标上均优于其他现有的迁移文本生成方法,证明了本文所提方法的有效性。