School of Information and Electronic Engineering(Sussex Artificial Intelligence Institute), Zhejiang Gongshang University, Hangzhou, Zhejiang Province, China.
PLoS One. 2024 Apr 16;19(4):e0302104. doi: 10.1371/journal.pone.0302104. eCollection 2024.
The explosive growth of dialogue data has aroused significant interest among scholars in abstractive dialogue summarization. In this paper, we propose a novel sequence-to-sequence framework called DS-SS (Dialogue Summarization with Factual-Statement Fusion and Dialogue Segmentation) for summarizing dialogues. The novelty of the DS-SS framework mainly lies in two aspects: 1) Factual statements are extracted from the source dialogue and combined with the source dialogue to perform the further dialogue encoding; and 2) A dialogue segmenter is trained and used to separate a dialogue to be encoded into several topic-coherent segments. Thanks to these two aspects, the proposed framework may better encode dialogues, thereby generating summaries exhibiting higher factual consistency and informativeness. Experimental results on two large-scale datasets SAMSum and DialogSum demonstrate the superiority of our framework over strong baselines, as evidenced by both automatic evaluation metrics and human evaluation.
对话数据的爆炸式增长引起了学者们对抽象对话总结的极大兴趣。在本文中,我们提出了一种名为 DS-SS(基于事实陈述融合和对话分割的对话总结)的新型序列到序列框架,用于总结对话。DS-SS 框架的新颖之处主要在于两个方面:1)从源对话中提取事实陈述,并将其与源对话结合以进一步进行对话编码;2)训练并使用对话分割器将待编码的对话分割成几个主题一致的片段。由于这两个方面,所提出的框架可以更好地对对话进行编码,从而生成具有更高事实一致性和信息量的摘要。在两个大规模数据集 SAMSum 和 DialogSum 上的实验结果表明,我们的框架优于强大的基线,这一点可以通过自动评估指标和人工评估得到证明。