基于猫群优化算法的单文档文本摘要

Debnath Dipanwita, Das Ranjita, Pakray Partha

Mizoram, 796012 India National Institute of Technology Mizoram.

Assam, 788010 India National Institute of Technology Silchar.

Appl Intell (Dordr). 2023;53(10):12268-12287. doi: 10.1007/s10489-022-04149-0. Epub 2022 Sep 24.

The availability of a tremendous amount of online information bringing about a broad interest in extracting relevant information in a compact and meaningful way, prompted the need for automatic text summarization. Hence, in the proposed system, the automated text summarization has been considered as an extractive single-document summarization problem, and a Cat Swarm Optimization (CSO) algorithm-based approach is proposed to solve it, whose objective is to generate good summaries in terms of content coverage, informative, anti-redundancy, and readability. In this work, input documents are pre-processed first. Then the cat population is initialized, where each individual (cat) in a binary vector is randomly initialized in the search space, considering the constraint. The objective function is then formulated considering different sentence quality measures. The Best Cat Memory Pool (BCMP) is initialized based on the objective function score. After that, individuals are randomly distributed for position updating to perform seeking/tracing mode operations based on the mixture ratio in each iteration. BCMP is also updated accordingly. Finally, an optimal individual is chosen to generate the summary after the last iteration. DUC-2001 and DUC-2002 data sets and ROUGE measures are used for system evaluation, and the obtained results are compared with the various state-of-the-art methods. We have achieved approximately 25% and 5% improvement on ROUGE-1 and ROUGE-2 scores on the datasets over the best existing method mentioned in this paper, revealing the proposed method's superiority. The proposed system is also evaluated considering the generational distance, CPU processing time, cohesion, and readability factor, reflecting that the system-generated summaries are readable, concise, relevant, and fast. We have also conducted a two-sample t-test, and one-way ANOVA test showing the proposed approach is statistically significant.

大量在线信息的可获取性引发了人们对以紧凑且有意义的方式提取相关信息的广泛兴趣，这促使了自动文本摘要技术的需求。因此，在该系统中，自动文本摘要被视为一个抽取式单文档摘要问题，并提出了一种基于猫群优化（CSO）算法的方法来解决它，其目标是在内容覆盖、信息性、抗冗余性和可读性方面生成高质量的摘要。在这项工作中，首先对输入文档进行预处理。然后初始化猫群，其中二进制向量中的每个个体（猫）在考虑约束的情况下在搜索空间中随机初始化。接着根据不同的句子质量度量来制定目标函数。基于目标函数得分初始化最佳猫记忆池（BCMP）。之后，个体根据每次迭代中的混合比例随机分布以进行位置更新，从而执行搜索/追踪模式操作。BCMP也相应地更新。最后，在最后一次迭代后选择最优个体来生成摘要。使用DUC - 2001和DUC - 2002数据集以及ROUGE度量进行系统评估，并将所得结果与各种现有最优方法进行比较。在这些数据集上，我们在ROUGE - 1和ROUGE - 2得分上比本文提及的最佳现有方法分别提高了约25%和5%，这表明了所提方法的优越性。还从生成距离、CPU处理时间、连贯性和可读性因素等方面对所提系统进行了评估，结果表明系统生成的摘要具有可读性、简洁性、相关性且速度快。我们还进行了双样本t检验和单因素方差分析测试，结果表明所提方法具有统计学意义。

相似文献

Single document text summarization addressed with a cat swarm optimization approach.

Appl Intell (Dordr). 2023;53(10):12268-12287. doi: 10.1007/s10489-022-04149-0. Epub 2022 Sep 24.

Extractive single document summarization using binary differential evolution: Optimization of different sentence quality measures.

PLoS One. 2019 Nov 14;14(11):e0223477. doi: 10.1371/journal.pone.0223477. eCollection 2019.

CERC: an interactive content extraction, recognition, and construction tool for clinical and biomedical text.

BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 14):306. doi: 10.1186/s12911-020-01330-8.

Reaching for upper bound ROUGE score of extractive summarization methods.

PeerJ Comput Sci. 2022 Sep 26;8:e1103. doi: 10.7717/peerj-cs.1103. eCollection 2022.

Exploiting Intersentence Information for Better Question-Driven Abstractive Summarization: Algorithm Development and Validation.

JMIR Med Inform. 2022 Aug 15;10(8):e38052. doi: 10.2196/38052.

Extractive summarization of clinical trial descriptions.

Int J Med Inform. 2019 Sep;129:114-121. doi: 10.1016/j.ijmedinf.2019.05.019. Epub 2019 May 30.

Quantifying the informativeness for biomedical literature summarization: An itemset mining method.

Comput Methods Programs Biomed. 2017 Jul;146:77-89. doi: 10.1016/j.cmpb.2017.05.011. Epub 2017 May 27.

Graph-based extractive text summarization method for Hausa text.

PLoS One. 2023 May 9;18(5):e0285376. doi: 10.1371/journal.pone.0285376. eCollection 2023.

CovSumm: an unsupervised transformer-cum-graph-based hybrid document summarization model for CORD-19.

J Supercomput. 2023 Apr 26:1-23. doi: 10.1007/s11227-023-05291-3.

User-Oriented Summaries Using a PSO Based Scoring Optimization Method.

Entropy (Basel). 2019 Jun 22;21(6):617. doi: 10.3390/e21060617.

引用本文的文献

Advanced multiple document summarization iterative recursive transformer networks and multimodal transformer.

PeerJ Comput Sci. 2024 Dec 9;10:e2463. doi: 10.7717/peerj-cs.2463. eCollection 2024.

Integrating particle swarm optimization with backtracking search optimization feature extraction with two-dimensional convolutional neural network and attention-based stacked bidirectional long short-term memory classifier for effective single and multi-document summarization.

PeerJ Comput Sci. 2024 Dec 12;10:e2435. doi: 10.7717/peerj-cs.2435. eCollection 2024.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Single document text summarization addressed with a cat swarm optimization approach.

Appl Intell (Dordr). 2023;53(10):12268-12287. doi: 10.1007/s10489-022-04149-0. Epub 2022 Sep 24.

Extractive single document summarization using binary differential evolution: Optimization of different sentence quality measures.

PLoS One. 2019 Nov 14;14(11):e0223477. doi: 10.1371/journal.pone.0223477. eCollection 2019.

CERC: an interactive content extraction, recognition, and construction tool for clinical and biomedical text.

BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 14):306. doi: 10.1186/s12911-020-01330-8.

Reaching for upper bound ROUGE score of extractive summarization methods.

PeerJ Comput Sci. 2022 Sep 26;8:e1103. doi: 10.7717/peerj-cs.1103. eCollection 2022.

Exploiting Intersentence Information for Better Question-Driven Abstractive Summarization: Algorithm Development and Validation.

JMIR Med Inform. 2022 Aug 15;10(8):e38052. doi: 10.2196/38052.

Extractive summarization of clinical trial descriptions.

Int J Med Inform. 2019 Sep;129:114-121. doi: 10.1016/j.ijmedinf.2019.05.019. Epub 2019 May 30.

Quantifying the informativeness for biomedical literature summarization: An itemset mining method.

Comput Methods Programs Biomed. 2017 Jul;146:77-89. doi: 10.1016/j.cmpb.2017.05.011. Epub 2017 May 27.

Graph-based extractive text summarization method for Hausa text.

PLoS One. 2023 May 9;18(5):e0285376. doi: 10.1371/journal.pone.0285376. eCollection 2023.

CovSumm: an unsupervised transformer-cum-graph-based hybrid document summarization model for CORD-19.

J Supercomput. 2023 Apr 26:1-23. doi: 10.1007/s11227-023-05291-3.

User-Oriented Summaries Using a PSO Based Scoring Optimization Method.

Entropy (Basel). 2019 Jun 22;21(6):617. doi: 10.3390/e21060617.

引用本文的文献

Advanced multiple document summarization iterative recursive transformer networks and multimodal transformer.

PeerJ Comput Sci. 2024 Dec 9;10:e2463. doi: 10.7717/peerj-cs.2463. eCollection 2024.

PeerJ Comput Sci. 2024 Dec 12;10:e2435. doi: 10.7717/peerj-cs.2435. eCollection 2024.

Single document text summarization addressed with a cat swarm optimization approach.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献