School of Electronic and Information Engineering, Changchun University, Changchun 130022, China.
School of Media, Qufu Normal University, Rizhao 276826, China.
J Environ Public Health. 2022 Jul 13;2022:1166989. doi: 10.1155/2022/1166989. eCollection 2022.
With the rapid development of we-media information dissemination, WeChat official accounts platform has become an important way for people to obtain health related knowledge. However, the platform information is redundant, miscellaneous, and overloaded. In order to meet the increasingly accurate and efficient knowledge service needs of users, reorganizing and aggregating document knowledge resources is effective. If we use the way of artificial recognition to filter information, it will inevitably cause huge labor and time cost, and the effect is very little in front of massive articles. This paper proposes a text summarization method for the WeChat platform based on improved TextRank that takes into account both user demands and sentence features during the summarization process. The data source crawled from the Sogou WeChat platform. The results show that the TextRank algorithm has obvious hints on the accuracy of text summarization extraction after fusing the Word2vec model. The improved TextRank method, integrating user demands and sentence features into the model, makes the results of text summarization closer to the theme of the article and more able to meet the user demand. According to the complexity of the algorithm, this method is not suitable for the automatic summarization of long or multiple documents.
随着自媒体信息传播的飞速发展,微信公众号平台已成为人们获取健康相关知识的重要途径。然而,平台信息冗余、繁杂、过载。为了满足用户日益精确和高效的知识服务需求,对文档知识资源进行重组和聚合是有效的。如果我们采用人工识别的方式来过滤信息,必然会造成巨大的劳动和时间成本,而且在海量文章面前效果微乎其微。本文提出了一种基于改进的 TextRank 的微信平台文本摘要方法,该方法在摘要过程中同时考虑了用户需求和句子特征。数据来源于搜狗微信平台的抓取。结果表明,在融合 Word2vec 模型后,TextRank 算法对文本摘要提取的准确性有明显提示。改进的 TextRank 方法将用户需求和句子特征集成到模型中,使文本摘要的结果更贴近文章主题,更能满足用户需求。根据算法的复杂性,该方法不适合对长文档或多个文档进行自动摘要。