Suppr超能文献

从复杂网络视角分析关于热点事件的在线中文新闻标题的词汇

Words analysis of online Chinese news headlines about trending events: a complex network perspective.

作者信息

Li Huajiao, Fang Wei, An Haizhong, Huang Xuan

机构信息

School of Humanities and Economic Management, China University of Geosciences, Beijing, China; Key Laboratory of Carrying Capacity Assessment for Resource and Environment, Ministry of Land and Resources, Beijing, China; Department of Energy and Mineral Engineering in the College of Earth and Mineral Sciences, The Pennsylvania State University, State College, Pennsylvania, United States of America; Lab of Resources and Environmental Management, China University of Geosciences, Beijing, China.

School of Humanities and Economic Management, China University of Geosciences, Beijing, China; Key Laboratory of Carrying Capacity Assessment for Resource and Environment, Ministry of Land and Resources, Beijing, China; Lab of Resources and Environmental Management, China University of Geosciences, Beijing, China.

出版信息

PLoS One. 2015 Mar 25;10(3):e0122174. doi: 10.1371/journal.pone.0122174. eCollection 2015.

Abstract

Because the volume of information available online is growing at breakneck speed, keeping up with meaning and information communicated by the media and netizens is a new challenge both for scholars and for companies who must address public relations crises. Most current theories and tools are directed at identifying one website or one piece of online news and do not attempt to develop a rapid understanding of all websites and all news covering one topic. This paper represents an effort to integrate statistics, word segmentation, complex networks and visualization to analyze headlines' keywords and words relationships in online Chinese news using two samples: the 2011 Bohai Bay oil spill and the 2010 Gulf of Mexico oil spill. We gathered all the news headlines concerning the two trending events in the search results from Baidu, the most popular Chinese search engine. We used Simple Chinese Word Segmentation to segment all the headlines into words and then took words as nodes and considered adjacent relations as edges to construct word networks both using the whole sample and at the monthly level. Finally, we develop an integrated mechanism to analyze the features of words' networks based on news headlines that can account for all the keywords in the news about a particular event and therefore track the evolution of news deeply and rapidly.

摘要

由于网上可用信息的数量正以惊人的速度增长,对于学者以及必须应对公关危机的公司而言,跟上媒体和网民所传达的含义和信息是一项新挑战。当前大多数理论和工具都旨在识别一个网站或一则在线新闻,而并非尝试对涵盖某一主题的所有网站和所有新闻形成快速理解。本文致力于整合统计学、分词、复杂网络和可视化技术,利用两个样本——2011年渤海湾漏油事件和2010年墨西哥湾漏油事件,来分析中文在线新闻标题中的关键词及其词语关系。我们从中国最受欢迎的搜索引擎百度的搜索结果中收集了与这两起热门事件相关的所有新闻标题。我们使用简体中文分词工具将所有标题分词,然后将词语作为节点,并把相邻关系视为边,分别基于整个样本和月度数据构建词语网络。最后,我们开发了一种综合机制,用于基于新闻标题分析词语网络的特征,该机制能够涵盖特定事件新闻中的所有关键词,从而深入且快速地追踪新闻的演变。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5aa/4373827/3fdab96415cd/pone.0122174.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验