University of Texas School of Biomedical Informatics, 7000 Fannin St Suite 600, Houston, TX, 77030, USA.
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):69. doi: 10.1186/s12911-017-0469-6.
As one of the serious public health issues, vaccination refusal has been attracting more and more attention, especially for newly approved human papillomavirus (HPV) vaccines. Understanding public opinion towards HPV vaccines, especially concerns on social media, is of significant importance for HPV vaccination promotion.
In this study, we leveraged a hierarchical machine learning based sentiment analysis system to extract public opinions towards HPV vaccines from Twitter. English tweets containing HPV vaccines-related keywords were collected from November 2, 2015 to March 28, 2016. Manual annotation was done to evaluate the performance of the system on the unannotated tweets corpus. Followed time series analysis was applied to this corpus to track the trends of machine-deduced sentiments and their associations with different days of the week.
The evaluation of the unannotated tweets corpus showed that the micro-averaging F scores have reached 0.786. The learning system deduced the sentiment labels for 184,214 tweets in the collected unannotated tweets corpus. Time series analysis identified a coincidence between mainstream outcome and Twitter contents. A weak trend was found for "Negative" tweets that decreased firstly and began to increase later; an opposite trend was identified for "Positive" tweets. Tweets that contain the worries on efficacy for HPV vaccines showed a relative significant decreasing trend. Strong associations were found between some sentiments ("Positive", "Negative", "Negative-Safety" and "Negative-Others") with different days of the week.
Our efforts on sentiment analysis for newly approved HPV vaccines provide us an automatic and instant way to extract public opinion and understand the concerns on Twitter. Our approaches can provide a feedback to public health professionals to monitor online public response, examine the effectiveness of their HPV vaccination promotion strategies and adjust their promotion plans.
疫苗接种拒绝作为一个严重的公共卫生问题,越来越受到关注,特别是对于新批准的人乳头瘤病毒(HPV)疫苗。了解公众对 HPV 疫苗的看法,特别是在社交媒体上的关注点,对于 HPV 疫苗接种推广具有重要意义。
本研究利用基于分层机器学习的情感分析系统从 Twitter 上提取公众对 HPV 疫苗的看法。从 2015 年 11 月 2 日至 2016 年 3 月 28 日,收集包含 HPV 疫苗相关关键词的英语推文。对未注释的推文语料库进行手动注释,以评估系统的性能。对该语料库进行了时间序列分析,以跟踪机器推断的情绪趋势及其与一周中不同日子的关联。
对未注释的推文语料库的评估表明,微平均 F 分数达到 0.786。学习系统为收集的未注释推文语料库中的 184,214 条推文推断了情绪标签。时间序列分析发现主流结果和 Twitter 内容之间存在巧合。“负面”推文呈先下降后上升的趋势;“正面”推文则相反。包含 HPV 疫苗功效担忧的推文显示出相对显著的下降趋势。HPV 疫苗的一些情绪(“正面”、“负面”、“负面-安全”和“负面-其他”)与一周中的不同日子之间存在强烈关联。
我们对新批准的 HPV 疫苗的情感分析工作为我们提供了一种自动即时的方式,从 Twitter 上提取公众意见并了解他们的关注点。我们的方法可以为公共卫生专业人员提供反馈,监测在线公众反应,检查其 HPV 疫苗推广策略的有效性,并调整其推广计划。