Su Xing Yu, Suominen Hanna, Hanlen Leif
National ICT Australian, Canberra Research Laboratory, Australia.
Stud Health Technol Inform. 2011;168:150-7.
We aim to improve retrieval of health information from Twitter.
The popularity of social media and micro-blogs has emphasised their potential for knowledge discovery and trend building. However, capturing and relating concepts in these short-spoken and lexically extensive sources of information requires search engines with increasing intelligence.
Our approach uses query expansion techniques to associate query terms with the most similar Twitter terms to capture trends in the gamut of information.
We demonstrated the value, defined as improved precision, of our search engine by considering three search tasks and two independent annotators. We also showed the stability of the engine with an increasing number of tweets; this is crucial as large data sets are needed for capturing trends with high confidence. These results encourage us to continue developing the engine for discovering trends in health information available at Twitter.
我们旨在改进从推特获取健康信息的检索效果。
社交媒体和微博的普及凸显了它们在知识发现和趋势构建方面的潜力。然而,要在这些表述简短且词汇丰富的信息源中捕捉并关联概念,需要搜索引擎具备越来越高的智能。
我们的方法使用查询扩展技术,将查询词与最相似的推特词汇相关联,以捕捉信息范围内的趋势。
通过考虑三项搜索任务和两名独立注释者,我们证明了我们搜索引擎的价值,即提高了精确率。我们还展示了随着推文数量增加该引擎的稳定性;这一点至关重要,因为需要大量数据集才能高置信度地捕捉趋势。这些结果促使我们继续开发该引擎,以发现推特上可用的健康信息趋势。