Trattner Christoph, Parra Denis, Elsweiler David
Department of New Media Technology, MODUL University Vienna, Vienna, Austria.
Departamento de Ciencia de la Computación, Pontificia Universidad Católica de Chile, Santiago, Chile.
PLoS One. 2017 Jun 21;12(6):e0179144. doi: 10.1371/journal.pone.0179144. eCollection 2017.
Studying the impact of food consumption on people's health is a serious matter for its implications on public policy, but it has traditionally been a slow process since it requires information gathered through expensive collection processes such as surveys, census and systematic reviews of research articles. We argue that this process could be supported and hastened using data collected via online social networks. In this work we investigate the relationships between the online traces left behind by users of a large US online food community and the prevalence of obesity in 47 states and 311 counties in the US. Using data associated with the recipes bookmarked over an 9-year period by 144,839 users of the Allrecipes.com food website residing throughout the US, several hierarchical regression models are created to (i) shed light on these relations and (ii) establish their magnitude. The results of our analysis provide strong evidence that bookmarking activities on recipes in online food communities can provide a signal allowing food and health related issues, such as obesity to be better understood and monitored. We discover that higher fat and sugar content in bookmarked recipes is associated with higher rates of obesity. The dataset is complicated, but strong temporal and geographical trends are identifiable. We show the importance of accounting for these trends in the modeling process.
研究食物消费对人们健康的影响是一件严肃的事情,因为它对公共政策有影响,但传统上这是一个缓慢的过程,因为它需要通过诸如调查、人口普查和对研究文章的系统综述等昂贵的收集过程来收集信息。我们认为,利用通过在线社交网络收集的数据可以支持并加速这一过程。在这项工作中,我们研究了美国一个大型在线食品社区用户留下的在线痕迹与美国47个州和311个县的肥胖患病率之间的关系。利用与全美Allrecipes.com食品网站的144,839名用户在9年时间里收藏的食谱相关的数据,创建了几个层次回归模型,以(i)阐明这些关系,(ii)确定它们的强度。我们的分析结果提供了有力证据,表明在线食品社区中食谱的收藏活动可以提供一个信号,使肥胖等与食品和健康相关的问题得到更好的理解和监测。我们发现,收藏食谱中较高的脂肪和糖含量与较高的肥胖率相关。数据集很复杂,但可以识别出强烈的时间和地理趋势。我们展示了在建模过程中考虑这些趋势的重要性。