Suppr超能文献

万维网上与健康相关的搜索的流行程度如何?对互联网上搜索引擎查询进行定性和定量分析。

What is the prevalence of health-related searches on the World Wide Web? Qualitative and quantitative analysis of search engine queries on the internet.

作者信息

Eysenbach G, Kohler Ch

机构信息

Centre for Global eHealth Innovation, University Health Network, Toronto General Hospital, Canada.

出版信息

AMIA Annu Symp Proc. 2003;2003:225-9.

Abstract

While health information is often said to be the most sought after information on the web, empirical data on the actual frequency of health-related searches on the web are missing. In the present study we aimed to determine the prevalence of health-related searches on the web by analyzing search terms entered by people into popular search engines. We also made some preliminary attempts in qualitatively describing and classifying these searches. Occasional difficulties in determining what constitutes a "health-related" search led us to propose and validate a simple method to automatically classify a search string as "health-related". This method is based on determining the proportion of pages on the web containing the search string and the word "health", as a proportion of the total number of pages with the search string alone. Using human codings as gold standard we plotted a ROC curve and determined empirically that if this "co-occurance rate" is larger than 35%, the search string can be said to be health-related (sensitivity: 85.2%, specificity 80.4%). The results of our "human" codings of search queries determined that about 4.5% of all searches are "health-related". We estimate that globally a minimum of 6.75 Million health-related searches are being conducted on the web every day, which is roughly the same number of searches that have been conducted on the NLM Medlars system in 1996 in a full year.

摘要

虽然健康信息常被认为是网络上最受追捧的信息,但关于网络上与健康相关搜索的实际频率的实证数据却缺失。在本研究中,我们旨在通过分析人们在流行搜索引擎中输入的搜索词来确定网络上与健康相关搜索的流行程度。我们还在定性描述和分类这些搜索方面做了一些初步尝试。在确定什么构成“与健康相关”的搜索时偶尔遇到的困难促使我们提出并验证一种将搜索字符串自动分类为“与健康相关”的简单方法。该方法基于确定包含搜索字符串和“健康”一词的网页在仅包含搜索字符串的网页总数中所占的比例。以人工编码作为金标准,我们绘制了一条ROC曲线,并通过实证确定,如果这种“共现率”大于35%,则可以说搜索字符串与健康相关(敏感性:85.2%,特异性80.4%)。我们对搜索查询进行“人工”编码的结果表明,所有搜索中约4.5%是“与健康相关”的。我们估计,全球每天在网络上至少进行675万次与健康相关的搜索,这大致与1996年全年在NLM Medlars系统上进行的搜索次数相同。

相似文献

引用本文的文献

4
Reddit users' perspectives on radiofrequency ablation: A data analysis.Reddit用户对射频消融术的看法:一项数据分析。
Interv Pain Med. 2024 Dec 19;4(1):100535. doi: 10.1016/j.inpm.2024.100535. eCollection 2025 Mar.
9
Internet use by pregnant women during prenatal care.孕妇在产前护理期间使用互联网。
Einstein (Sao Paulo). 2024 Apr 8;22:eAO0447. doi: 10.31744/einstein_journal/2024AO0447. eCollection 2024.

本文引用的文献

5
Consumer health informatics.消费者健康信息学
BMJ. 2000 Jun 24;320(7251):1713-6. doi: 10.1136/bmj.320.7251.1713.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验