• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在互联网上搜索癌症信息:分析自然语言搜索查询

Searching for cancer information on the internet: analyzing natural language search queries.

作者信息

Bader Judith L, Theofanos Mary Frances

机构信息

National Cancer Institute, Office of Communications, Cancer Information Products and Services, Communications Technology Branch, Bethesda, MD 20852, USA.

出版信息

J Med Internet Res. 2003 Dec 11;5(4):e31. doi: 10.2196/jmir.5.4.e31.

DOI:10.2196/jmir.5.4.e31
PMID:14713659
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1550578/
Abstract

BACKGROUND

Searching for health information is one of the most-common tasks performed by Internet users. Many users begin searching on popular search engines rather than on prominent health information sites. We know that many visitors to our (National Cancer Institute) Web site, cancer.gov, arrive via links in search engine result.

OBJECTIVE

To learn more about the specific needs of our general-public users, we wanted to understand what lay users really wanted to know about cancer, how they phrased their questions, and how much detail they used.

METHODS

The National Cancer Institute partnered with AskJeeves, Inc to develop a methodology to capture, sample, and analyze 3 months of cancer-related queries on the Ask.com Web site, a prominent United States consumer search engine, which receives over 35 million queries per week. Using a benchmark set of 500 terms and word roots supplied by the National Cancer Institute, AskJeeves identified a test sample of cancer queries for 1 week in August 2001. From these 500 terms only 37 appeared >or= 5 times/day over the trial test week in 17208 queries. Using these 37 terms, 204165 instances of cancer queries were found in the Ask.com query logs for the actual test period of June-August 2001. Of these, 7500 individual user questions were randomly selected for detailed analysis and assigned to appropriate categories. The exact language of sample queries is presented.

RESULTS

Considering multiples of the same questions, the sample of 7500 individual user queries represented 76077 queries (37% of the total 3-month pool). Overall 78.37% of sampled Cancer queries asked about 14 specific cancer types. Within each cancer type, queries were sorted into appropriate subcategories including at least the following: General Information, Symptoms, Diagnosis and Testing, Treatment, Statistics, Definition, and Cause/Risk/Link. The most-common specific cancer types mentioned in queries were Digestive/Gastrointestinal/Bowel (15.0%), Breast (11.7%), Skin (11.3%), and Genitourinary (10.5%). Additional subcategories of queries about specific cancer types varied, depending on user input. Queries that were not specific to a cancer type were also tracked and categorized.

CONCLUSIONS

Natural-language searching affords users the opportunity to fully express their information needs and can aid users naïve to the content and vocabulary. The specific queries analyzed for this study reflect news and research studies reported during the study dates and would surely change with different study dates. Analyzing queries from search engines represents one way of knowing what kinds of content to provide to users of a given Web site. Users ask questions using whole sentences and keywords, often misspelling words. Providing the option for natural-language searching does not obviate the need for good information architecture, usability engineering, and user testing in order to optimize user experience.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8218/1550578/5096504a7f81/jmir_v5i4e31_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8218/1550578/f5cbc5705e3a/jmir_v5i4e31_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8218/1550578/5096504a7f81/jmir_v5i4e31_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8218/1550578/f5cbc5705e3a/jmir_v5i4e31_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8218/1550578/5096504a7f81/jmir_v5i4e31_fig2.jpg
摘要

背景

搜索健康信息是互联网用户最常进行的任务之一。许多用户从流行的搜索引擎开始搜索,而非著名的健康信息网站。我们知道,很多访问我们(美国国家癌症研究所)网站cancer.gov的用户是通过搜索引擎结果中的链接而来。

目的

为了更多地了解普通大众用户的具体需求,我们想知道普通用户真正想了解关于癌症的哪些方面、他们如何表述问题以及他们使用的细节程度。

方法

美国国家癌症研究所与AskJeeves公司合作,开发一种方法来收集、抽样和分析Ask.com网站上三个月内与癌症相关的查询。Ask.com是美国一个著名的消费者搜索引擎,每周接收超过3500万个查询。利用美国国家癌症研究所提供的500个术语和词根作为基准,AskJeeves在2001年8月确定了为期一周的癌症查询测试样本。在17208条查询中,在测试周里,这500个术语中只有37个出现次数≥5次/天。使用这37个术语,在2001年6月至8月的实际测试期间,在Ask.com的查询日志中发现了204165个癌症查询实例。其中,随机选择了7500个用户的问题进行详细分析,并归入适当的类别。给出了样本查询的确切表述。

结果

考虑到相同问题的多个实例,7500个用户查询样本代表了76077条查询(占三个月总查询量的37%)。总体而言,78.37%的抽样癌症查询是关于14种特定癌症类型的。在每种癌症类型中,查询被归入适当的子类别,至少包括以下方面:一般信息、症状、诊断与检测、治疗、统计数据、定义以及病因/风险/关联。查询中提及最多的特定癌症类型是消化/胃肠/肠道(15.0%)、乳腺(11.7%)、皮肤(11.3%)和泌尿生殖系统(10.5%)。关于特定癌症类型的查询的其他子类别因用户输入而异。非特定于某种癌症类型的查询也被跟踪和分类。

结论

自然语言搜索使用户有机会充分表达他们的信息需求,并且可以帮助不熟悉相关内容和词汇的用户。本研究分析的具体查询反映了研究期间报道的新闻和研究,并且肯定会随不同的研究日期而变化。分析来自搜索引擎的查询是了解应向特定网站用户提供何种内容的一种方式。用户使用完整句子和关键词提问,经常拼错单词。提供自然语言搜索选项并不能消除对良好信息架构、可用性工程和用户测试的需求,以便优化用户体验。

相似文献

1
Searching for cancer information on the internet: analyzing natural language search queries.在互联网上搜索癌症信息:分析自然语言搜索查询
J Med Internet Res. 2003 Dec 11;5(4):e31. doi: 10.2196/jmir.5.4.e31.
2
A study of medical and health queries to web search engines.一项关于向网络搜索引擎提出的医疗卫生问题的研究。
Health Info Libr J. 2004 Mar;21(1):44-51. doi: 10.1111/j.1471-1842.2004.00481.x.
3
Use of Internet audience measurement data to gauge market share for online health information services.利用互联网受众测量数据来评估在线健康信息服务的市场份额。
J Med Internet Res. 2005 Jul 1;7(3):e31. doi: 10.2196/jmir.7.3.e31.
4
The zeitgeist of online health search. Implications for a consumer-centric health system.在线健康搜索的时代精神。对以消费者为中心的卫生系统的影响。
J Gen Intern Med. 2006 Feb;21(2):134-9. doi: 10.1111/j.1525-1497.2005.0295.x. Epub 2005 Dec 7.
5
Language preferences on websites and in Google searches for human health and food information.网站以及谷歌搜索中关于人类健康和食品信息的语言偏好。
J Med Internet Res. 2007 Jun 28;9(2):e18. doi: 10.2196/jmir.9.2.e18.
6
Evaluation of new multimedia formats for cancer communications.癌症传播新多媒体形式的评估。
J Med Internet Res. 2003 Jul-Sep;5(3):e16. doi: 10.2196/jmir.5.3.e16. Epub 2003 Aug 29.
7
Comparing image search behaviour in the ARRS GoldMiner search engine and a clinical PACS/RIS.比较ARRS GoldMiner搜索引擎与临床PACS/RIS中的图像搜索行为。
J Biomed Inform. 2015 Aug;56:57-64. doi: 10.1016/j.jbi.2015.04.013. Epub 2015 May 19.
8
Semantics and the medical web: a review of barriers and breakthroughs in effective healthcare query.语义学与医学网络:有效医疗查询中的障碍与突破综述
Health Info Libr J. 2004 Jun;21(2):109-16. doi: 10.1111/j.1471-1842.2004.00491.x.
9
What are patients seeking when they turn to the Internet? Qualitative content analysis of questions asked by visitors to an orthopaedics Web site.患者在求助于互联网时寻求的是什么?对一个骨科网站访问者所提问题的定性内容分析。
J Med Internet Res. 2003 Oct 10;5(4):e24. doi: 10.2196/jmir.5.4.e24.
10
A first look at HealthCyberMap medical semantic subject search engine.初窥HealthCyberMap医学语义主题搜索引擎。
Technol Health Care. 2004;12(1):33-41.

引用本文的文献

1
Applications of Natural Language Processing in Otolaryngology: A Scoping Review.自然语言处理在耳鼻咽喉科的应用:一项范围综述
Laryngoscope. 2025 Sep;135(9):3049-3063. doi: 10.1002/lary.32198. Epub 2025 May 1.
2
Impact of COVID-19 on Public Interest in Breast Cancer Screening and Related Symptoms: Google Trends Analysis.2019年冠状病毒病对公众乳腺癌筛查兴趣及相关症状的影响:谷歌趋势分析
JMIR Cancer. 2023 Jun 6;9:e39105. doi: 10.2196/39105.
3
White Paper: Mimetics of Class 2 Tumor Suppressor Proteins as Novel Drug Candidates for Personalized Cancer Therapy.

本文引用的文献

1
The impact of the Internet on cancer outcomes.互联网对癌症治疗结果的影响。
CA Cancer J Clin. 2003 Nov-Dec;53(6):356-71. doi: 10.3322/canjclin.53.6.356.
2
What is the prevalence of health-related searches on the World Wide Web? Qualitative and quantitative analysis of search engine queries on the internet.万维网上与健康相关的搜索的流行程度如何?对互联网上搜索引擎查询进行定性和定量分析。
AMIA Annu Symp Proc. 2003;2003:225-9.
3
Internet use for cancer information among racial/ethnic populations and low literacy groups.种族/族裔群体和低文化素养群体中通过互联网获取癌症信息的情况。
白皮书:作为个性化癌症治疗新候选药物的2类肿瘤抑制蛋白模拟物
Cancers (Basel). 2022 Sep 9;14(18):4386. doi: 10.3390/cancers14184386.
4
News Coverage of Colorectal Cancer on Google News: Descriptive Study.谷歌新闻上结直肠癌的新闻报道:描述性研究。
JMIR Cancer. 2022 Jun 15;8(2):e39180. doi: 10.2196/39180.
5
Digesting the Contents: an Analysis of Online Colorectal Cancer Education Websites.消化内容:在线结直肠癌教育网站分析。
J Cancer Educ. 2022 Apr;37(2):263-273. doi: 10.1007/s13187-020-01864-5. Epub 2020 Sep 9.
6
Quality of Online Information Regarding Cervical Cancer.关于宫颈癌的在线信息质量
Cureus. 2020 Aug 1;12(8):e9511. doi: 10.7759/cureus.9511.
7
Quality of Online Resources for Pancreatic Cancer Patients.胰腺癌患者在线资源的质量。
J Cancer Educ. 2019 Apr;34(2):223-228. doi: 10.1007/s13187-017-1290-8.
8
Disease Monitoring and Health Campaign Evaluation Using Google Search Activities for HIV and AIDS, Stroke, Colorectal Cancer, and Marijuana Use in Canada: A Retrospective Observational Study.利用谷歌搜索活动对加拿大艾滋病毒和艾滋病、中风、结直肠癌及大麻使用情况进行疾病监测与健康运动评估:一项回顾性观察研究
JMIR Public Health Surveill. 2016 Oct 12;2(2):e156. doi: 10.2196/publichealth.6504.
9
Medical information on the internet: a tool for measuring consumer perception of quality aspects.互联网上的医学信息:一种衡量消费者对质量方面认知的工具。
Interact J Med Res. 2015 Mar 30;4(1):e8. doi: 10.2196/ijmr.3144.
10
The impact of search engine selection and sorting criteria on vaccination beliefs and attitudes: two experiments manipulating Google output.搜索引擎选择和排序标准对疫苗接种观念及态度的影响:两项操纵谷歌搜索结果的实验
J Med Internet Res. 2014 Apr 2;16(4):e100. doi: 10.2196/jmir.2642.
Cancer Control. 2003 Sep-Oct;10(5 Suppl):45-51. doi: 10.1177/107327480301005s07.
4
Information gathering over time by breast cancer patients.乳腺癌患者随时间推移进行的信息收集。
J Med Internet Res. 2003 Jul-Sep;5(3):e15. doi: 10.2196/jmir.5.3.e15. Epub 2003 Aug 27.
5
Internet use by patients with prostate cancer undergoing radiotherapy.接受放射治疗的前列腺癌患者的互联网使用情况。
Urology. 2003 Aug;62(2):273-7. doi: 10.1016/s0090-4295(03)00251-6.
6
A multi-institutional study of Internet utilization by radiation oncology patients.一项关于放射肿瘤患者互联网使用情况的多机构研究。
Int J Radiat Oncol Biol Phys. 2003 Jul 15;56(4):1201-5. doi: 10.1016/s0360-3016(03)00407-3.
7
Use of the Internet and e-mail for health care information: results from a national survey.利用互联网和电子邮件获取医疗保健信息:一项全国性调查的结果。
JAMA. 2003 May 14;289(18):2400-6. doi: 10.1001/jama.289.18.2400.
8
American oncologists' views of internet use by cancer patients: a mail survey of American Society of Clinical Oncology members.美国肿瘤学家对癌症患者使用互联网情况的看法:对美国临床肿瘤学会成员的邮件调查
J Clin Oncol. 2003 Mar 1;21(5):942-7. doi: 10.1200/JCO.2003.08.007.
9
Patient use of the internet for information in a lung cancer clinic.肺癌诊所中患者利用互联网获取信息的情况。
Chest. 2003 Feb;123(2):452-7. doi: 10.1378/chest.123.2.452.
10
Use of the Internet by women with breast cancer.乳腺癌女性对互联网的使用情况。
J Med Internet Res. 2002 Apr-Nov;4(2):E9. doi: 10.2196/jmir.4.2.e9.