SPELLING CORRECTION IN THE PUBMED SEARCH ENGINE.

Author information

Wilbur W John, Kim Won, Xie Natalie

Affiliation

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, U.S.A.

Publication information

Inf Retr Boston. 2006 Nov;9(5):543-564. doi: 10.1007/s10791-006-9002-8.

Abstract

It is known that users of internet search engines often enter queries with misspellings in one or more search terms. Several web search engines make suggestions for correcting misspelled words, but the methods used are proprietary and unpublished to our knowledge. Here we describe the methodology we have developed to perform spelling correction for the PubMed search engine. Our approach is based on the noisy channel model for spelling correction and makes use of statistics harvested from user logs to estimate the probabilities of different types of edits that lead to misspellings. The unique problems encountered in correcting search engine queries are discussed and our solutions are outlined.
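The noisy channel idea named in the abstract can be illustrated with a minimal sketch: pick the candidate term that maximizes P(term) × P(typed | term). This is not the authors' implementation. The vocabulary, the term counts, the per-edit-type probabilities, and the function names `single_edits` and `correct` are all hypothetical; in the approach described above, the edit probabilities would be estimated from user logs rather than hard-coded, and this sketch only considers candidates one edit away.

```python
from collections import Counter

# Hypothetical term frequencies standing in for counts from a search index.
TERM_COUNTS = Counter({"diabetes": 9000, "dialysis": 4000, "diabetic": 3000})

# Hypothetical per-edit-type probabilities. The abstract says such values are
# estimated from user logs; these numbers are invented for illustration only.
EDIT_PROB = {"delete": 0.004, "insert": 0.003, "substitute": 0.002, "transpose": 0.001}
NO_ERROR_PROB = 0.95  # assumed chance that a typed term is already correct

ALPHABET = "abcdefghijklmnopqrstuvwxyz"


def single_edits(word):
    """Map each string one edit away from `word` to an edit type producing it.

    If several edit types yield the same string, the first one found is kept.
    """
    edits = {}
    for i in range(len(word) + 1):
        left, right = word[:i], word[i:]
        if right:
            edits.setdefault(left + right[1:], "delete")
        if len(right) > 1:
            edits.setdefault(left + right[1] + right[0] + right[2:], "transpose")
        for ch in ALPHABET:
            if right and ch != right[0]:
                edits.setdefault(left + ch + right[1:], "substitute")
            edits.setdefault(left + ch + right, "insert")
    edits.pop(word, None)  # swapping two identical letters is not an error
    return edits


def correct(typed):
    """Return the vocabulary term maximizing P(term) * P(typed | term).

    Falls back to the typed string when no vocabulary term is within one edit.
    """
    total = sum(TERM_COUNTS.values())
    best_term, best_score = typed, 0.0
    for term, count in TERM_COUNTS.items():
        if term == typed:
            channel = NO_ERROR_PROB
        else:
            # Edits are applied to the intended term, so "delete" means the
            # user dropped a character while typing.
            edit_type = single_edits(term).get(typed)
            if edit_type is None:
                continue
            channel = EDIT_PROB[edit_type]
        score = (count / total) * channel
        if score > best_score:
            best_term, best_score = term, score
    return best_term


print(correct("diabtes"))
print(correct("daibetes"))
```

With this toy vocabulary, both misspellings resolve to "diabetes": the first is reachable by a single deletion and the second by a single transposition. A string with no vocabulary term within one edit is returned unchanged.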

Similar articles

2
Searching for cancer information on the internet: analyzing natural language search queries.
J Med Internet Res. 2003 Dec 11;5(4):e31. doi: 10.2196/jmir.5.4.e31.
4
Matching health information seekers' queries to medical terms.
BMC Bioinformatics. 2012;13 Suppl 14(Suppl 14):S11. doi: 10.1186/1471-2105-13-S14-S11. Epub 2012 Sep 7.
5
Common Misspellings and Their Impact on Health Sciences Literature Search Results.
Med Ref Serv Q. 2023 Jul-Sep;42(3):211-227. doi: 10.1080/02763869.2023.2214038.
6
Comparing image search behaviour in the ARRS GoldMiner search engine and a clinical PACS/RIS.
J Biomed Inform. 2015 Aug;56:57-64. doi: 10.1016/j.jbi.2015.04.013. Epub 2015 May 19.
7
A study of medical and health queries to web search engines.
Health Info Libr J. 2004 Mar;21(1):44-51. doi: 10.1111/j.1471-1842.2004.00481.x.
10
BIOMedical Search Engine Framework: Lightweight and customized implementation of domain-specific biomedical search engines.
Comput Methods Programs Biomed. 2016 Jul;131:63-77. doi: 10.1016/j.cmpb.2016.03.030. Epub 2016 Apr 8.

Cited by

2
Spell checker for consumer language (CSpell).
J Am Med Inform Assoc. 2019 Mar 1;26(3):211-218. doi: 10.1093/jamia/ocy171.
3
How user intelligence is improving PubMed.
Nat Biotechnol. 2018 Oct 1. doi: 10.1038/nbt.4267.
4
A Field Sensor: computing the composition and intent of PubMed queries.
Database (Oxford). 2018 Jan 1;2018. doi: 10.1093/database/bay052.
5
Towards PubMed 2.0.
Elife. 2017 Oct 30;6:e28801. doi: 10.7554/eLife.28801.
6
An Ensemble Method for Spelling Correction in Consumer Health Questions.
AMIA Annu Symp Proc. 2015 Nov 5;2015:727-36. eCollection 2015.
7
Context-Sensitive Spelling Correction of Consumer-Generated Content on Health Care.
JMIR Med Inform. 2015 Jul 31;3(3):e27. doi: 10.2196/medinform.4211.
8
Studying PubMed usages in the field for complex problem solving: Implications for tool design.
J Am Soc Inf Sci Technol. 2013 May 1;64(5):874-92. doi: 10.1002/asi.22796.
9
Matching health information seekers' queries to medical terms.
BMC Bioinformatics. 2012;13 Suppl 14(Suppl 14):S11. doi: 10.1186/1471-2105-13-S14-S11. Epub 2012 Sep 7.
10
Evaluating relevance ranking strategies for MEDLINE retrieval.
J Am Med Inform Assoc. 2009 Jan-Feb;16(1):32-6. doi: 10.1197/jamia.M2935. Epub 2008 Oct 24.
