• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医学文献数据库检索中用于伪相关反馈的术语排序算法评估

Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval.

作者信息

Yoo Sooyoung, Choi Jinwook

机构信息

Medical Information Center, Seoul National University Bundang Hospital, Seongnam, Korea.

出版信息

Healthc Inform Res. 2011 Jun;17(2):120-30. doi: 10.4258/hir.2011.17.2.120. Epub 2011 Jun 30.

DOI:10.4258/hir.2011.17.2.120
PMID:21886873
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3155169/
Abstract

OBJECTIVES

The purpose of this study was to investigate the effects of query expansion algorithms for MEDLINE retrieval within a pseudo-relevance feedback framework.

METHODS

A number of query expansion algorithms were tested using various term ranking formulas, focusing on query expansion based on pseudo-relevance feedback. The OHSUMED test collection, which is a subset of the MEDLINE database, was used as a test corpus. Various ranking algorithms were tested in combination with different term re-weighting algorithms.

RESULTS

Our comprehensive evaluation showed that the local context analysis ranking algorithm, when used in combination with one of the reweighting algorithms - Rocchio, the probabilistic model, and our variants - significantly outperformed other algorithm combinations by up to 12% (paired t-test; p < 0.05). In a pseudo-relevance feedback framework, effective query expansion would be achieved by the careful consideration of term ranking and re-weighting algorithm pairs, at least in the context of the OHSUMED corpus.

CONCLUSIONS

Comparative experiments on term ranking algorithms were performed in the context of a subset of MEDLINE documents. With medical documents, local context analysis, which uses co-occurrence with all query terms, significantly outperformed various term ranking methods based on both frequency and distribution analyses. Furthermore, the results of the experiments demonstrated that the term rank-based re-weighting method contributed to a remarkable improvement in mean average precision.

摘要

目的

本研究旨在调查在伪相关反馈框架内用于MEDLINE检索的查询扩展算法的效果。

方法

使用各种词项排名公式测试了多种查询扩展算法,重点是基于伪相关反馈的查询扩展。将MEDLINE数据库的一个子集OHSUMED测试集用作测试语料库。将各种排名算法与不同的词项重新加权算法结合进行测试。

结果

我们的综合评估表明,局部上下文分析排名算法与其中一种重新加权算法(罗基奥算法、概率模型以及我们的变体算法)结合使用时,显著优于其他算法组合,最高可达12%(配对t检验;p < 0.05)。在伪相关反馈框架中,至少在OHSUMED语料库的背景下,通过仔细考虑词项排名和重新加权算法对,可以实现有效的查询扩展。

结论

在MEDLINE文档子集的背景下对词项排名算法进行了比较实验。对于医学文档,使用与所有查询词项共现的局部上下文分析明显优于基于频率和分布分析的各种词项排名方法。此外,实验结果表明基于词项排名的重新加权方法显著提高了平均准确率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6035/3155169/343bf5be8fce/hir-17-120-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6035/3155169/f776a2c57774/hir-17-120-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6035/3155169/343bf5be8fce/hir-17-120-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6035/3155169/f776a2c57774/hir-17-120-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6035/3155169/343bf5be8fce/hir-17-120-g002.jpg

相似文献

1
Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval.医学文献数据库检索中用于伪相关反馈的术语排序算法评估
Healthc Inform Res. 2011 Jun;17(2):120-30. doi: 10.4258/hir.2011.17.2.120. Epub 2011 Jun 30.
2
On the query reformulation technique for effective MEDLINE document retrieval.针对有效 MEDLINE 文档检索的查询改写技术。
J Biomed Inform. 2010 Oct;43(5):686-93. doi: 10.1016/j.jbi.2010.04.005. Epub 2010 Apr 13.
3
G-Bean: an ontology-graph based web tool for biomedical literature retrieval.G-Bean:基于本体图的生物医学文献检索网络工具。
BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-15-S12-S1. Epub 2014 Nov 6.
4
Semantic concept-enriched dependence model for medical information retrieval.用于医学信息检索的语义概念增强依赖模型
J Biomed Inform. 2014 Feb;47:18-27. doi: 10.1016/j.jbi.2013.08.013. Epub 2013 Sep 11.
5
A Firefly Algorithm-based Approach for Pseudo-Relevance Feedback: Application to Medical Database.一种基于萤火虫算法的伪相关反馈方法:在医学数据库中的应用
J Med Syst. 2016 Nov;40(11):240. doi: 10.1007/s10916-016-0603-5. Epub 2016 Sep 27.
6
Relevance Feedback Based Query Expansion Model Using Borda Count and Semantic Similarity Approach.基于Borda计数和语义相似性方法的相关反馈查询扩展模型
Comput Intell Neurosci. 2015;2015:568197. doi: 10.1155/2015/568197. Epub 2015 Dec 7.
7
Document Retrieval for Precision Medicine Using a Deep Learning Ensemble Method.使用深度学习集成方法进行精准医学的文献检索
JMIR Med Inform. 2021 Jun 29;9(6):e28272. doi: 10.2196/28272.
8
Improved biomedical term selection in pseudo relevance feedback.伪相关反馈中改进的生物医学术语选择。
Database (Oxford). 2018 Jan 1;2018. doi: 10.1093/database/bay056.
9
Evaluating relevance ranking strategies for MEDLINE retrieval.评估用于MEDLINE检索的相关性排序策略。
J Am Med Inform Assoc. 2009 Jan-Feb;16(1):32-6. doi: 10.1197/jamia.M2935. Epub 2008 Oct 24.
10
Assessing thesaurus-based query expansion using the UMLS Metathesaurus.使用统一医学语言系统(UMLS)元词表评估基于词库的查询扩展。
Proc AMIA Symp. 2000:344-8.

引用本文的文献

1
Triage by ranking to support the curation of protein interactions.通过排名进行分类以支持蛋白质相互作用的整理。
Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax040.

本文引用的文献

1
Enabling multi-level relevance feedback on PubMed by integrating rank learning into DBMS.通过将排序学习集成到 DBMS 中,实现 PubMed 上的多层次相关性反馈。
BMC Bioinformatics. 2010 Apr 16;11 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-11-S2-S6.
2
On the query reformulation technique for effective MEDLINE document retrieval.针对有效 MEDLINE 文档检索的查询改写技术。
J Biomed Inform. 2010 Oct;43(5):686-93. doi: 10.1016/j.jbi.2010.04.005. Epub 2010 Apr 13.
3
MiSearch adaptive pubMed search tool.MiSearch自适应PubMed搜索工具。
Bioinformatics. 2009 Apr 1;25(7):974-6. doi: 10.1093/bioinformatics/btn033. Epub 2008 Mar 6.
4
Using citation data to improve retrieval from MEDLINE.利用引用数据改进MEDLINE检索。
J Am Med Inform Assoc. 2006 Jan-Feb;13(1):96-105. doi: 10.1197/jamia.M1909. Epub 2005 Oct 12.
5
Networked information and clinical decision making: the experience of Birmingham Heartlands and Solihull National Health Service Trust (Teaching).
Med Educ. 2001 Feb;35(2):167-72. doi: 10.1046/j.1365-2923.2001.00839.x.
6
Assessing thesaurus-based query expansion using the UMLS Metathesaurus.使用统一医学语言系统(UMLS)元词表评估基于词库的查询扩展。
Proc AMIA Symp. 2000:344-8.
7
Knowledge retrieval as one type of knowledge-based decision support in medicine: results of an evaluation study.
Int J Biomed Comput. 1996 Apr;41(2):69-85. doi: 10.1016/0020-7101(96)01160-9.
8
Retrieval feedback in MEDLINE.医学文献数据库(MEDLINE)中的检索反馈
J Am Med Inform Assoc. 1996 Mar-Apr;3(2):157-67. doi: 10.1136/jamia.1996.96236284.
9
Words or concepts: the features of indexing units and their optimal use in information retrieval.词汇或概念:索引单元的特征及其在信息检索中的最佳应用。
Proc Annu Symp Comput Appl Med Care. 1993:685-9.