• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

两种用于 COVID-19 文献的商业深度学习信息检索系统的评估。

An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature.

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA.

出版信息

J Am Med Inform Assoc. 2021 Jan 15;28(1):132-137. doi: 10.1093/jamia/ocaa271.

DOI:10.1093/jamia/ocaa271
PMID:33197268
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7717324/
Abstract

The COVID-19 pandemic has resulted in a tremendous need for access to the latest scientific information, leading to both corpora for COVID-19 literature and search engines to query such data. While most search engine research is performed in academia with rigorous evaluation, major commercial companies dominate the web search market. Thus, it is expected that commercial pandemic-specific search engines will gain much higher traction than academic alternatives, leading to questions about the empirical performance of these tools. This paper seeks to empirically evaluate two commercial search engines for COVID-19 (Google and Amazon) in comparison with academic prototypes evaluated in the TREC-COVID task. We performed several steps to reduce bias in the manual judgments to ensure a fair comparison of all systems. We find the commercial search engines sizably underperformed those evaluated under TREC-COVID. This has implications for trust in popular health search engines and developing biomedical search engines for future health crises.

摘要

新冠疫情大流行导致人们对获取最新科学信息的需求巨大,这就需要建立新冠文献语料库和搜索引擎来查询这些数据。虽然大多数搜索引擎研究都是在学术界进行的,并且有严格的评估,但大型商业公司却主导着网络搜索市场。因此,可以预期商业性的新冠专用搜索引擎将比学术性的替代方案获得更高的关注度,这就引发了关于这些工具的实证性能的问题。本文旨在对两个商业搜索引擎(谷歌和亚马逊)进行实证评估,以与 TREC-COVID 任务中评估的学术原型进行比较。我们采取了几个步骤来减少手动判断中的偏差,以确保对所有系统进行公平的比较。我们发现商业搜索引擎的表现明显逊于 TREC-COVID 评估的搜索引擎。这对人们对流行健康搜索引擎的信任和为未来的健康危机开发生物医学搜索引擎产生了影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/131e/7810442/54384b1e9c42/ocaa271f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/131e/7810442/4b9c8039acda/ocaa271f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/131e/7810442/162dfd212099/ocaa271f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/131e/7810442/54384b1e9c42/ocaa271f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/131e/7810442/4b9c8039acda/ocaa271f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/131e/7810442/162dfd212099/ocaa271f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/131e/7810442/54384b1e9c42/ocaa271f3.jpg

相似文献

1
An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature.两种用于 COVID-19 文献的商业深度学习信息检索系统的评估。
J Am Med Inform Assoc. 2021 Jan 15;28(1):132-137. doi: 10.1093/jamia/ocaa271.
2
TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19.TREC-COVID:针对 COVID-19 的信息检索共享任务的原理和结构。
J Am Med Inform Assoc. 2020 Jul 1;27(9):1431-1436. doi: 10.1093/jamia/ocaa091.
3
Information Retrieval in an Infodemic: The Case of COVID-19 Publications.信息疫情中的信息检索:以新冠疫情相关出版物为例
J Med Internet Res. 2021 Sep 17;23(9):e30161. doi: 10.2196/30161.
4
What every researcher should know about searching - clarified concepts, search advice, and an agenda to improve finding in academia.每个研究人员都应该知道的搜索知识——澄清概念、搜索建议以及改善学术界发现的议程。
Res Synth Methods. 2021 Mar;12(2):136-147. doi: 10.1002/jrsm.1457. Epub 2020 Oct 8.
5
Searching for scientific evidence in a pandemic: An overview of TREC-COVID.在大流行中寻找科学证据:TREC-COVID 概述。
J Biomed Inform. 2021 Sep;121:103865. doi: 10.1016/j.jbi.2021.103865. Epub 2021 Jul 8.
6
Learning to rank query expansion terms for COVID-19 scholarly search.学习对 COVID-19 学术搜索进行查询扩展词的排序。
J Biomed Inform. 2023 Jun;142:104386. doi: 10.1016/j.jbi.2023.104386. Epub 2023 May 12.
7
Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study.通过分析伊朗谷歌趋势数据预测 COVID-19 发病率:数据挖掘和深度学习初步研究。
JMIR Public Health Surveill. 2020 Apr 14;6(2):e18828. doi: 10.2196/18828.
8
A comparative analysis of system features used in the TREC-COVID information retrieval challenge.TREC-COVID 信息检索挑战赛中使用的系统特征的比较分析。
J Biomed Inform. 2021 May;117:103745. doi: 10.1016/j.jbi.2021.103745. Epub 2021 Apr 6.
9
Objective and automated protocols for the evaluation of biomedical search engines using No Title Evaluation protocols.使用无标题评估协议评估生物医学搜索引擎的客观和自动化协议。
BMC Bioinformatics. 2008 Feb 29;9:132. doi: 10.1186/1471-2105-9-132.
10
Searching COVID-19 Clinical Research Using Graph Queries: Algorithm Development and Validation.使用图查询搜索 COVID-19 临床研究:算法开发与验证。
J Med Internet Res. 2024 May 30;26:e52655. doi: 10.2196/52655.

引用本文的文献

1
Searching for evidence in public health emergencies: a white paper of best practices.公共卫生突发事件中的证据检索:最佳实践白皮书。
J Med Libr Assoc. 2023 Apr 21;111(1-2):566-578. doi: 10.5195/jmla.2023.1530.
2
Artificial Intelligence: A Next-Level Approach in Confronting the COVID-19 Pandemic.人工智能:应对新冠疫情的一种进阶方法。
Healthcare (Basel). 2023 Mar 14;11(6):854. doi: 10.3390/healthcare11060854.
3
Heterogeneous deep graph convolutional network with citation relational BERT for COVID-19 inline citation recommendation.

本文引用的文献

1
A Comparative Analysis of Speed and Accuracy for Three Off-the-Shelf De-Identification Tools.三种现成去识别工具的速度与准确性比较分析
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:241-250. eCollection 2020.
2
TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19.TREC-COVID:针对 COVID-19 的信息检索共享任务的原理和结构。
J Am Med Inform Assoc. 2020 Jul 1;27(9):1431-1436. doi: 10.1093/jamia/ocaa091.
3
Deep learning in clinical natural language processing: a methodical review.深度学习在临床自然语言处理中的应用:系统综述。
用于COVID-19内联引用推荐的具有引用关系BERT的异构深度图卷积网络。
Expert Syst Appl. 2023 Mar 1;213:118841. doi: 10.1016/j.eswa.2022.118841. Epub 2022 Sep 17.
4
A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques.基于深度学习技术的 COVID-19 谣言数据集的自然语言处理 (NLP) 评估。
Comput Intell Neurosci. 2022 Sep 14;2022:6561622. doi: 10.1155/2022/6561622. eCollection 2022.
5
GFCNet: Utilizing graph feature collection networks for coronavirus knowledge graph embeddings.GFCNet:利用图特征收集网络进行冠状病毒知识图谱嵌入
Inf Sci (N Y). 2022 Aug;608:1557-1571. doi: 10.1016/j.ins.2022.07.031. Epub 2022 Jul 14.
6
Search like an expert: Reducing expertise disparity using a hybrid neural index for COVID-19 queries.专家级搜索:利用混合神经网络索引减少 COVID-19 查询中的专业知识差距。
J Biomed Inform. 2022 Mar;127:104005. doi: 10.1016/j.jbi.2022.104005. Epub 2022 Feb 8.
7
Applications of artificial intelligence in COVID-19 pandemic: A comprehensive review.人工智能在新冠疫情中的应用:一项全面综述。
Expert Syst Appl. 2021 Dec 15;185:115695. doi: 10.1016/j.eswa.2021.115695. Epub 2021 Aug 4.
8
Information Retrieval in an Infodemic: The Case of COVID-19 Publications.信息疫情中的信息检索:以新冠疫情相关出版物为例
J Med Internet Res. 2021 Sep 17;23(9):e30161. doi: 10.2196/30161.
9
Searching for scientific evidence in a pandemic: An overview of TREC-COVID.在大流行中寻找科学证据:TREC-COVID 概述。
J Biomed Inform. 2021 Sep;121:103865. doi: 10.1016/j.jbi.2021.103865. Epub 2021 Jul 8.
10
COBERT: COVID-19 Question Answering System Using BERT.COBERT:使用BERT的COVID-19问答系统。
Arab J Sci Eng. 2021 Jun 23:1-11. doi: 10.1007/s13369-021-05810-5.
J Am Med Inform Assoc. 2020 Mar 1;27(3):457-470. doi: 10.1093/jamia/ocz200.
4
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.