• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用上下文检索器、观点聚合器和问答模型揭示对 COVID-19 问题的看法:模型开发研究。

Revealing Opinions for COVID-19 Questions Using a Context Retriever, Opinion Aggregator, and Question-Answering Model: Model Development Study.

机构信息

Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, TN, United States.

Department of Linguistics, The Ohio State University, Columbus, OH, United States.

出版信息

J Med Internet Res. 2021 Mar 19;23(3):e22860. doi: 10.2196/22860.

DOI:10.2196/22860
PMID:33739287
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7984426/
Abstract

BACKGROUND

COVID-19 has challenged global public health because it is highly contagious and can be lethal. Numerous ongoing and recently published studies about the disease have emerged. However, the research regarding COVID-19 is largely ongoing and inconclusive.

OBJECTIVE

A potential way to accelerate COVID-19 research is to use existing information gleaned from research into other viruses that belong to the coronavirus family. Our objective is to develop a natural language processing method for answering factoid questions related to COVID-19 using published articles as knowledge sources.

METHODS

Given a question, first, a BM25-based context retriever model is implemented to select the most relevant passages from previously published articles. Second, for each selected context passage, an answer is obtained using a pretrained bidirectional encoder representations from transformers (BERT) question-answering model. Third, an opinion aggregator, which is a combination of a biterm topic model and k-means clustering, is applied to the task of aggregating all answers into several opinions.

RESULTS

We applied the proposed pipeline to extract answers, opinions, and the most frequent words related to six questions from the COVID-19 Open Research Dataset Challenge. By showing the longitudinal distributions of the opinions, we uncovered the trends of opinions and popular words in the articles published in the five time periods assessed: before 1990, 1990-1999, 2000-2009, 2010-2018, and since 2019. The changes in opinions and popular words agree with several distinct characteristics and challenges of COVID-19, including a higher risk for senior people and people with pre-existing medical conditions; high contagion and rapid transmission; and a more urgent need for screening and testing. The opinions and popular words also provide additional insights for the COVID-19-related questions.

CONCLUSIONS

Compared with other methods of literature retrieval and answer generation, opinion aggregation using our method leads to more interpretable, robust, and comprehensive question-specific literature reviews. The results demonstrate the usefulness of the proposed method in answering COVID-19-related questions with main opinions and capturing the trends of research about COVID-19 and other relevant strains of coronavirus in recent years.

摘要

背景

COVID-19 对全球公共卫生构成了挑战,因为它具有高度传染性并且可能致命。目前有许多正在进行和最近发表的关于该疾病的研究。然而,关于 COVID-19 的研究在很大程度上仍在进行中,尚无定论。

目的

加速 COVID-19 研究的一种潜在方法是利用从属于冠状病毒家族的其他病毒的研究中获取的现有信息。我们的目标是开发一种自然语言处理方法,使用已发表的文章作为知识来源,回答与 COVID-19 相关的事实问题。

方法

给定一个问题,首先,我们实施基于 BM25 的上下文检索模型,从先前发表的文章中选择最相关的段落。其次,对于每个选定的上下文段落,我们使用预先训练的基于转换器的双向编码器表示(BERT)问答模型来获取答案。然后,我们应用一种观点聚合器,它是双词主题模型和 K 均值聚类的组合,用于将所有答案聚合为几个观点的任务。

结果

我们将提出的管道应用于从 COVID-19 开放研究数据集挑战赛中提取六个问题的答案、观点和最常出现的单词。通过展示观点的纵向分布,我们揭示了在评估的五个时间段(1990 年前、1990-1999 年、2000-2009 年、2010-2018 年和 2019 年以来)发表的文章中观点和流行词汇的趋势。观点和流行词汇的变化与 COVID-19 的几个明显特征和挑战相符,包括老年人和有预先存在的医疗条件的人风险更高;高传染性和快速传播;以及更迫切需要筛查和检测。这些观点和流行词汇也为与 COVID-19 相关的问题提供了更多的见解。

结论

与其他文献检索和答案生成方法相比,使用我们的方法进行观点聚合可生成更具可解释性、稳健性和全面性的特定于问题的文献综述。结果表明,该方法在回答与 COVID-19 相关的问题方面具有实用性,并且可以捕捉近年来 COVID-19 和其他相关冠状病毒株的研究趋势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98a3/7984426/a9b4b38a0527/jmir_v23i3e22860_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98a3/7984426/b3aba399831e/jmir_v23i3e22860_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98a3/7984426/a9b4b38a0527/jmir_v23i3e22860_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98a3/7984426/b3aba399831e/jmir_v23i3e22860_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98a3/7984426/a9b4b38a0527/jmir_v23i3e22860_fig2.jpg

相似文献

1
Revealing Opinions for COVID-19 Questions Using a Context Retriever, Opinion Aggregator, and Question-Answering Model: Model Development Study.使用上下文检索器、观点聚合器和问答模型揭示对 COVID-19 问题的看法:模型开发研究。
J Med Internet Res. 2021 Mar 19;23(3):e22860. doi: 10.2196/22860.
2
SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions.SemBioNLQA:一个语义生物医学问答系统,用于检索自然语言问题的准确和理想答案。
Artif Intell Med. 2020 Jan;102:101767. doi: 10.1016/j.artmed.2019.101767. Epub 2019 Nov 28.
3
CoQUAD: a COVID-19 question answering dataset system, facilitating research, benchmarking, and practice.CoQUAD:一个 COVID-19 问答数据集系统,促进研究、基准测试和实践。
BMC Bioinformatics. 2022 Jun 2;23(1):210. doi: 10.1186/s12859-022-04751-6.
4
A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.一种基于概率信息检索模型和统一医学语言系统(UMLS)概念的生物医学问答中的段落检索方法。
J Biomed Inform. 2017 Apr;68:96-103. doi: 10.1016/j.jbi.2017.03.001. Epub 2017 Mar 7.
5
Building a Discourse-Argument Hybrid System for Vietnamese Why-Question Answering.构建一个用于越南语问答的语篇-论证混合系统。
Comput Intell Neurosci. 2021 Dec 28;2021:6550871. doi: 10.1155/2021/6550871. eCollection 2021.
6
COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization.基于深度学习的语义搜索、问答和摘要生成技术进行的COVID-19信息检索
NPJ Digit Med. 2021 Apr 12;4(1):68. doi: 10.1038/s41746-021-00437-0.
7
A Question-and-Answer System to Extract Data From Free-Text Oncological Pathology Reports (CancerBERT Network): Development Study.从自由文本肿瘤病理学报告(CancerBERT 网络)中提取数据的问答系统:开发研究。
J Med Internet Res. 2022 Mar 23;24(3):e27210. doi: 10.2196/27210.
8
Information extraction from weakly structured radiological reports with natural language queries.利用自然语言查询从弱结构放射学报告中提取信息。
Eur Radiol. 2024 Jan;34(1):330-337. doi: 10.1007/s00330-023-09977-3. Epub 2023 Jul 28.
9
Consumer health information and question answering: helping consumers find answers to their health-related information needs.消费者健康信息与问答:帮助消费者寻找与其健康相关的信息需求的答案。
J Am Med Inform Assoc. 2020 Feb 1;27(2):194-201. doi: 10.1093/jamia/ocz152.
10
Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing.人工智能在行动:利用自然语言处理应对 COVID-19 大流行。
Annu Rev Biomed Data Sci. 2021 Jul 20;4:313-339. doi: 10.1146/annurev-biodatasci-021821-061045. Epub 2021 May 14.

引用本文的文献

1
The emergence of large language models as tools in literature reviews: a large language model-assisted systematic review.大语言模型作为文献综述工具的出现:一项大语言模型辅助的系统综述
J Am Med Inform Assoc. 2025 Jun 1;32(6):1071-1086. doi: 10.1093/jamia/ocaf063.
2
From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality?从可解释到可理解的深度学习在医疗保健自然语言处理中的应用:离现实还有多远?
Comput Struct Biotechnol J. 2024 May 9;24:362-373. doi: 10.1016/j.csbj.2024.05.004. eCollection 2024 Dec.

本文引用的文献

1
COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization.基于深度学习的语义搜索、问答和摘要生成技术进行的COVID-19信息检索
NPJ Digit Med. 2021 Apr 12;4(1):68. doi: 10.1038/s41746-021-00437-0.
2
Covid-19 and Immunity in Aging Populations - A New Research Agenda.新冠疫情与老年人群的免疫力——一项新的研究议程。
N Engl J Med. 2020 Aug 27;383(9):804-805. doi: 10.1056/NEJMp2006761. Epub 2020 Apr 17.
3
Clinical Characteristics of Coronavirus Disease 2019 in China.《中国 2019 年冠状病毒病临床特征》
N Engl J Med. 2020 Apr 30;382(18):1708-1720. doi: 10.1056/NEJMoa2002032. Epub 2020 Feb 28.
4
Atypical Repetition in Daily Conversation on Different Days for Detecting Alzheimer Disease: Evaluation of Phone-Call Data From Regular Monitoring Service.通过日常对话中不同日期的非典型重复来检测阿尔茨海默病:对常规监测服务中的电话数据进行评估
JMIR Ment Health. 2020 Jan 12;7(1):e16790. doi: 10.2196/16790.
5
Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)-Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study.基于大规模电子健康记录笔记对基于变换器的双向编码器表征(BERT)模型进行微调:一项实证研究。
JMIR Med Inform. 2019 Sep 12;7(3):e14830. doi: 10.2196/14830.
6
Early Detection of Depression: Social Network Analysis and Random Forest Techniques.抑郁症的早期检测:社交网络分析与随机森林技术
J Med Internet Res. 2019 Jun 10;21(6):e12554. doi: 10.2196/12554.
7
Middle East respiratory syndrome.中东呼吸综合征
Lancet. 2015 Sep 5;386(9997):995-1007. doi: 10.1016/S0140-6736(15)60454-8. Epub 2015 Jun 3.