• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

《COVID-19 文献综述:基于机器学习的文献计量分析》

A Comprehensive Overview of the COVID-19 Literature: Machine Learning-Based Bibliometric Analysis.

机构信息

Division of Information and Computing Technology, College of Science and Engineering, Hamad Bin Khalifa University, Qatar Foundation, Doha, Qatar.

College of Health and Life Sciences, Hamad Bin Khalifa University, Qatar Foundation, Doha, Qatar.

出版信息

J Med Internet Res. 2021 Mar 8;23(3):e23703. doi: 10.2196/23703.

DOI:10.2196/23703
PMID:33600346
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7942394/
Abstract

BACKGROUND

Shortly after the emergence of COVID-19, researchers rapidly mobilized to study numerous aspects of the disease such as its evolution, clinical manifestations, effects, treatments, and vaccinations. This led to a rapid increase in the number of COVID-19-related publications. Identifying trends and areas of interest using traditional review methods (eg, scoping and systematic reviews) for such a large domain area is challenging.

OBJECTIVE

We aimed to conduct an extensive bibliometric analysis to provide a comprehensive overview of the COVID-19 literature.

METHODS

We used the COVID-19 Open Research Dataset (CORD-19) that consists of a large number of research articles related to all coronaviruses. We used a machine learning-based method to analyze the most relevant COVID-19-related articles and extracted the most prominent topics. Specifically, we used a clustering algorithm to group published articles based on the similarity of their abstracts to identify research hotspots and current research directions. We have made our software accessible to the community via GitHub.

RESULTS

Of the 196,630 publications retrieved from the database, we included 28,904 in our analysis. The mean number of weekly publications was 990 (SD 789.3). The country that published the highest number of COVID-19-related articles was China (2950/17,270, 17.08%). The highest number of articles were published in bioRxiv. Lei Liu affiliated with the Southern University of Science and Technology in China published the highest number of articles (n=46). Based on titles and abstracts alone, we were able to identify 1515 surveys, 733 systematic reviews, 512 cohort studies, 480 meta-analyses, and 362 randomized control trials. We identified 19 different topics covered among the publications reviewed. The most dominant topic was public health response, followed by clinical care practices during the COVID-19 pandemic, clinical characteristics and risk factors, and epidemic models for its spread.

CONCLUSIONS

We provide an overview of the COVID-19 literature and have identified current hotspots and research directions. Our findings can be useful for the research community to help prioritize research needs and recognize leading COVID-19 researchers, institutes, countries, and publishers. Our study shows that an AI-based bibliometric analysis has the potential to rapidly explore a large corpus of academic publications during a public health crisis. We believe that this work can be used to analyze other eHealth-related literature to help clinicians, administrators, and policy makers to obtain a holistic view of the literature and be able to categorize different topics of the existing research for further analyses. It can be further scaled (for instance, in time) to clinical summary documentation. Publishers should avoid noise in the data by developing a way to trace the evolution of individual publications and unique authors.

摘要

背景

新冠疫情爆发后不久,研究人员迅速行动起来,研究疾病的诸多方面,如疾病的演变、临床表现、影响、治疗和疫苗接种等。这导致与 COVID-19 相关的出版物数量迅速增加。使用传统的综述方法(例如,范围和系统综述)来确定如此大的领域的趋势和关注领域具有挑战性。

目的

我们旨在进行广泛的文献计量分析,以提供 COVID-19 文献的综合概述。

方法

我们使用了包含大量与所有冠状病毒相关研究文章的 COVID-19 开放研究数据集 (CORD-19)。我们使用基于机器学习的方法来分析与 COVID-19 最相关的文章,并提取最突出的主题。具体来说,我们使用聚类算法根据其摘要的相似性对已发表的文章进行分组,以识别研究热点和当前的研究方向。我们已经通过 GitHub 向社区提供了我们的软件。

结果

从数据库中检索到的 196630 篇出版物中,我们纳入了 28904 篇进行分析。每周出版物的平均数量为 990 篇(SD 789.3)。发表与 COVID-19 相关文章数量最多的国家是中国(2950/17270,17.08%)。发表文章最多的出版物是 bioRxiv。中国南方科技大学的 Lei Liu 发表的文章数量最多(n=46)。仅根据标题和摘要,我们就能够识别出 1515 项调查、733 项系统评价、512 项队列研究、480 项荟萃分析和 362 项随机对照试验。我们在审查的出版物中确定了 19 个不同的主题。最主要的主题是公共卫生应对,其次是 COVID-19 大流行期间的临床护理实践、临床特征和危险因素以及传播的流行模型。

结论

我们提供了 COVID-19 文献的概述,并确定了当前的热点和研究方向。我们的研究结果可以为研究界提供帮助,确定研究需求的优先级,并识别领先的 COVID-19 研究人员、机构、国家和出版商。我们的研究表明,基于人工智能的文献计量分析具有在公共卫生危机期间快速探索大量学术出版物的潜力。我们相信这项工作可以用于分析其他电子健康相关文献,以帮助临床医生、管理人员和政策制定者全面了解文献,并能够对现有研究的不同主题进行分类,以便进一步分析。它可以进一步扩展(例如,在时间上)到临床总结文件。出版商应通过开发一种跟踪单个出版物和独特作者演变的方法来避免数据中的噪音。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28c/7942394/207fac8b57f2/jmir_v23i3e23703_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28c/7942394/0e85b82cbdec/jmir_v23i3e23703_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28c/7942394/207fac8b57f2/jmir_v23i3e23703_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28c/7942394/0e85b82cbdec/jmir_v23i3e23703_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28c/7942394/207fac8b57f2/jmir_v23i3e23703_fig2.jpg

相似文献

1
A Comprehensive Overview of the COVID-19 Literature: Machine Learning-Based Bibliometric Analysis.《COVID-19 文献综述:基于机器学习的文献计量分析》
J Med Internet Res. 2021 Mar 8;23(3):e23703. doi: 10.2196/23703.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Scars of COVID-19: A bibliometric analysis of post-COVID-19 fibrosis.新冠疫情后的疤痕:COVID-19 后纤维化的文献计量分析。
Front Public Health. 2022 Sep 20;10:967829. doi: 10.3389/fpubh.2022.967829. eCollection 2022.
4
Analysis of Scientific Publications During the Early Phase of the COVID-19 Pandemic: Topic Modeling Study.新冠疫情早期科学出版物分析:主题建模研究
J Med Internet Res. 2020 Nov 10;22(11):e21559. doi: 10.2196/21559.
5
Mapping the situation of research on coronavirus disease-19 (COVID-19): a preliminary bibliometric analysis during the early stage of the outbreak.绘制新型冠状病毒病(COVID-19)研究态势图:疫情爆发早期的初步文献计量分析。
BMC Infect Dis. 2020 Aug 1;20(1):561. doi: 10.1186/s12879-020-05293-z.
6
Global Research Trends in Pediatric COVID-19: A Bibliometric Analysis.全球儿童 COVID-19 研究趋势:文献计量分析。
Front Public Health. 2022 Feb 16;10:798005. doi: 10.3389/fpubh.2022.798005. eCollection 2022.
7
Global research trends in MERS-CoV: A comprehensive bibliometric analysis from 2012 to 2021.全球 MERS-CoV 研究趋势:2012 年至 2021 年的综合文献计量分析。
Front Public Health. 2022 Aug 4;10:933333. doi: 10.3389/fpubh.2022.933333. eCollection 2022.
8
Mapping Trends and Hotspots Regarding Clinical Research on COVID-19: A Bibliometric Analysis of Global Research.对 COVID-19 临床研究的趋势和热点进行映射:全球研究的文献计量分析。
Front Public Health. 2021 Aug 23;9:713487. doi: 10.3389/fpubh.2021.713487. eCollection 2021.
9
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
10
Global research on emerging trends of obstetrics during the COVID-19 pandemic: A bibliometric analysis.全球 COVID-19 大流行期间产科新兴趋势的研究:文献计量分析。
Medicine (Baltimore). 2024 Aug 2;103(31):e39182. doi: 10.1097/MD.0000000000039182.

引用本文的文献

1
Artificial intelligence empowering rare diseases: a bibliometric perspective over the last two decades.人工智能赋能罕见病研究:过去二十年的文献计量学视角
Orphanet J Rare Dis. 2024 Sep 13;19(1):345. doi: 10.1186/s13023-024-03352-1.
2
Machine Learning-Based Approach for Identifying Research Gaps: COVID-19 as a Case Study.基于机器学习的研究空白识别方法:以COVID-19为例
JMIR Form Res. 2024 Mar 5;8:e49411. doi: 10.2196/49411.
3
The impact of COVID-19-related chronic disease is gradually emerging: discovery and trends from a bibliometric analysis.

本文引用的文献

1
Accuracy and Reproducibility of Low-Dose Submillisievert Chest CT for the Diagnosis of COVID-19.低剂量亚毫西弗胸部CT诊断新型冠状病毒肺炎的准确性和可重复性
Radiol Cardiothorac Imaging. 2020 Apr 21;2(2):e200196. doi: 10.1148/ryct.2020200196. eCollection 2020 Apr.
2
Thin-Section Chest CT Imaging of COVID-19 Pneumonia: A Comparison Between Patients with Mild and Severe Disease.新型冠状病毒肺炎的胸部薄层CT成像:轻型与重型患者的比较
Radiol Cardiothorac Imaging. 2020 Apr 23;2(2):e200126. doi: 10.1148/ryct.2020200126. eCollection 2020 Apr.
3
Coronavirus Disease 2019: Initial Detection on Chest CT in a Retrospective Multicenter Study of 103 Chinese Patients.
新型冠状病毒肺炎相关慢性病的影响正逐渐显现:文献计量分析的发现与趋势
Am J Transl Res. 2023 Dec 15;15(12):6905-6910. eCollection 2023.
4
Impact of COVID-19 research: a study on predicting influential scholarly documents using machine learning and a domain-independent knowledge graph.新冠疫情研究的影响:一项使用机器学习和领域无关知识图谱预测有影响力学术文献的研究。
J Biomed Semantics. 2023 Nov 28;14(1):18. doi: 10.1186/s13326-023-00298-4.
5
The impact of COVID-19 on open access publishing in radiology and nuclear medicine: an in-depth analysis.COVID-19 对放射学和核医学开放获取出版的影响:深入分析。
J Med Life. 2023 Jul;16(7):967-973. doi: 10.25122/jml-2023-0075.
6
Emerging advances in biosecurity to underpin human, animal, plant, and ecosystem health.生物安全领域的新进展为人类、动物、植物和生态系统健康提供支撑。
iScience. 2023 Jul 31;26(9):107462. doi: 10.1016/j.isci.2023.107462. eCollection 2023 Sep 15.
7
Artificial Intelligence and COVID-19: A Systematic umbrella review and roads ahead.人工智能与2019冠状病毒病:系统综合评价及未来展望
J King Saud Univ Comput Inf Sci. 2022 Sep;34(8):5898-5920. doi: 10.1016/j.jksuci.2021.07.010. Epub 2021 Jul 15.
8
Lessons Learned from a Global Perspective of Coronavirus Disease-2019.从全球视角看 2019 冠状病毒病汲取的教训。
Clin Chest Med. 2023 Jun;44(2):435-449. doi: 10.1016/j.ccm.2022.11.020. Epub 2022 Nov 24.
9
Non-compliance with ethical rules caused by misuse of ORCID accounts: Implications for medical publications in the COVID-19 era.ORCID账户滥用导致的违反道德规则行为:对COVID-19时代医学出版物的影响。
Ethics Med Public Health. 2021 Sep;18:100692. doi: 10.1016/j.jemep.2021.100692. Epub 2021 Jun 9.
10
Impact of COVID-19 Pandemic on Biomedical Publications and Their Citation Frequency.COVID-19 大流行对生物医学出版物及其引文频次的影响。
J Korean Med Sci. 2022 Oct 17;37(40):e296. doi: 10.3346/jkms.2022.37.e296.
2019冠状病毒病:103例中国患者回顾性多中心研究中胸部CT的初步检测
Radiol Cardiothorac Imaging. 2020 Apr 6;2(2):e200092. doi: 10.1148/ryct.2020200092. eCollection 2020 Apr.
4
Chest CT Severity Score: An Imaging Tool for Assessing Severe COVID-19.胸部CT严重程度评分:一种评估重症COVID-19的影像学工具。
Radiol Cardiothorac Imaging. 2020 Mar 30;2(2):e200047. doi: 10.1148/ryct.2020200047. eCollection 2020 Apr.
5
Binding of the SARS-CoV-2 spike protein to glycans.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)刺突蛋白与聚糖的结合。
Sci Bull (Beijing). 2021 Jun 30;66(12):1205-1214. doi: 10.1016/j.scib.2021.01.010. Epub 2021 Jan 19.
6
Artificial Intelligence in the Fight Against COVID-19: Scoping Review.人工智能在抗击 COVID-19 中的应用:范围综述。
J Med Internet Res. 2020 Dec 15;22(12):e20756. doi: 10.2196/20756.
7
Transcription Factor Nrf2 as a Potential Therapeutic Target for Prevention of Cytokine Storm in COVID-19 Patients.转录因子Nrf2作为预防COVID-19患者细胞因子风暴的潜在治疗靶点
Biochemistry (Mosc). 2020 Jul;85(7):833-837. doi: 10.1134/S0006297920070111.
8
Machine Learning Maps Research Needs in COVID-19 Literature.机器学习描绘新冠疫情文献中的研究需求。
Patterns (N Y). 2020 Dec 11;1(9):100123. doi: 10.1016/j.patter.2020.100123. Epub 2020 Sep 16.
9
The psychological status of 8817 hospital workers during COVID-19 Epidemic: A cross-sectional study in Chongqing.新型冠状病毒肺炎疫情期间 8817 名医院工作人员的心理状况:重庆市的一项横断面研究。
J Affect Disord. 2020 Nov 1;276:555-561. doi: 10.1016/j.jad.2020.07.092. Epub 2020 Jul 19.
10
COVID-19 infection outbreak increases anxiety level of general public in China: involved mechanisms and influencing factors.新型冠状病毒肺炎疫情爆发增加了中国公众的焦虑水平:涉及的机制和影响因素。
J Affect Disord. 2020 Nov 1;276:446-452. doi: 10.1016/j.jad.2020.07.085. Epub 2020 Jul 20.