• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

COVID-19-CT-CXR:一个可免费获取的、基于生物医学文献的关于COVID-19的弱标注胸部X光和CT图像集。

COVID-19-CT-CXR: A Freely Accessible and Weakly Labeled Chest X-Ray and CT Image Collection on COVID-19 From Biomedical Literature.

作者信息

Peng Yifan, Tang Yuxing, Lee Sungwon, Zhu Yingying, Summers Ronald M, Lu Zhiyong

机构信息

NCBI/NLM/NIH and Department of Population Health Sciences, Weill Cornell Medicine, New York, NY 10065 USA.

Imaging Biomarkers and Computer-Aided Diagnosis Laboratory, Radiology and Imaging Sciences Department, National Institutes of Health (NIH) Clinical Center, Bethesda, MD 20892 USA.

出版信息

IEEE Trans Big Data. 2021 Mar 1;7(1):3-12. doi: 10.1109/tbdata.2020.3035935. Epub 2020 Nov 4.

DOI:10.1109/tbdata.2020.3035935
PMID:33997112
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8117951/
Abstract

The latest threat to global health is the COVID-19 outbreak. Although there exist large datasets of chest X-rays (CXR) and computed tomography (CT) scans, few COVID-19 image collections are currently available due to patient privacy. At the same time, there is a rapid growth of COVID-19-relevant articles in the biomedical literature, including those that report findings on radiographs. Here, we present COVID-19-CT-CXR, a public database of COVID-19 CXR and CT images, which are automatically extracted from COVID-19-relevant articles from the PubMed Central Open Access (PMC-OA) Subset. We extracted figures, associated captions, and relevant figure descriptions in the article and separated compound figures into subfigures. Because a large portion of figures in COVID-19 articles are not CXR or CT, we designed a deep-learning model to distinguish them from other figure types and to classify them accordingly. The final database includes 1,327 CT and 263 CXR images (as of May 9, 2020) with their relevant text. To demonstrate the utility of COVID-19-CT-CXR, we conducted four case studies. (1) We show that COVID-19-CT-CXR, when used as additional training data, is able to contribute to improved deep-learning (DL) performance for the classification of COVID-19 and non-COVID-19 CT. (2) We collected CT images of influenza, another common infectious respiratory illness that may present similarly to COVID-19, and fine-tuned a baseline deep neural network to distinguish a diagnosis of COVID-19, influenza, or normal or other types of diseases on CT. (3) We fine-tuned an unsupervised one-class classifier from non-COVID-19 CXR and performed anomaly detection to detect COVID-19 CXR. (4) From text-mined captions and figure descriptions, we compared 15 clinical symptoms and 20 clinical findings of COVID-19 versus those of influenza to demonstrate the disease differences in the scientific publications. Our database is unique, as the figures are retrieved along with relevant text with fine-grained descriptions, and it can be extended easily in the future. We believe that our work is complementary to existing resources and hope that it will contribute to medical image analysis of the COVID-19 pandemic. The dataset, code, and DL models are publicly available at https://github.com/ncbi-nlp/COVID-19-CT-CXR.

摘要

全球健康面临的最新威胁是新型冠状病毒肺炎(COVID-19)疫情。尽管存在大量胸部X光(CXR)和计算机断层扫描(CT)数据集,但由于患者隐私问题,目前可用的COVID-19图像集很少。与此同时,生物医学文献中与COVID-19相关的文章数量迅速增长,包括那些报告X光片检查结果的文章。在此,我们展示了COVID-19-CT-CXR,这是一个COVID-19 CXR和CT图像的公共数据库,这些图像是从美国国立医学图书馆开放获取(PMC-OA)子集中与COVID-19相关的文章中自动提取的。我们提取了文章中的图表、相关标题和相关的图表描述,并将复合图分成子图。由于COVID-19文章中的大部分图表不是CXR或CT,我们设计了一个深度学习模型来将它们与其他图表类型区分开来,并进行相应分类。最终数据库包括1327张CT图像和263张CXR图像(截至2020年5月日)及其相关文本。为了证明COVID-19-CT-CXR的实用性,我们进行了四个案例研究。(1)我们表明,COVID-19-CT-CXR用作额外的训练数据时,能够有助于提高深度学习(DL)对COVID-19和非COVID-19 CT分类的性能。(2)我们收集了流感的CT图像,流感是另一种常见的传染性呼吸道疾病,其表现可能与COVID-19相似,并对一个基线深度神经网络进行微调,以区分COVID-19、流感或正常或其他类型疾病的CT诊断。(3)我们从非COVID-19 CXR中微调了一个无监督单类分类器,并进行异常检测以检测COVID-19 CXR。(4)从文本挖掘的标题和图表描述中,我们比较了COVID-19与流感的15种临床症状和20种临床检查结果,以证明科学出版物中的疾病差异。我们的数据库是独一无二的,因为图表是与带有细粒度描述的相关文本一起检索的,并且将来可以轻松扩展。我们相信我们的工作是对现有资源的补充,并希望它将有助于COVID-19疫情的医学图像分析。该数据集、代码和DL模型可在https://github.com/ncbi-nlp/COVID-19-CT-CXR上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/8089e54d502a/peng7-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/df683df05f4d/peng1-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/29f1d601e10c/peng2-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/84b2b52f6746/peng3-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/bdcbe40b1cb5/peng4-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/bd21146b3ab1/peng5-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/68a7ffe7b45f/peng6-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/8089e54d502a/peng7-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/df683df05f4d/peng1-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/29f1d601e10c/peng2-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/84b2b52f6746/peng3-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/bdcbe40b1cb5/peng4-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/bd21146b3ab1/peng5-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/68a7ffe7b45f/peng6-3035935.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eae1/8769023/8089e54d502a/peng7-3035935.jpg

相似文献

1
COVID-19-CT-CXR: A Freely Accessible and Weakly Labeled Chest X-Ray and CT Image Collection on COVID-19 From Biomedical Literature.COVID-19-CT-CXR:一个可免费获取的、基于生物医学文献的关于COVID-19的弱标注胸部X光和CT图像集。
IEEE Trans Big Data. 2021 Mar 1;7(1):3-12. doi: 10.1109/tbdata.2020.3035935. Epub 2020 Nov 4.
2
COVID-19-CT-CXR: a freely accessible and weakly labeled chest X-ray and CT image collection on COVID-19 from biomedical literature.COVID-19-CT-CXR:一个可免费获取的、基于生物医学文献的关于COVID-19的弱标注胸部X光和CT图像集。
ArXiv. 2020 Oct 22:arXiv:2006.06177v2.
3
Enhancing thoracic disease detection using chest X-rays from PubMed Central Open Access.利用 PubMed Central 开放获取中的胸部 X 光片增强胸部疾病检测。
Comput Biol Med. 2023 Jun;159:106962. doi: 10.1016/j.compbiomed.2023.106962. Epub 2023 Apr 20.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Deep learning-based meta-classifier approach for COVID-19 classification using CT scan and chest X-ray images.基于深度学习的元分类器方法,用于使用CT扫描和胸部X光图像对新冠肺炎进行分类。
Multimed Syst. 2022;28(4):1401-1415. doi: 10.1007/s00530-021-00826-1. Epub 2021 Jul 6.
6
Classification of COVID-19 chest X-Ray and CT images using a type of dynamic CNN modification method.利用一种动态卷积神经网络改进方法对 COVID-19 胸部 X 射线和 CT 图像进行分类。
Comput Biol Med. 2021 Jul;134:104425. doi: 10.1016/j.compbiomed.2021.104425. Epub 2021 Apr 29.
7
COVID-19 detection in CT and CXR images using deep learning models.使用深度学习模型进行 CT 和 CXR 图像中的 COVID-19 检测。
Biogerontology. 2022 Feb;23(1):65-84. doi: 10.1007/s10522-021-09946-7. Epub 2022 Jan 22.
8
Pneumonia Classification Using Deep Learning from Chest X-ray Images During COVID-19.新冠疫情期间基于胸部X光图像利用深度学习进行肺炎分类
Cognit Comput. 2021 Jan 4:1-13. doi: 10.1007/s12559-020-09787-5.
9
Chest X-ray image phase features for improved diagnosis of COVID-19 using convolutional neural network.基于卷积神经网络的胸部 X 射线图像相位特征提高 COVID-19 诊断性能
Int J Comput Assist Radiol Surg. 2021 Feb;16(2):197-206. doi: 10.1007/s11548-020-02305-w. Epub 2021 Jan 9.
10
Deep Learning-Based Classification of Chest Diseases Using X-rays, CT Scans, and Cough Sound Images.基于深度学习的胸部疾病分类:使用X光、CT扫描和咳嗽声图像
Diagnostics (Basel). 2023 Aug 26;13(17):2772. doi: 10.3390/diagnostics13172772.

引用本文的文献

1
A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis.一种用于零样本临床诊断的多模态、多领域、多语言医学基础模型。
NPJ Digit Med. 2025 Feb 6;8(1):86. doi: 10.1038/s41746-024-01339-7.
2
A medical multimodal large language model for future pandemics.用于应对未来大流行的医学多模态大语言模型。
NPJ Digit Med. 2023 Dec 2;6(1):226. doi: 10.1038/s41746-023-00952-2.
3
A Systematic Review on Deep Structured Learning for COVID-19 Screening Using Chest CT from 2020 to 2022.2020年至2022年基于胸部CT的COVID-19筛查深度结构化学习系统综述

本文引用的文献

1
A deep learning algorithm using CT images to screen for Corona virus disease (COVID-19).利用 CT 图像进行冠状病毒病(COVID-19)筛查的深度学习算法。
Eur Radiol. 2021 Aug;31(8):6096-6104. doi: 10.1007/s00330-021-07715-1. Epub 2021 Feb 24.
2
Deep learning-based model for detecting 2019 novel coronavirus pneumonia on high-resolution computed tomography.基于深度学习的高分辨率计算机断层扫描 2019 年新型冠状病毒肺炎检测模型。
Sci Rep. 2020 Nov 5;10(1):19196. doi: 10.1038/s41598-020-76282-0.
3
A Deep Learning System to Screen Novel Coronavirus Disease 2019 Pneumonia.
Healthcare (Basel). 2023 Aug 24;11(17):2388. doi: 10.3390/healthcare11172388.
4
Enhancing biomedical search interfaces with images.利用图像增强生物医学搜索界面。
Bioinform Adv. 2023 Jul 17;3(1):vbad095. doi: 10.1093/bioadv/vbad095. eCollection 2023.
5
Computer-aided methods for combating Covid-19 in prevention, detection, and service provision approaches.用于在预防、检测和服务提供方法中抗击新冠疫情的计算机辅助方法。
Neural Comput Appl. 2023;35(20):14739-14778. doi: 10.1007/s00521-023-08612-y. Epub 2023 May 5.
6
Enhancing thoracic disease detection using chest X-rays from PubMed Central Open Access.利用 PubMed Central 开放获取中的胸部 X 光片增强胸部疾病检测。
Comput Biol Med. 2023 Jun;159:106962. doi: 10.1016/j.compbiomed.2023.106962. Epub 2023 Apr 20.
7
A dataset of COVID-19 x-ray chest images.一个新冠肺炎胸部X光图像数据集。
Data Brief. 2023 Apr;47:109000. doi: 10.1016/j.dib.2023.109000. Epub 2023 Feb 18.
8
Optimal Ensemble learning model for COVID-19 detection using chest X-ray images.用于使用胸部X光图像检测新冠肺炎的最优集成学习模型。
Biomed Signal Process Control. 2023 Mar;81:104392. doi: 10.1016/j.bspc.2022.104392. Epub 2022 Nov 21.
9
A comprehensive review on variants of SARS-CoVs-2: Challenges, solutions and open issues.关于严重急性呼吸综合征冠状病毒2(SARS-CoV-2)变体的全面综述:挑战、解决方案及未解决问题
Comput Commun. 2023 Jan 1;197:34-51. doi: 10.1016/j.comcom.2022.10.013. Epub 2022 Oct 26.
10
Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement.深度学习模型在 COVID-19 胸部 X 射线分类中的应用:使用特征解缠预防捷径学习。
PLoS One. 2022 Oct 6;17(10):e0274098. doi: 10.1371/journal.pone.0274098. eCollection 2022.
一种用于筛查2019冠状病毒病肺炎的深度学习系统。
Engineering (Beijing). 2020 Oct;6(10):1122-1129. doi: 10.1016/j.eng.2020.04.010. Epub 2020 Jun 27.
4
Automated abnormality classification of chest radiographs using deep convolutional neural networks.使用深度卷积神经网络对胸部X光片进行自动异常分类。
NPJ Digit Med. 2020 May 14;3:70. doi: 10.1038/s41746-020-0273-z. eCollection 2020.
5
Artificial intelligence-enabled rapid diagnosis of patients with COVID-19.人工智能助力 COVID-19 患者快速诊断。
Nat Med. 2020 Aug;26(8):1224-1228. doi: 10.1038/s41591-020-0931-3. Epub 2020 May 19.
6
Clinically Applicable AI System for Accurate Diagnosis, Quantitative Measurements, and Prognosis of COVID-19 Pneumonia Using Computed Tomography.利用计算机断层扫描技术对 COVID-19 肺炎进行准确诊断、定量测量和预后的临床适用人工智能系统。
Cell. 2020 Jun 11;181(6):1423-1433.e11. doi: 10.1016/j.cell.2020.04.045. Epub 2020 May 4.
7
Using a diagnostic model based on routine laboratory tests to distinguish patients infected with SARS-CoV-2 from those infected with influenza virus.利用基于常规实验室检测的诊断模型来区分感染 SARS-CoV-2 的患者与感染流感病毒的患者。
Int J Infect Dis. 2020 Jun;95:436-440. doi: 10.1016/j.ijid.2020.04.078. Epub 2020 May 1.
8
Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation, and Diagnosis for COVID-19.COVID-19 成像数据采集、分割和诊断中人工智能技术的综述。
IEEE Rev Biomed Eng. 2021;14:4-15. doi: 10.1109/RBME.2020.2987975. Epub 2021 Jan 22.
9
Keep up with the latest coronavirus research.跟上冠状病毒的最新研究进展。
Nature. 2020 Mar;579(7798):193. doi: 10.1038/d41586-020-00694-1.
10
Clinical Characteristics of Coronavirus Disease 2019 in China.《中国 2019 年冠状病毒病临床特征》
N Engl J Med. 2020 Apr 30;382(18):1708-1720. doi: 10.1056/NEJMoa2002032. Epub 2020 Feb 28.