Classification and specific primer design for accurate detection of SARS-CoV-2 using deep learning.

Affiliations

Division of Pharmacology, Utrecht Institute for Pharmaceutical Sciences, Faculty of Science, Utrecht University, Universiteitsweg 99, 3584 CG, Utrecht, The Netherlands.

UMR 518 MIA-Paris, INRAE, c/o 113 rue Nationale, 75103, Paris, France.

Publication information

Sci Rep. 2021 Jan 13;11(1):947. doi: 10.1038/s41598-020-80363-5.

DOI:10.1038/s41598-020-80363-5

PMID:33441822

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7806918/

Abstract

In this paper, deep learning is coupled with explainable artificial intelligence techniques for the discovery of representative genomic sequences in SARS-CoV-2. A convolutional neural network classifier is first trained on 553 sequences from the National Genomics Data Center repository, separating the genome of different virus strains from the Coronavirus family with 98.73% accuracy. The network's behavior is then analyzed, to discover sequences used by the model to identify SARS-CoV-2, ultimately uncovering sequences exclusive to it. The discovered sequences are validated on samples from the National Center for Biotechnology Information and Global Initiative on Sharing All Influenza Data repositories, and are proven to be able to separate SARS-CoV-2 from different virus strains with near-perfect accuracy. Next, one of the sequences is selected to generate a primer set, and tested against other state-of-the-art primer sets, obtaining competitive results. Finally, the primer is synthesized and tested on patient samples (n = 6 previously tested positive), delivering a sensitivity similar to routine diagnostic methods, and 100% specificity. The proposed methodology has a substantial added value over existing methods, as it is able to both automatically identify promising primer sets for a virus from a limited amount of data, and deliver effective results in a minimal amount of time. Considering the possibility of future pandemics, these characteristics are invaluable to promptly create specific detection methods for diagnostics.
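The idea of a sequence being "exclusive" to SARS-CoV-2 can be illustrated with a minimal sketch. This is not the paper's method, which derives candidates from a trained convolutional neural network; it is a plain set-based stand-in that only shows what the validation step checks: a candidate subsequence occurs in every target genome and in none of the others. The function names `kmers` and `exclusive_kmers` and the toy sequences are hypothetical, chosen for illustration only.

```python
from typing import Iterable, Set


def kmers(genome: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of a genome string."""
    genome = genome.upper()
    return {genome[i:i + k] for i in range(len(genome) - k + 1)}


def exclusive_kmers(targets: Iterable[str], others: Iterable[str], k: int) -> Set[str]:
    """Return k-mers found in every target genome and in no other genome.

    Assumption: genomes are plain nucleotide strings; real data would
    also need handling of ambiguous bases and reverse complements.
    """
    target_sets = [kmers(g, k) for g in targets]
    if not target_sets:
        return set()
    # k-mers shared by all target genomes
    shared = set.intersection(*target_sets)
    # k-mers seen in any non-target genome
    seen_elsewhere: Set[str] = set()
    for g in others:
        seen_elsewhere |= kmers(g, k)
    return shared - seen_elsewhere


# Toy sequences, invented for illustration (not real viral genomes).
toy_targets = ["ACGTTGCA", "TTACGTTG"]
toy_others = ["ACGTAAAA", "GGGGTTGC"]
candidates = exclusive_kmers(toy_targets, toy_others, k=5)
print(sorted(candidates))
```

A set-based check like this scales poorly to thousands of full genomes, but it makes explicit the criterion the abstract describes: presence in the target virus and absence elsewhere.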

Figure 1 (image): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2711/7806918/2abacc7dd105/41598_2020_80363_Fig1_HTML.jpg
