• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习早期检测新冠病毒高风险变异株

Machine Learning Early Detection of SARS-CoV-2 High-Risk Variants.

作者信息

Li Lun, Li Cuiping, Li Na, Zou Dong, Zhao Wenming, Luo Hong, Xue Yongbiao, Zhang Zhang, Bao Yiming, Song Shuhui

机构信息

China National Center for Bioinformation, Beijing, 100101, China.

National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China.

出版信息

Adv Sci (Weinh). 2024 Dec;11(45):e2405058. doi: 10.1002/advs.202405058. Epub 2024 Oct 14.

DOI:10.1002/advs.202405058
PMID:39401400
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11615786/
Abstract

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has evolved many high-risk variants, resulting in repeated COVID-19 waves over the past years. Therefore, accurate early warning of high-risk variants is vital for epidemic prevention and control. However, detecting high-risk variants through experimental and epidemiological research is time-consuming and often lags behind the emergence and spread of these variants. In this study, HiRisk-Detector a machine learning algorithm based on haplotype network, is developed for computationally early detecting high-risk SARS-CoV-2 variants. Leveraging over 7.6 million high-quality and complete SARS-CoV-2 genomes and metadata, the effectiveness, robustness, and generalizability of HiRisk-Detector are validated. First, HiRisk-Detector is evaluated on actual empirical data, successfully detecting all 13 high-risk variants, preceding World Health Organization announcements by 27 days on average. Second, its robustness is tested by reducing sequencing intensity to one-fourth, noting only a minimal delay of 3.8 days, demonstrating its effectiveness. Third, HiRisk-Detector is applied to detect risks among SARS-CoV-2 Omicron variant sub-lineages, confirming its broad applicability and high ROC-AUC and PR-AUC performance. Overall, HiRisk-Detector features powerful capacity for early detection of high-risk variants, bearing great utility for any public emergency caused by infectious diseases or viruses.

摘要

严重急性呼吸综合征冠状病毒2(SARS-CoV-2)已经进化出许多高风险变异株,导致在过去几年中新冠疫情反复出现。因此,对高风险变异株进行准确的早期预警对于疫情防控至关重要。然而,通过实验和流行病学研究来检测高风险变异株既耗时,而且往往滞后于这些变异株的出现和传播。在本研究中,开发了基于单倍型网络的机器学习算法HiRisk-Detector,用于对高风险SARS-CoV-2变异株进行计算早期检测。利用超过760万个高质量和完整的SARS-CoV-2基因组及元数据,验证了HiRisk-Detector的有效性、稳健性和通用性。首先,在实际经验数据上对HiRisk-Detector进行评估,成功检测出所有13种高风险变异株,平均比世界卫生组织宣布提前27天。其次,通过将测序强度降低到四分之一来测试其稳健性,发现仅延迟3.8天,证明了其有效性。第三,将HiRisk-Detector应用于检测SARS-CoV-2奥密克戎变异株亚谱系中的风险,证实了其广泛的适用性以及较高的ROC-AUC和PR-AUC性能。总体而言,HiRisk-Detector具有强大的早期检测高风险变异株的能力,对于由传染病或病毒引起的任何公共紧急情况都具有很大的实用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/e433a78cf49f/ADVS-11-2405058-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/f6f5e5313244/ADVS-11-2405058-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/1cbac848782e/ADVS-11-2405058-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/68193a114af9/ADVS-11-2405058-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/b325136811f5/ADVS-11-2405058-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/0c7a54e10c14/ADVS-11-2405058-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/e433a78cf49f/ADVS-11-2405058-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/f6f5e5313244/ADVS-11-2405058-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/1cbac848782e/ADVS-11-2405058-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/68193a114af9/ADVS-11-2405058-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/b325136811f5/ADVS-11-2405058-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/0c7a54e10c14/ADVS-11-2405058-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf8/11615786/e433a78cf49f/ADVS-11-2405058-g006.jpg

相似文献

1
Machine Learning Early Detection of SARS-CoV-2 High-Risk Variants.机器学习早期检测新冠病毒高风险变异株
Adv Sci (Weinh). 2024 Dec;11(45):e2405058. doi: 10.1002/advs.202405058. Epub 2024 Oct 14.
2
Emergency SARS-CoV-2 Variants of Concern: Novel Multiplex Real-Time RT-PCR Assay for Rapid Detection and Surveillance.关注的紧急 SARS-CoV-2 变异株:用于快速检测和监测的新型多重实时 RT-PCR 检测方法。
Microbiol Spectr. 2022 Feb 23;10(1):e0251321. doi: 10.1128/spectrum.02513-21.
3
Sustained applicability of SARS-CoV-2 variants identification by Sanger Sequencing Strategy on emerging various SARS-CoV-2 Omicron variants in Hiroshima, Japan.日本广岛市基于 Sanger 测序策略对新型 SARS-CoV-2 奥密克戎变异株进行持续鉴定的适用性。
BMC Genomics. 2024 Nov 11;25(1):1063. doi: 10.1186/s12864-024-10973-0.
4
Phylogenomic early warning signals for SARS-CoV-2 epidemic waves.新冠病毒流行波的系统发生基因组预警信号。
EBioMedicine. 2024 Feb;100:104939. doi: 10.1016/j.ebiom.2023.104939. Epub 2024 Jan 8.
5
Prospective clinical performance of CoVarScan in identifying SARS-CoV-2 Omicron subvariants.CoVarScan在识别严重急性呼吸综合征冠状病毒2(SARS-CoV-2)奥密克戎亚变体方面的前瞻性临床性能。
Microbiol Spectr. 2025 Jan 7;13(1):e0138524. doi: 10.1128/spectrum.01385-24. Epub 2024 Dec 11.
6
A prospective, randomized, single-blinded, crossover trial to investigate the effect of a wearable device in addition to a daily symptom diary for the Remote Early Detection of SARS-CoV-2 infections (COVID-RED): a structured summary of a study protocol for a randomized controlled trial.一项前瞻性、随机、单盲、交叉试验,旨在研究可穿戴设备对 SARS-CoV-2 感染(COVID-RED)的远程早期检测的影响:一项随机对照试验研究方案的结构化总结。
Trials. 2021 Oct 11;22(1):694. doi: 10.1186/s13063-021-05643-5.
7
Evolutionary and Phylogenetic Dynamics of SARS-CoV-2 Variants: A Genetic Comparative Study of Taiyuan and Wuhan Cities of China.SARS-CoV-2 变异株的进化与系统发育动态:中国太原市与武汉市的遗传比较研究。
Viruses. 2024 Jun 3;16(6):907. doi: 10.3390/v16060907.
8
A prospective, randomized, single-blinded, crossover trial to investigate the effect of a wearable device in addition to a daily symptom diary for the remote early detection of SARS-CoV-2 infections (COVID-RED): a structured summary of a study protocol for a randomized controlled trial.一项前瞻性、随机、单盲、交叉试验,旨在研究可穿戴设备对远程早期检测 SARS-CoV-2 感染(COVID-RED)的影响:一项随机对照试验研究方案的结构化总结。
Trials. 2021 Jun 22;22(1):412. doi: 10.1186/s13063-021-05241-5.
9
The Evolution and Biology of SARS-CoV-2 Variants.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)变体的进化与生物学
Cold Spring Harb Perspect Med. 2022 May 27;12(5):a041390. doi: 10.1101/cshperspect.a041390.
10
Rapid classification of SARS-CoV-2 variant strains using machine learning-based label-free SERS strategy.基于机器学习的无标记 SERS 策略快速分类 SARS-CoV-2 变异株。
Talanta. 2024 Jan 15;267:125080. doi: 10.1016/j.talanta.2023.125080. Epub 2023 Aug 17.

引用本文的文献

1
VirusWarn: A mutation-based early warning system to prioritize concerning SARS-CoV-2 and influenza virus variants from sequencing data.病毒预警:一种基于突变的早期预警系统,用于根据测序数据对值得关注的严重急性呼吸综合征冠状病毒2(SARS-CoV-2)和流感病毒变体进行优先级排序。
Comput Struct Biotechnol J. 2025 Mar 12;27:1081-1088. doi: 10.1016/j.csbj.2025.03.010. eCollection 2025.
2
RCoV19: A One-stop Hub for SARS-CoV-2 Genome Data Integration, Variant Monitoring, and Risk Pre-warning.RCoV19:一个用于严重急性呼吸综合征冠状病毒2基因组数据整合、变异监测和风险预警的一站式中心。
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):1066-1079. doi: 10.1016/j.gpb.2023.10.004. Epub 2023 Oct 26.

本文引用的文献

1
Using big sequencing data to identify chronic SARS-Coronavirus-2 infections.利用大测序数据识别慢性严重急性呼吸系统综合征冠状病毒 2 感染。
Nat Commun. 2024 Jan 20;15(1):648. doi: 10.1038/s41467-024-44803-4.
2
RCoV19: A One-stop Hub for SARS-CoV-2 Genome Data Integration, Variant Monitoring, and Risk Pre-warning.RCoV19:一个用于严重急性呼吸综合征冠状病毒2基因组数据整合、变异监测和风险预警的一站式中心。
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):1066-1079. doi: 10.1016/j.gpb.2023.10.004. Epub 2023 Oct 26.
3
McAN: a novel computational algorithm and platform for constructing and visualizing haplotype networks.
McAN:一种用于构建和可视化单倍型网络的新型计算算法和平台。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad174.
4
Early computational detection of potential high-risk SARS-CoV-2 variants.早期计算检测潜在的高风险 SARS-CoV-2 变体。
Comput Biol Med. 2023 Mar;155:106618. doi: 10.1016/j.compbiomed.2023.106618. Epub 2023 Feb 2.
5
Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm.通过一类动态异常检测算法预测新兴的 SARS-CoV-2 关注变异株。
BMJ Health Care Inform. 2022 Dec;29(1). doi: 10.1136/bmjhci-2022-100643.
6
Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness.分析 640 万例 SARS-CoV-2 基因组,鉴定与适应性相关的突变。
Science. 2022 Jun 17;376(6599):1327-1332. doi: 10.1126/science.abm1208. Epub 2022 May 24.
7
Deep learning based on biologically interpretable genome representation predicts two types of human adaptation of SARS-CoV-2 variants.基于具有生物学可解释性的基因组表示的深度学习预测了 SARS-CoV-2 变体的两种人类适应类型。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac036.
8
Predicting the mutational drivers of future SARS-CoV-2 variants of concern.预测未来引起关注的 SARS-CoV-2 变异株的突变驱动因素。
Sci Transl Med. 2022 Feb 23;14(633):eabk3445. doi: 10.1126/scitranslmed.abk3445.
9
Genomic Epidemiology of SARS-CoV-2 in Pakistan.巴基斯坦 SARS-CoV-2 的基因组流行病学研究。
Genomics Proteomics Bioinformatics. 2021 Oct;19(5):727-740. doi: 10.1016/j.gpb.2021.08.007. Epub 2021 Oct 23.
10
VarEPS: an evaluation and prewarning system of known and virtual variations of SARS-CoV-2 genomes.VarEPS:一种用于评估和预警 SARS-CoV-2 基因组已知和虚拟变异的系统。
Nucleic Acids Res. 2022 Jan 7;50(D1):D888-D897. doi: 10.1093/nar/gkab921.