• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

COVID-深度预测器:用于预测严重急性呼吸综合征冠状病毒2及其他致病病毒的循环神经网络

COVID-DeepPredictor: Recurrent Neural Network to Predict SARS-CoV-2 and Other Pathogenic Viruses.

作者信息

Saha Indrajit, Ghosh Nimisha, Maity Debasree, Seal Arjit, Plewczynski Dariusz

机构信息

Department of Computer Science and Engineering, National Institute of Technical Teachers' Training and Research, Kolkata, India.

Department of Computer Science and Information Technology, Institute of Technical Education and Research, Siksha 'O' Anusandhan (Deemed to Be University), Bhubaneswar, India.

出版信息

Front Genet. 2021 Feb 11;12:569120. doi: 10.3389/fgene.2021.569120. eCollection 2021.

DOI:10.3389/fgene.2021.569120
PMID:33643375
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7906283/
Abstract

The COVID-19 disease for Novel coronavirus (SARS-CoV-2) has turned out to be a global pandemic. The high transmission rate of this pathogenic virus demands an early prediction and proper identification for the subsequent treatment. However, polymorphic nature of this virus allows it to adapt and sustain in different kinds of environment which makes it difficult to predict. On the other hand, there are other pathogens like SARS-CoV-1, MERS-CoV, Ebola, Dengue, and Influenza as well, so that a predictor is highly required to distinguish them with the use of their genomic information. To mitigate this problem, in this work COVID-DeepPredictor is proposed on the framework of deep learning to identify an unknown sequence of these pathogens. COVID-DeepPredictor uses Long Short Term Memory as Recurrent Neural Network for the underlying prediction with an alignment-free technique. In this regard, -mer technique is applied to create Bag-of-Descriptors (BoDs) in order to generate Bag-of-Unique-Descriptors (BoUDs) as vocabulary and subsequently embedded representation is prepared for the given virus sequences. This predictor is not only validated for the dataset using -fold cross-validation but also for unseen test datasets of SARS-CoV-2 sequences and sequences from other viruses as well. To verify the efficacy of COVID-DeepPredictor, it has been compared with other state-of-the-art prediction techniques based on Linear Discriminant Analysis, Random Forests, and Gradient Boosting Method. COVID-DeepPredictor achieves 100% prediction accuracy on validation dataset while on test datasets, the accuracy ranges from 99.51 to 99.94%. It shows superior results over other prediction techniques as well. In addition to this, accuracy and runtime of COVID-DeepPredictor are considered simultaneously to determine the value of in -mer, a comparative study among values in -mer, Bag-of-Descriptors (BoDs), and Bag-of-Unique-Descriptors (BoUDs) and a comparison between COVID-DeepPredictor and Nucleotide BLAST have also been performed. The code, training, and test datasets used for COVID-DeepPredictor are available at .

摘要

新型冠状病毒(SARS-CoV-2)引发的COVID-19疾病已成为全球大流行。这种致病病毒的高传播率要求进行早期预测和准确识别,以便后续治疗。然而,这种病毒的多态性使其能够在不同环境中适应和生存,这使得预测变得困难。另一方面,还有其他病原体,如SARS-CoV-1、MERS-CoV、埃博拉病毒、登革热病毒和流感病毒等,因此迫切需要一种预测器,利用它们的基因组信息来区分它们。为缓解这一问题,本文基于深度学习框架提出了COVID-DeepPredictor,用于识别这些病原体的未知序列。COVID-DeepPredictor使用长短期记忆作为递归神经网络,采用无比对技术进行基础预测。在这方面,应用-mer技术创建描述符袋(BoDs),以生成唯一描述符袋(BoUDs)作为词汇表,随后为给定的病毒序列准备嵌入表示。该预测器不仅使用-fold交叉验证对数据集进行了验证,还对SARS-CoV-2序列的未见测试数据集以及其他病毒的序列进行了验证。为验证COVID-DeepPredictor的有效性,将其与基于线性判别分析、随机森林和梯度提升方法的其他先进预测技术进行了比较。COVID-DeepPredictor在验证数据集上实现了100%的预测准确率,而在测试数据集上,准确率范围为99.51%至99.94%。它也显示出优于其他预测技术的结果。除此之外,同时考虑了COVID-DeepPredictor的准确率和运行时间,以确定-mer中的值,还对-mer中的值、描述符袋(BoDs)和唯一描述符袋(BoUDs)进行了比较研究,并将COVID-DeepPredictor与核苷酸BLAST进行了比较。用于COVID-DeepPredictor的代码、训练和测试数据集可在获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/048d4ecfe648/fgene-12-569120-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/0aea84a365e7/fgene-12-569120-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/ec2462ef97ce/fgene-12-569120-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/16e80ffcb304/fgene-12-569120-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/048d4ecfe648/fgene-12-569120-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/0aea84a365e7/fgene-12-569120-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/ec2462ef97ce/fgene-12-569120-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/16e80ffcb304/fgene-12-569120-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72aa/7906283/048d4ecfe648/fgene-12-569120-g0004.jpg

相似文献

1
COVID-DeepPredictor: Recurrent Neural Network to Predict SARS-CoV-2 and Other Pathogenic Viruses.COVID-深度预测器:用于预测严重急性呼吸综合征冠状病毒2及其他致病病毒的循环神经网络
Front Genet. 2021 Feb 11;12:569120. doi: 10.3389/fgene.2021.569120. eCollection 2021.
2
Multi-Stage Temporal Convolution Network for COVID-19 Variant Classification.用于新冠病毒变异株分类的多阶段时间卷积网络
Diagnostics (Basel). 2022 Nov 9;12(11):2736. doi: 10.3390/diagnostics12112736.
3
Online Predictor Using Machine Learning to Predict Novel Coronavirus and Other Pathogenic Viruses.使用机器学习预测新型冠状病毒及其他致病病毒的在线预测工具
ACS Omega. 2022 Jun 28;7(27):23069-23074. doi: 10.1021/acsomega.2c00215. eCollection 2022 Jul 12.
4
Accurate and fast clade assignment via deep learning and frequency chaos game representation.通过深度学习和频率混沌游戏表示实现准确快速的进化枝分配。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giac119.
5
SARS-CoV-2 virus classification based on stacked sparse autoencoder.基于堆叠稀疏自动编码器的严重急性呼吸综合征冠状病毒2(SARS-CoV-2)病毒分类
Comput Struct Biotechnol J. 2023;21:284-298. doi: 10.1016/j.csbj.2022.12.007. Epub 2022 Dec 9.
6
Whole genome analysis of more than 10 000 SARS-CoV-2 virus unveils global genetic diversity and target region of NSP6.对超过 10000 株 SARS-CoV-2 病毒进行全基因组分析揭示了全球遗传多样性和 NSP6 的靶标区域。
Brief Bioinform. 2021 Mar 22;22(2):1106-1121. doi: 10.1093/bib/bbab025.
7
Cluster Analysis of Coronavirus Sequences using Computational Sequence Descriptors: With Applications to SARS, MERS and SARS-CoV-2 (CoVID-19).冠状病毒序列的计算序列描述聚类分析:在 SARS、MERS 和 SARS-CoV-2(CoVID-19)中的应用。
Curr Comput Aided Drug Des. 2021;17(7):936-945. doi: 10.2174/1573409917666210202092646.
8
Topological Analysis for Sequence Variability: Case Study on more than 2K SARS-CoV-2 sequences of COVID-19 infected 54 countries in comparison with SARS-CoV-1 and MERS-CoV.拓扑分析用于序列变异:以 54 个国家 2000 多个 COVID-19 感染 SARS-CoV-2 序列为例,与 SARS-CoV-1 和 MERS-CoV 进行比较。
Infect Genet Evol. 2021 Mar;88:104708. doi: 10.1016/j.meegid.2021.104708. Epub 2021 Jan 6.
9
A deep bidirectional recurrent neural network for identification of SARS-CoV-2 from viral genome sequences.一种用于从病毒基因组序列中识别 SARS-CoV-2 的深度双向循环神经网络。
Math Biosci Eng. 2021 Oct 15;18(6):8933-8950. doi: 10.3934/mbe.2021440.
10
COVID-19 prediction based on genome similarity of human SARS-CoV-2 and bat SARS-CoV-like coronavirus.基于人类严重急性呼吸综合征冠状病毒2(SARS-CoV-2)与蝙蝠类严重急性呼吸综合征冠状病毒(SARS-CoV)基因组相似性的2019冠状病毒病(COVID-19)预测
Comput Ind Eng. 2021 Nov;161:107666. doi: 10.1016/j.cie.2021.107666. Epub 2021 Sep 8.

引用本文的文献

1
Microbial Technologies Enhanced by Artificial Intelligence for Healthcare Applications.用于医疗保健应用的人工智能增强微生物技术。
Microb Biotechnol. 2025 Mar;18(3):e70131. doi: 10.1111/1751-7915.70131.
2
A hybrid deep learning approach for COVID-19 detection based on genomic image processing techniques.基于基因组图像处理技术的 COVID-19 检测的混合深度学习方法。
Sci Rep. 2023 Mar 10;13(1):4003. doi: 10.1038/s41598-023-30941-0.
3
SARS-CoV-2 Morphometry Analysis and Prediction of Real Virus Levels Based on Full Recurrent Neural Network Using TEM Images.

本文引用的文献

1
A survey of word embeddings for clinical text.临床文本词嵌入研究
J Biomed Inform. 2019;100S:100057. doi: 10.1016/j.yjbinx.2019.100057. Epub 2019 Oct 28.
2
Automated detection of COVID-19 cases using deep neural networks with X-ray images.使用 X 射线图像的深度学习神经网络自动检测 COVID-19 病例。
Comput Biol Med. 2020 Jun;121:103792. doi: 10.1016/j.compbiomed.2020.103792. Epub 2020 Apr 28.
3
Sex-specific clinical characteristics and prognosis of coronavirus disease-19 infection in Wuhan, China: A retrospective study of 168 severe patients.
基于 TEM 图像的全递归神经网络对 SARS-CoV-2 形态计量分析及真病毒水平预测
Viruses. 2022 Oct 28;14(11):2386. doi: 10.3390/v14112386.
4
Integrated COVID-19 Predictor: Differential expression analysis to reveal potential biomarkers and prediction of coronavirus using RNA-Seq profile data.整合 COVID-19 预测器:差异表达分析揭示潜在生物标志物和使用 RNA-Seq 谱数据预测冠状病毒
Comput Biol Med. 2022 Aug;147:105684. doi: 10.1016/j.compbiomed.2022.105684. Epub 2022 Jun 3.
5
Exploring the Lethality of Human-Adapted Coronavirus Through Alignment-Free Machine Learning Approaches Using Genomic Sequences.利用基因组序列通过无比对机器学习方法探索适应人类的冠状病毒的致死性。
Curr Genomics. 2021 Dec 31;22(8):583-595. doi: 10.2174/1389202923666211221110857.
6
Mapping Data to Deep Understanding: Making the Most of the Deluge of SARS-CoV-2 Genome Sequences.将数据映射到深入理解:充分利用 SARS-CoV-2 基因组序列的洪流。
mSystems. 2022 Apr 26;7(2):e0003522. doi: 10.1128/msystems.00035-22. Epub 2022 Mar 21.
7
Sequencing meets machine learning to fight emerging pathogens: A preview.测序与机器学习携手对抗新出现的病原体:预览
Patterns (N Y). 2022 Feb 11;3(2):100448. doi: 10.1016/j.patter.2022.100448.
8
Conserved molecular signatures in the spike protein provide evidence indicating the origin of SARS-CoV-2 and a Pangolin-CoV (MP789) by recombination(s) between specific lineages of Sarbecoviruses.刺突蛋白中保守的分子特征提供了证据,表明严重急性呼吸综合征冠状病毒2(SARS-CoV-2)和穿山甲冠状病毒(MP789)起源于Sarbecoviruses特定谱系之间的重组。
PeerJ. 2021 Nov 12;9:e12434. doi: 10.7717/peerj.12434. eCollection 2021.
性别特异性临床特征和新型冠状病毒感染的预后:中国武汉 168 例重症患者的回顾性研究。
PLoS Pathog. 2020 Apr 28;16(4):e1008520. doi: 10.1371/journal.ppat.1008520. eCollection 2020 Apr.
4
Deep-learning-based Prediction of Late Age-Related Macular Degeneration Progression.基于深度学习的晚期年龄相关性黄斑变性进展预测
Nat Mach Intell. 2020 Feb;2(2):141-150. doi: 10.1038/s42256-020-0154-9. Epub 2020 Feb 14.
5
Potential inhibitors against 2019-nCoV coronavirus M protease from clinically approved medicines.来自临床批准药物的针对2019新型冠状病毒M蛋白酶的潜在抑制剂。
J Genet Genomics. 2020 Feb 20;47(2):119-121. doi: 10.1016/j.jgg.2020.02.001. Epub 2020 Feb 13.
6
Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses.SARS-CoV-2 及其他 B 属β冠状病毒的细胞进入和受体使用功能评估。
Nat Microbiol. 2020 Apr;5(4):562-569. doi: 10.1038/s41564-020-0688-y. Epub 2020 Feb 24.
7
A pneumonia outbreak associated with a new coronavirus of probable bat origin.一种新型冠状病毒引发的肺炎疫情,该病毒可能来源于蝙蝠。
Nature. 2020 Mar;579(7798):270-273. doi: 10.1038/s41586-020-2012-7. Epub 2020 Feb 3.
8
Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding.新冠病毒的基因组特征和流行病学:对病毒起源和受体结合的影响。
Lancet. 2020 Feb 22;395(10224):565-574. doi: 10.1016/S0140-6736(20)30251-8. Epub 2020 Jan 30.
9
Full-genome evolutionary analysis of the novel corona virus (2019-nCoV) rejects the hypothesis of emergence as a result of a recent recombination event.对新型冠状病毒(2019-nCoV)的全基因组进化分析否定了其是近期重组事件导致出现的假说。
Infect Genet Evol. 2020 Apr;79:104212. doi: 10.1016/j.meegid.2020.104212. Epub 2020 Jan 29.
10
Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus.新型冠状病毒受体识别:基于 SARS 冠状病毒长达十年结构研究的分析。
J Virol. 2020 Mar 17;94(7). doi: 10.1128/JVI.00127-20.