Suppr超能文献

VirHunter:一种基于深度学习的方法,用于在植物测序数据中检测新型RNA病毒。

VirHunter: A Deep Learning-Based Method for Detection of Novel RNA Viruses in Plant Sequencing Data.

作者信息

Sukhorukov Grigorii, Khalili Maryam, Gascuel Olivier, Candresse Thierry, Marais-Colombel Armelle, Nikolski Macha

机构信息

CNRS, IBGC, UMR 5095, Université de Bordeaux, Bordeaux, France.

Bordeaux Bioinformatics Center, Université de Bordeaux, Bordeaux, France.

出版信息

Front Bioinform. 2022 May 13;2:867111. doi: 10.3389/fbinf.2022.867111. eCollection 2022.

Abstract

High-throughput sequencing has provided the capacity of broad virus detection for both known and unknown viruses in a variety of hosts and habitats. It has been successfully applied for novel virus discovery in many agricultural crops, leading to the current drive to apply this technology routinely for plant health diagnostics. For this, efficient and precise methods for sequencing-based virus detection and discovery are essential. However, both existing alignment-based methods relying on reference databases and even more recent machine learning approaches are not efficient enough in detecting unknown viruses in RNAseq datasets of plant viromes. We present VirHunter, a deep learning convolutional neural network approach, to detect novel and known viruses in assemblies of sequencing datasets. While our method is generally applicable to a variety of viruses, here, we trained and evaluated it specifically for RNA viruses by reinforcing the coding sequences' content in the training dataset. Trained on the NCBI plant viruses data for three different host species (peach, grapevine, and sugar beet), VirHunter outperformed the state-of-the-art method, DeepVirFinder, for the detection of novel viruses, both in the synthetic leave-out setting and on the 12 newly acquired RNAseq datasets. Compared with the traditional tBLASTx approach, VirHunter has consistently exhibited better results in the majority of leave-out experiments. In conclusion, we have shown that VirHunter can be used to streamline the analyses of plant HTS-acquired viromes and is particularly well suited for the detection of novel viral contigs, in RNAseq datasets.

摘要

高通量测序能够在多种宿主和生境中对已知和未知病毒进行广泛检测。它已成功应用于多种农作物中新型病毒的发现,推动了目前将该技术常规应用于植物健康诊断的趋势。为此,基于测序的高效、精确的病毒检测和发现方法至关重要。然而,现有的基于比对的依赖参考数据库的方法以及更新的机器学习方法,在检测植物病毒组RNA测序数据集中的未知病毒时效率都不够高。我们提出了VirHunter,一种深度学习卷积神经网络方法,用于在测序数据集的组装中检测新型和已知病毒。虽然我们的方法通常适用于多种病毒,但在这里,我们通过增强训练数据集中编码序列的含量,专门针对RNA病毒对其进行了训练和评估。在针对三种不同宿主物种(桃子、葡萄和甜菜)的NCBI植物病毒数据上进行训练后,在合成遗漏设置和12个新获得的RNA测序数据集上,VirHunter在检测新型病毒方面优于现有最先进的方法DeepVirFinder。与传统的tBLASTx方法相比,在大多数遗漏实验中,VirHunter始终表现出更好的结果。总之,我们已经表明,VirHunter可用于简化植物高通量测序获得的病毒组分析,并且特别适合于检测RNA测序数据集中的新型病毒重叠群。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/92e6/9580956/c33802b533c6/fbinf-02-867111-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验