Computer Science and Engineering, Michigan State University, East Lansing, 48824, USA.
Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou, 510006, China.
BMC Bioinformatics. 2019 Jun 4;20(1):305. doi: 10.1186/s12859-019-2878-2.
Strain-level RNA virus characterization is essential for developing prevention and treatment strategies. Viral metagenomic data, which can contain sequences of both known and novel viruses, provide new opportunities for characterizing RNA viruses. Although there are a number of pipelines for analyzing viruses in metagenomic data, they have different limitations. First, viruses that lack closely related reference genomes cannot be detected with high sensitivity. Second, strain-level analysis is usually missing.
In this study, we developed a hybrid pipeline named TAR-VIR that reconstructs viral strains without relying on complete or high-quality reference genomes. It is optimized for identifying RNA viruses from metagenomic data by combining an effective read classification method and our in-house strain-level de novo assembly tool. TAR-VIR was tested on both simulated and real viral metagenomic data sets. The results demonstrated that TAR-VIR competes favorably with other tested tools.
TAR-VIR can be used standalone for viral strain reconstruction from metagenomic data. Or, its read recruiting stage can be used with other de novo assembly tools for superior viral functional and taxonomic analyses. The source code and the documentation of TAR-VIR are available at https://github.com/chjiao/TAR-VIR .
对病毒株水平的 RNA 病毒进行特征分析对于开发预防和治疗策略至关重要。病毒宏基因组数据可以包含已知和新型病毒的序列,为 RNA 病毒的特征分析提供了新的机会。尽管有许多用于分析宏基因组数据中病毒的方法,但它们都存在不同的局限性。首先,对于那些缺乏密切相关参考基因组的病毒,无法进行高灵敏度的检测。其次,通常缺少病毒株水平的分析。
在本研究中,我们开发了一种名为 TAR-VIR 的混合方法,该方法不需要完整或高质量的参考基因组即可重建病毒株。它通过结合有效的读段分类方法和我们内部的病毒株水平从头组装工具,针对从宏基因组数据中识别 RNA 病毒进行了优化。TAR-VIR 在模拟和真实的病毒宏基因组数据集上进行了测试。结果表明,TAR-VIR 与其他测试工具相比具有竞争力。
TAR-VIR 可单独用于从宏基因组数据中重建病毒株。或者,其读段招募阶段可以与其他从头组装工具结合使用,以进行更优越的病毒功能和分类学分析。TAR-VIR 的源代码和文档可在 https://github.com/chjiao/TAR-VIR 上获得。