U.S. Department of Agriculture, Agricultural Research Service, Foreign Animal Disease Research Unit, Plum Island Animal Disease Center (PIADC), P.O. Box 848, Greenport, NY 11944, USA.
U.S. Department of Agriculture, Agricultural Research Service, Foreign Animal Disease Research Unit, National Bio and Agro-Defense Facility, Manhattan, KS 66502, USA.
Viruses. 2024 Aug 13;16(8):1293. doi: 10.3390/v16081293.
Obtaining a complete good-quality sequence and annotation for the long double-stranded DNA genome of the African swine fever virus (ASFV) from next-generation sequencing (NGS) technology has proven difficult, despite the increasing availability of reference genome sequences and the increasing affordability of NGS. A gap analysis conducted by the global African swine fever research alliance (GARA) partners identified that a standardized, automatic pipeline for NGS analysis was urgently needed, particularly for new outbreak strains. Whilst there are several diagnostic and research labs worldwide that collect isolates of the ASFV from outbreaks, many do not have the capability to analyze, annotate, and format NGS data from outbreaks for submission to NCBI, and some publicly available ASFV genomes have missing or incorrect annotations. We developed an automated, standardized pipeline for the analysis of NGS reads that directly provides users with assemblies and annotations formatted for their submission to NCBI. This pipeline is freely available on GitHub and has been tested through the GARA partners by examining two previously sequenced ASFV genomes; this study also aimed to assess the accuracy and limitations of two strategies present within the pipeline: reference-based (Illumina reads) and de novo assembly (Illumina and Nanopore reads) strategies.
从下一代测序(NGS)技术中获得完整的高质量的非洲猪瘟病毒(ASFV)长双链 DNA 基因组序列和注释一直很困难,尽管参考基因组序列的可用性越来越高,NGS 的成本也越来越低。全球非洲猪瘟研究联盟(GARA)合作伙伴进行的差距分析表明,特别对于新爆发的菌株,非常需要一个标准化的、自动的 NGS 分析管道。虽然全世界有几个诊断和研究实验室从疫情中收集 ASFV 的分离株,但许多实验室没有能力分析、注释和格式化疫情 NGS 数据以提交给 NCBI,并且一些公开的 ASFV 基因组存在缺失或错误的注释。我们开发了一种用于 NGS 读取分析的自动化、标准化管道,该管道直接为用户提供已格式化的组装和注释,以供提交给 NCBI。该管道可在 GitHub 上免费获取,并且已经通过 GARA 合作伙伴进行了测试,他们通过检查两个先前测序的 ASFV 基因组来进行测试;本研究还旨在评估管道内存在的两种策略的准确性和局限性:基于参考(Illumina 读取)和从头组装(Illumina 和 Nanopore 读取)策略。