Nuffield Department of Medicine, John Radcliffe Hospital, University of Oxford, Oxford, UK.
Virol J. 2013 Nov 13;10:335. doi: 10.1186/1743-422X-10-335.
Norovirus is the commonest cause of epidemic gastroenteritis among people of all ages. Outbreaks frequently occur in hospitals and the community, costing the UK an estimated £110 m per annum. An evolutionary explanation for periodic increases in norovirus cases, despite some host-specific post immunity is currently limited to the identification of obvious recombinants. Our understanding could be significantly enhanced by full length genome sequences for large numbers of intensively sampled viruses, which would also assist control and vaccine design. Our objective is to develop rapid, high-throughput, end-to-end methods yielding complete norovirus genome sequences. We apply these methods to recent English outbreaks, placing them in the wider context of the international norovirus epidemic of winter 2012.
Norovirus sequences were generated from 28 unique clinical samples by Illumina RNA sequencing (RNA-Seq) of total faecal RNA. A range of de novo sequence assemblers were attempted. The best assembler was identified by validation against three replicate samples and two norovirus qPCR negative samples, together with an additional 20 sequences determined by PCR and fractional capillary sequencing. Phylogenetic methods were used to reconstruct evolutionary relationships from the whole genome sequences.
Full length norovirus genomes were generated from 23/28 samples. 5/28 partial norovirus genomes were associated with low viral copy numbers. The de novo assembled sequences differed from sequences determined by capillary sequencing by <0.003%. Intra-host nucleotide sequence diversity was rare, but detectable by mapping short sequence reads onto its de novo assembled consensus. Genomes similar to the Sydney 2012 strain caused 78% (18/23) of cases, consistent with its previously documented association with the winter 2012 global outbreak. Interestingly, phylogenetic analysis and recombination detection analysis of the consensus sequences identified two related viruses as recombinants, containing sequences in prior circulation to Sydney 2012 in open reading frame (ORF) 2.
Our approach facilitates the rapid determination of complete norovirus genomes. This method provides high resolution of full norovirus genomes which, when coupled with detailed epidemiology, may improve the understanding of evolution and control of this important healthcare-associated pathogen.
诺如病毒是各年龄段人群中最常见的流行性肠胃炎病原体。医院和社区经常爆发此类病毒,给英国造成的年损失估计达 1.1 亿英镑。目前,对于诺如病毒周期性爆发的进化解释仅限于明显重组病毒的识别。如果能对大量强化采样病毒进行全长基因组测序,我们对该病毒的了解将会显著增强,这也将有助于控制和疫苗设计。我们的目标是开发快速、高通量、端到端的方法,获得完整的诺如病毒基因组序列。我们将这些方法应用于最近英国的疫情爆发,将其置于 2012 年冬季国际诺如病毒流行的更广泛背景下。
通过 Illumina RNA 测序(RNA-Seq)对总粪便 RNA 对 28 个独特的临床样本进行诺如病毒测序,生成诺如病毒序列。尝试了一系列从头开始的序列组装程序。通过对三个重复样本和两个诺如病毒 qPCR 阴性样本的验证,以及对另外 20 个通过 PCR 和分柱测序确定的序列进行验证,来确定最佳的组装程序。使用系统发育方法从全基因组序列重建进化关系。
从 23/28 个样本中生成了全长诺如病毒基因组。28 个样本中有 5 个样本的部分诺如病毒基因组与低病毒拷贝数有关。从头组装的序列与毛细管测序确定的序列差异<0.003%。宿主内核苷酸序列多样性很少,但通过将短序列读段映射到其从头组装的共识上可以检测到。与悉尼 2012 年毒株相似的基因组导致 78%(18/23)的病例,这与该毒株与 2012 年冬季全球疫情的先前记录一致。有趣的是,对共识序列的系统发育分析和重组检测分析鉴定出两个相关病毒为重组病毒,其开放阅读框(ORF)2 中包含了之前在悉尼 2012 年循环的序列。
我们的方法促进了完整诺如病毒基因组的快速测定。该方法提供了完整诺如病毒基因组的高分辨率,如果结合详细的流行病学数据,可能会提高对这种重要的与医疗保健相关病原体的进化和控制的理解。