Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark.
Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark.
Clin Microbiol Infect. 2019 Oct;25(10):1277-1285. doi: 10.1016/j.cmi.2019.04.028. Epub 2019 May 4.
Sample preparation for high-throughput sequencing (HTS) includes treatment with various laboratory components, potentially carrying viral nucleic acids, the extent of which has not been thoroughly investigated. Our aim was to systematically examine a diverse repertoire of laboratory components used to prepare samples for HTS in order to identify contaminating viral sequences.
A total of 322 samples of mainly human origin were analysed using eight protocols, applying a wide variety of laboratory components. Several samples (60% of human specimens) were processed using different protocols. In total, 712 sequencing libraries were investigated for viral sequence contamination.
Among sequences showing similarity to viruses, 493 were significantly associated with the use of laboratory components. Each of these viral sequences had sporadic appearance, only being identified in a subset of the samples treated with the linked laboratory component, and some were not identified in the non-template control samples. Remarkably, more than 65% of all viral sequences identified were within viral clusters linked to the use of laboratory components.
We show that high prevalence of contaminating viral sequences can be expected in HTS-based virome data and provide an extensive list of novel contaminating viral sequences that can be used for evaluation of viral findings in future virome and metagenome studies. Moreover, we show that detection can be problematic due to stochastic appearance and limited non-template controls. Although the exact origin of these viral sequences requires further research, our results support laboratory-component-linked viral sequence contamination of both biological and synthetic origin.
高通量测序(HTS)的样品制备包括用各种实验室组件处理,这些组件可能带有病毒核酸,但尚未对其进行彻底研究。我们的目的是系统地检查用于 HTS 样品制备的各种实验室组件,以鉴定污染的病毒序列。
使用 8 种方案分析了总共 322 个主要来自人类的样本,应用了各种各样的实验室组件。一些样本(60%的人类标本)采用不同的方案进行处理。总共对 712 个测序文库进行了病毒序列污染的检测。
在显示与病毒相似的序列中,有 493 个与实验室组件的使用显著相关。这些病毒序列中的每一个都有零星的出现,仅在与相关实验室组件处理的部分样本中被识别,并且一些在非模板对照样本中未被识别。值得注意的是,鉴定出的所有病毒序列中,超过 65%的序列都属于与实验室组件使用相关的病毒簇。
我们表明,在基于 HTS 的病毒组数据中可以预期存在高污染的病毒序列,并提供了一份广泛的新的污染病毒序列列表,可用于未来病毒组和宏基因组研究中评估病毒发现。此外,我们表明,由于随机出现和有限的非模板对照,检测可能存在问题。尽管这些病毒序列的确切来源需要进一步研究,但我们的结果支持生物和合成来源的与实验室组件相关的病毒序列污染。