Department of Chemical and Environmental Engineering, Yale University, New Haven, CT 06520, USA.
Lett Appl Microbiol. 2011 Apr;52(4):386-92. doi: 10.1111/j.1472-765X.2011.03014.x. Epub 2011 Feb 21.
The aim of this study was to develop and demonstrate an approach for describing the diversity of human pathogenic viruses in an environmentally isolated viral metagenome.
In silico bioinformatic experiments were used to select an optimum annotation strategy for discovering human viruses in virome data sets and applied to annotate a class B biosolid virome. Results from the in silico study indicated that <1% errors in virus identification could be achieved when nucleotide-based search programs (BLASTn or tBLASTx), viral genome only databases and sequence reads >200 nt were considered. Within the 51,925 annotated sequences, 94 DNA and 19 RNA sequences were identified as human viruses. Virus diversity included environmentally transmitted agents such as parechovirus, coronavirus, adenovirus and aichi virus, as well as viruses associated with chronic human infections such as human herpes and hepatitis C viruses.
This study provided a bioinformatic approach for identifying pathogens in a virome data set and demonstrated the human virus diversity in a relevant environmental sample.
As the costs of next-generation sequencing decrease, the pathogen diversity described by virus metagenomes will provide an unbiased guide for subsequent cell culture and quantitative pathogen analyses and ensures that highly enriched and relevant pathogens are not neglected in exposure and risk assessments.
本研究旨在开发并展示一种方法,用于描述环境分离病毒宏基因组中人类致病病毒的多样性。
本研究采用计算机生物信息学实验,选择了一种最佳的注释策略,用于在病毒组数据集发现人类病毒,并将其应用于注释 B 级生物固体病毒组。计算机研究的结果表明,当考虑基于核苷酸的搜索程序(BLASTn 或 tBLASTx)、仅病毒基因组数据库和长度>200nt 的序列读段时,病毒鉴定的错误率可<1%。在注释的 51925 个序列中,有 94 个 DNA 和 19 个 RNA 序列被鉴定为人类病毒。病毒多样性包括环境传播的病原体,如细小病毒、冠状病毒、腺病毒和甲肝病毒,以及与慢性人类感染相关的病毒,如人类疱疹病毒和丙型肝炎病毒。
本研究提供了一种在病毒组数据集识别病原体的生物信息学方法,并展示了相关环境样本中的人类病毒多样性。
随着下一代测序成本的降低,病毒宏基因组所描述的病原体多样性将为后续的细胞培养和定量病原体分析提供一个无偏倚的指导,并确保在暴露和风险评估中不会忽略高度富集和相关的病原体。