Camiolo Salvatore, Hughes Joseph, Baldanti Fausto, Furione Milena, Lilleri Daniele, Lombardi Giuseppina, Angelini Micol, Gerna Giuseppe, Zavattoni Maurizio, Davison Andrew J, Suárez Nicolás M
School of Infection and Immunity, MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK.
Department of Clinical, Surgical, Diagnostic and Pediatric Sciences, School of Infection and Immunity, University of Pavia, Pavia 27100, Italy.
Virus Evol. 2022 Dec 5;8(2):veac114. doi: 10.1093/ve/veac114. eCollection 2022.
Understanding the intrahost evolution of viral populations has implications in pathogenesis, diagnosis, and treatment and has recently made impressive advances from developments in high-throughput sequencing. However, the underlying analyses are very sensitive to sources of bias, error, and artefact in the data, and it is important that these are addressed adequately if robust conclusions are to be drawn. The key factors include (1) determining the number of viral strains present in the sample analysed; (2) monitoring the extent to which the data represent these strains and assessing the quality of these data; (3) dealing with the effects of cross-contamination; and (4) ensuring that the results are reproducible. We investigated these factors by generating sequence datasets, including biological and technical replicates, directly from clinical samples obtained from a small cohort of patients who had been infected congenitally with the herpesvirus human cytomegalovirus, with the aim of developing a strategy for identifying high-confidence intrahost variants. We found that such variants were few in number and typically present in low proportions and concluded that human cytomegalovirus exhibits a very low level of intrahost variability. In addition to clarifying the situation regarding human cytomegalovirus, our strategy has wider applicability to understanding the intrahost variability of other viruses.
了解病毒群体在宿主体内的进化对发病机制、诊断和治疗具有重要意义,并且最近通过高通量测序技术的发展取得了令人瞩目的进展。然而,基础分析对数据中的偏差、误差和假象来源非常敏感,如果要得出可靠的结论,充分解决这些问题很重要。关键因素包括:(1)确定所分析样本中存在的病毒株数量;(2)监测数据代表这些病毒株的程度并评估这些数据的质量;(3)处理交叉污染的影响;(4)确保结果具有可重复性。我们通过直接从一小群先天性感染人巨细胞病毒这一疱疹病毒的患者所获得的临床样本中生成序列数据集(包括生物学和技术重复样本)来研究这些因素,目的是制定一种识别高可信度宿主体内变异体的策略。我们发现此类变异体数量很少且通常所占比例很低,并得出结论:人巨细胞病毒在宿主体内的变异性非常低。除了阐明人巨细胞病毒的情况外,我们的策略在理解其他病毒的宿主体内变异性方面具有更广泛的适用性。