Bellecave Pantxika, Recordon-Pinson Patricia, Fleury Hervé
1 CNRS-UMR 5234, Microbiologie Fondamentale et Pathogénicité, Université Bordeaux Segalen , Bordeaux, France .
2 Centre Hospitalier Universitaire de Bordeaux (CHU) , Laboratoire de Virologie, Bordeaux, France .
AIDS Res Hum Retroviruses. 2016 Jan;32(1):85-92. doi: 10.1089/AID.2015.0201. Epub 2015 Dec 14.
A major obstacle to using next generation sequencing (NGS) technology in clinical routine practice is reliable data analysis. Thousands of sequences need to be aligned and validated, to exclude sequencing artifacts and generate accurate results. We compared two analysis pipelines for Roche 454 ultradeep pyrosequencing (UDPS) raw data generated from HIV-1 clinical samples: a commercial and fully automated Web-based software NGS HIV-1 Module (SmartGene, Zug, Switzerland) vs. the Amplicon Variant Analyzer software (AVA, 454 Life Sciences; Roche). Results were also compared to those obtained with Sanger sequencing. HIV-1 reverse transcriptase and protease genes from 34 plasma samples were submitted to Sanger sequencing and GS Junior UDPS. Raw UDPS data (sff files) from all samples were analyzed with AVA 2.7 software plus manual review of the alignments and the fully automated SmartGene NGS HIV-1 Module prototype (SMG). Results obtained with both analysis pipelines showed good correlation (85.0%). Divergent results were mainly observed at homopolymer positions, such as K101, where the frame-aware alignment and error corrections of the automated approach were more efficient and more accurate, both in terms of detecting and quantifying drug resistance mutations. Our study shows that NGS data can easily be analyzed via a fully automated analysis pipeline, here the SmartGene NGS HIV-1 Module, thus minimizing the need for manual review of alignments by the user, otherwise essential to ensure accurate results. Such automated analysis pipelines may facilitate the adoption of NGS platforms in the routine clinical laboratory.
在临床常规实践中使用下一代测序(NGS)技术的一个主要障碍是可靠的数据分析。需要对数千条序列进行比对和验证,以排除测序假象并生成准确的结果。我们比较了两种针对从HIV-1临床样本生成的罗氏454超深度焦磷酸测序(UDPS)原始数据的分析流程:一种是商业化的基于网络的全自动软件NGS HIV-1模块(SmartGene,瑞士楚格),另一种是扩增子变异分析软件(AVA,454生命科学公司;罗氏)。结果还与通过桑格测序获得的结果进行了比较。将34份血浆样本中的HIV-1逆转录酶和蛋白酶基因进行桑格测序和GS Junior UDPS。所有样本的UDPS原始数据(sff文件)用AVA 2.7软件进行分析,并对比对结果进行人工审核,以及使用全自动的SmartGene NGS HIV-1模块原型(SMG)进行分析。两种分析流程获得的结果显示出良好的相关性(85.0%)。不同的结果主要出现在同聚物位点,如K101,在检测和定量耐药性突变方面,自动化方法的框架感知比对和纠错在效率和准确性上都更高。我们的研究表明,通过全自动分析流程(此处为SmartGene NGS HIV-1模块)可以轻松分析NGS数据,从而最大限度地减少用户对比对结果进行人工审核的需求,而人工审核原本对于确保准确结果是必不可少的。这种自动化分析流程可能会促进NGS平台在临床常规实验室中的应用。