Department of Laboratory Medicine, Karolinska Institutet, 141 86, Stockholm, Sweden.
Hopsworks AB, Medborgarplatsen 25, 118 72, Stockholm, Sweden.
Sci Rep. 2022 Jul 29;12(1):13058. doi: 10.1038/s41598-022-17318-5.
In the era of cervical cancer elimination, accurate and validated pipelines to detect human papillomavirus are essential to elucidate and understand HPV association with human cancers. We aimed to provide an open-source pipeline, "HPV-meta", to detect HPV transcripts in RNA sequencing data, including several steps to warn operators for possible viral contamination. The "HPV-meta" pipeline automatically performs several steps, starting with quality trimming, human genome filtering, HPV detection (blastx), cut-off settlement (10 reads and 690 bp coverage to make an HPV call) and finishing with fasta sequence generation for HPV positive samples. Fasta sequences can then be aligned to assess sequence diversity among HPV positive samples. All RNA sequencing files (n = 10,908) present in the cancer genome atlas (TCGA) were analyzed. "HPV-meta" identified 25 different HPV types being present in 488/10,904 specimens. Validation of results showed 99.98% agreement (10,902/10,904). Multiple alignment from fasta files warned about high sequence identity between several HPV 18 and 38 positive samples, whose contamination had previously been reported. The "HPV-meta" pipeline is a robust and validated pipeline that detects HPV in RNA sequencing data. Obtaining the fasta files enables contamination investigation, a non very rare occurrence in next generation sequencing.
在宫颈癌消除时代,准确和经过验证的 HPV 检测管道对于阐明和理解 HPV 与人类癌症的关联至关重要。我们旨在提供一个开源管道“HPV-meta”,用于检测 RNA 测序数据中的 HPV 转录本,其中包括几个步骤,以警告操作人员可能存在病毒污染。“HPV-meta”管道自动执行多个步骤,从质量修剪、人类基因组过滤、HPV 检测(blastx)、截止值设置(10 个读数和 690 bp 覆盖度以进行 HPV 检测)开始,并为 HPV 阳性样本生成 fasta 序列结束。然后可以对 fasta 序列进行对齐,以评估 HPV 阳性样本之间的序列多样性。分析了癌症基因组图谱 (TCGA) 中存在的所有 RNA 测序文件(n=10908)。“HPV-meta”鉴定了 25 种不同的 HPV 类型,存在于 488/10904 个标本中。结果验证显示,99.98%的结果一致(10902/10904)。来自 fasta 文件的多重比对警告了多个 HPV 18 和 38 阳性样本之间存在高序列同一性,这些样本的污染先前已有报道。“HPV-meta”管道是一种强大且经过验证的管道,可检测 RNA 测序数据中的 HPV。获得 fasta 文件可以进行污染调查,这在下一代测序中并非非常罕见。