Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Institute of Forensic Medicine, West China School of Basic Science & Forensic Medicine, Sichuan University, Chengdu, China.
Forensic Sci Int Genet. 2020 Sep;48:102331. doi: 10.1016/j.fsigen.2020.102331. Epub 2020 Jun 20.
High throughput sequencing of multiplexed PCR amplicons with Short Tandem Repeats (STRs) requires software solutions that sort the information and allow a comprehensive overview of the results without overwhelming the data analyst with details. Here, we present an updated version (2.0) of the STR analysis tool STRinNGS. It is freely available as a Docker image or zip file ready for downloading. STRinNGS predicts genotypes using criteria for read depth, noise, flanking region lengths, mismatches in the flanking regions, locus balance, and heterozygote balance. Warning flags highlight suspicious genotypes as well as suspicious sequences that are not identified as either noise or alleles in the result table used for the manual analysis. STRinNGS analyses both the STR and the flanking regions, and names the alleles according to the STRidER guidelines as well as an in-house nomenclature that also include variants in the flanking regions. Furthermore, STRinNGS generates files with analysed data in a format that may be uploaded directly to the STRidER database. We re-analysed 627 sample files from eight different MiSeq FGx runs with STRinNGS v2.0. The samples were previously typed with the ForenSeq™ Signature Prep Kit and analysed with STRinNGS v1.0 and the Universal Analysis Software. Apart from three poorly performing loci with large heterozygote imbalances (Penta E and D22S1045) or frequent single nucleotide errors (DYS461), only 58 genotype calls (0.2 %) had to be manually corrected and only 14 genotype calls were discordant with the previous analyses. The discordant calls were primarily caused by manual oversights and in every case, the STRinNGS v2.0 analysis was correct.
高通量测序多重 PCR 扩增子中的短串联重复序列 (STRs) 需要软件解决方案来对信息进行分类,并允许全面概述结果,而不会让数据分析师被细节淹没。在这里,我们介绍 STR 分析工具 STRinNGS 的更新版本(2.0)。它可以作为 Docker 镜像或 zip 文件免费获得,随时可供下载。STRinNGS 使用读取深度、噪声、侧翼区域长度、侧翼区域中的错配、基因座平衡和杂合子平衡的标准来预测基因型。警告标志突出显示可疑基因型以及可疑序列,这些序列在用于手动分析的结果表中未被识别为噪声或等位基因。STRinNGS 分析 STR 和侧翼区域,并根据 STRidER 指南以及内部命名法命名等位基因,该命名法还包括侧翼区域中的变体。此外,STRinNGS 以可直接上传到 STRidER 数据库的格式生成带有分析数据的文件。我们使用 STRinNGS v2.0 重新分析了来自 8 个不同 MiSeq FGx 运行的 627 个样本文件。这些样本之前使用 ForenSeq™ Signature Prep Kit 进行了分型,并使用 STRinNGS v1.0 和通用分析软件进行了分析。除了 3 个表现不佳的基因座(Penta E 和 D22S1045)存在较大的杂合子不平衡或频繁的单核苷酸错误(DYS461)外,只有 58 个基因型(0.2%)需要手动更正,只有 14 个基因型与之前的分析不一致。不一致的调用主要是由于人为疏忽,在每种情况下,STRinNGS v2.0 分析都是正确的。