Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway.
Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway.
Forensic Sci Int Genet. 2024 Jul;71:103057. doi: 10.1016/j.fsigen.2024.103057. Epub 2024 May 3.
In recent years, probabilistic genotyping software has been adapted for the analysis of massively parallel sequencing (MPS) forensic data. Likelihood ratios (LR) are based on allele frequencies selected from populations of interest. This study provides an outline of sequence-based (SB) allele frequencies for autosomal short tandem repeats (aSTRs) and identity single nucleotide polymorphisms (iSNPs) in 371 individuals from Southern Norway. 27 aSTRs and 94 iSNPs were previously analysed with the ForenSeq™ DNA Signature Prep Kit (Verogen). The number of alleles with frequencies less than 0.05 for sequenced-based alleles was 4.6 times higher than for length-based alleles. Consistent with previous studies, it was observed that sequence-based data (both with and without flanks) exhibited higher allele diversity compared to length-based (LB) data; random match probabilities were lower for SB alleles confirming their advantage to discriminate between individuals. Two alleles in markers D22S1045 and Penta D were observed with SNPs in the 3´ flanking region, which have not been reported before. Also, a novel SNP with a minor allele frequency (MAF) of 0.001, was found in marker TH01. The impact of the sample size on minor allele frequency (MAF) values was studied in 88 iSNPs from Southern Norway (n = 371). The findings were then compared to a larger Norwegian population dataset (n = 15,769). The results showed that the smaller Southern Norway dataset provided similar results, and it was a representative sample. Population structure was analyzed for regions within Southern Norway; F estimates for aSTR and iSNPs did not indicate any genetic structure. Finally, we investigated the genetic differences between Southern Norway and two other populations: Northern Norway and Denmark. Allele frequencies between these populations were compared, and we found no significant frequency differences (p-values > 0.0001). We also calculated the pairwise F values per marker and comparisons between Southern and Northern Norway showed small differences. In contrast, the comparisons between Southern Norway and Denmark showed higher F values for some markers, possibly driven by distinct alleles that were present in only one of the populations. In summary, we propose that allele frequencies from each population considered in this study could be used interchangeably to calculate genotype probabilities.
近年来,概率基因分型软件已被用于分析大规模平行测序(MPS)法医数据。似然比(LR)基于从感兴趣的人群中选择的等位基因频率。本研究提供了来自挪威南部 371 个人的基于序列(SB)的常染色体短串联重复(aSTR)和身份单核苷酸多态性(iSNP)等位基因频率的概述。使用 ForenSeq™ DNA Signature Prep Kit(Verogen)对 27 个 aSTR 和 94 个 iSNP 进行了分析。基于序列的等位基因中频率小于 0.05 的等位基因数量是基于长度的等位基因的 4.6 倍。与之前的研究一致,观察到与基于长度(LB)数据相比,基于序列的数据(带侧翼和不带侧翼)表现出更高的等位基因多样性;随机匹配概率对于 SB 等位基因较低,证实了它们在区分个体方面的优势。在标记 D22S1045 和 Penta D 中观察到两个等位基因,它们在 3'侧翼区域存在 SNP,这是以前没有报道过的。此外,在标记 TH01 中发现了一个新的 SNP,其次要等位基因频率(MAF)为 0.001。研究了 88 个来自挪威南部(n = 371)的 iSNP 中样本量对次要等位基因频率(MAF)值的影响。然后将结果与更大的挪威人群数据集(n = 15769)进行比较。结果表明,较小的挪威南部数据集提供了相似的结果,并且是一个有代表性的样本。分析了挪威南部地区的种群结构;aSTR 和 iSNP 的 F 估计值表明没有遗传结构。最后,我们研究了挪威南部与另外两个种群(挪威北部和丹麦)之间的遗传差异。比较了这些种群之间的等位基因频率,发现没有显著的频率差异(p 值>0.0001)。我们还计算了每个标记的成对 F 值,挪威南部和北部之间的比较显示出较小的差异。相比之下,挪威南部和丹麦之间的比较显示出一些标记的 F 值较高,这可能是由仅存在于一个种群中的独特等位基因驱动的。总之,我们建议可以相互替换本研究中考虑的每个种群的等位基因频率来计算基因型概率。