DNA Fingerprinting Unit, Integrated High-Tech Complex, Forensic Science Laboratory, Bhopal, Madhya Pradesh, 462003, India.
DNA Division, State Forensic Science Laboratory, Jaipur, Rajasthan, 302016, India.
Sci Rep. 2021 Dec 1;11(1):23238. doi: 10.1038/s41598-021-02690-5.
Capillary electrophoresis-based analysis does not reflect the exact allele number variation at the STR loci due to the non-availability of the data on sequence variation in the repeat region and the SNPs in flanking regions. Herein, this study reports the length-based and sequence-based allelic data of 138 central Indian individuals at 31 autosomal STR loci by NGS. The sequence data at each allele was compared to the reference hg19 sequence. The length-based allelic results were found in concordance with the CE-based results. 20 out of 31 autosomal STR loci showed an increase in the number of alleles by the presence of sequence variation and/or SNPs in the flanking regions. The highest gain in the heterozygosity and allele numbers was observed in D5S2800, D1S1656, D16S539, D5S818, and vWA. rs25768 (A/G) at D5S818 was found to be the most frequent SNP in the studied population. Allele no. 15 of D3S1358, allele no. 19 of D2S1338, and allele no. 22 of D12S391 showed 5 isoalleles each with the same size and with different intervening sequences. Length-based determination of the alleles showed Penta E to be the most useful marker in the central Indian population among 31 STRs studied; however, sequence-based analysis advocated D2S1338 to be the most useful marker in terms of various forensic parameters. Population genetics analysis showed a shared genetic ancestry of the studied population with other Indian populations. This first-ever study to the best of our knowledge on sequence-based STR analysis in the central Indian population is expected to prove the use of NGS in forensic case-work and in forensic DNA laboratories.
基于毛细管电泳的分析无法反映 STR 基因座的确切等位基因数量变化,因为重复区域的序列变异和侧翼区域的 SNP 数据不可用。在此,本研究通过 NGS 报告了 138 名印度中部个体在 31 个常染色体 STR 基因座上的基于长度和基于序列的等位基因数据。每个等位基因的序列数据与参考 hg19 序列进行了比较。基于长度的等位基因结果与基于 CE 的结果一致。由于侧翼区域的序列变异和/或 SNP 的存在,31 个常染色体 STR 基因座中有 20 个显示出等位基因数量的增加。在 D5S2800、D1S1656、D16S539、D5S818 和 vWA 中观察到杂合度和等位基因数量的最大增益。在研究人群中发现 D5S818 中的 rs25768(A/G) 是最常见的 SNP。D3S1358 的等位基因 15、D2S1338 的等位基因 19 和 D12S391 的等位基因 22 各自具有相同大小但具有不同插入序列的 5 个同工等位基因。基于长度的等位基因确定显示,在研究的 31 个 STR 中,Penta E 是印度中部人群中最有用的标记;然而,基于序列的分析主张 D2S1338 是最有用的标记,因为它在各种法医参数方面。群体遗传学分析显示,研究人群与其他印度人群具有共同的遗传渊源。据我们所知,这是首次对印度中部人群进行基于序列的 STR 分析的研究,有望证明 NGS 在法医案件工作和法医 DNA 实验室中的应用。