Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia.
Department of Medical Biology, The University of Melbourne, Melbourne, Australia.
PLoS Genet. 2022 Jan 10;18(1):e1009604. doi: 10.1371/journal.pgen.1009604. eCollection 2022 Jan.
Short tandem repeats (STRs) are highly informative genetic markers that have been used extensively in population genetics analysis. They are an important source of genetic diversity and can also have functional impact. Despite the availability of bioinformatic methods that permit large-scale genome-wide genotyping of STRs from whole genome sequencing data, they have not previously been applied to sequencing data from large collections of malaria parasite field samples. Here, we have genotyped STRs using HipSTR in more than 3,000 Plasmodium falciparum and 174 Plasmodium vivax published whole-genome sequence data from samples collected across the globe. High levels of noise and variability in the resultant callset necessitated the development of a novel method for quality control of STR genotype calls. A set of high-quality STR loci (6,768 from P. falciparum and 3,496 from P. vivax) were used to study Plasmodium genetic diversity, population structures and genomic signatures of selection and these were compared to genome-wide single nucleotide polymorphism (SNP) genotyping data. In addition, the genome-wide information about genetic variation and other characteristics of STRs in P. falciparum and P. vivax have been available in an interactive web-based R Shiny application PlasmoSTR (https://github.com/bahlolab/PlasmoSTR).
短串联重复序列(STRs)是高度信息丰富的遗传标记,已广泛应用于群体遗传学分析。它们是遗传多样性的重要来源,也可能具有功能影响。尽管有生物信息学方法可以从全基因组测序数据中对 STR 进行大规模全基因组基因分型,但这些方法以前从未应用于来自大量疟疾寄生虫现场样本的测序数据。在这里,我们使用 HipSTR 在来自全球各地采集的超过 3000 个恶性疟原虫和 174 个间日疟原虫的已发表全基因组序列数据中对 STR 进行了基因分型。由于在所得的基因型调用集中存在高水平的噪声和可变性,因此需要开发一种新的 STR 基因型调用质量控制方法。一组高质量的 STR 基因座(恶性疟原虫 6768 个,间日疟原虫 3496 个)用于研究疟原虫遗传多样性、种群结构以及选择的基因组特征,并与全基因组单核苷酸多态性(SNP)基因分型数据进行了比较。此外,恶性疟原虫和间日疟原虫中 STR 的遗传变异和其他特征的全基因组信息已在交互式基于网络的 R Shiny 应用程序 PlasmoSTR(https://github.com/bahlolab/PlasmoSTR)中提供。