Brown Pat J
Department of Plant Sciences, University of California, Davis, California, USA.
Plant Genome. 2023 Jun;16(2):e20324. doi: 10.1002/tpg2.20324. Epub 2023 Apr 14.
Sequencing-based genotyping of heterozygous diploids requires sufficient depth to accurately call heterozygous genotypes. In interspecific hybrids, alignment of reads to both parental genomes simultaneously can generate haploid data, potentially eliminating the problem of heterozygosity. Two populations of interspecific hybrid rootstocks of walnut (Juglans) and pistachio (Pistacia) were genotyped using alignment to the maternal genome, paternal genome, and dual alignment to both genomes simultaneously. Downsampling was used to examine concordance of imputed genotype calls as a function of sequencing depth. Dual alignment resulted in datasets essentially free of heterozygous genotypes, simplifying the identification and removal of cross-contaminated samples. Concordance between full and downsampled genotype calls was always highest after dual alignment. Nearly all single nucleotide polymorphisms (SNPs) in dual alignment datasets were shared with the corresponding single-parent datasets, but 60%-90% of single-parent SNPs were private to that dataset. Private SNPs in single-parent datasets had higher rates of heterozygosity, lower levels of concordance, and were enriched in fixed differences between parental genomes ("homeo-SNPs") compared to shared SNPs in the same dataset. In multi-parental walnut hybrids, the paternal-aligned dataset was ineffective at resolving population structure in the maternal parent. Overall, the dual alignment strategy effectively produced phased, haploid data, increasing data quality and reducing cost.
基于测序的杂合二倍体基因分型需要足够的深度来准确地调用杂合基因型。在种间杂种中,同时将读取序列与双亲本基因组进行比对可以生成单倍体数据,从而可能消除杂合性问题。使用基于母本、父本和双亲本同时比对的方法对核桃(Juglans)和开心果(Pistacia)种间杂种砧木的两个群体进行了基因分型。通过对测序深度的功能进行下采样,研究了推测基因型调用的一致性。双亲本同时比对导致数据集基本上没有杂合基因型,简化了交叉污染样本的识别和去除。全基因组和下采样基因型调用的一致性在双亲本同时比对后总是最高。几乎所有在双亲本同时比对数据集中的单核苷酸多态性(SNP)都与相应的单亲本数据集共享,但 60%-90%的单亲本 SNP 是该数据集独有的。单亲本数据集的私有 SNP 具有更高的杂合率、更低的一致性,并且在亲本基因组之间的固定差异(“同型 SNP”)中富集,而不是在同一数据集的共享 SNP 中。在多亲本核桃杂种中,父本比对数据集在解析母本的群体结构方面效果不佳。总的来说,双亲本同时比对策略有效地产生了相联的、单倍体数据,提高了数据质量并降低了成本。