Department of Horticulture, Cornell University, New York State Agricultural Experiment Station, Geneva, New York 14456, USA.
BMC Genomics. 2013 Jan 16;14:2. doi: 10.1186/1471-2164-14-2.
Rapid development of highly saturated genetic maps aids molecular breeding, which can accelerate gain per breeding cycle in woody perennial plants such as Rubus idaeus (red raspberry). Recently, robust genotyping methods based on high-throughput sequencing were developed, which provide high marker density, but result in some genotype errors and a large number of missing genotype values. Imputation can reduce the number of missing values and can correct genotyping errors, but current methods of imputation require a reference genome and thus are not an option for most species.
Genotyping by Sequencing (GBS) was used to produce highly saturated maps for a R. idaeus pseudo-testcross progeny. While low coverage and high variance in sequencing resulted in a large number of missing values for some individuals, a novel method of imputation based on maximum likelihood marker ordering from initial marker segregation overcame the challenge of missing values, and made map construction computationally tractable. The two resulting parental maps contained 4521 and 2391 molecular markers spanning 462.7 and 376.6 cM respectively over seven linkage groups. Detection of precise genomic regions with segregation distortion was possible because of map saturation. Microsatellites (SSRs) linked these results to published maps for cross-validation and map comparison.
GBS together with genome-independent imputation provides a rapid method for genetic map construction in any pseudo-testcross progeny. Our method of imputation estimates the correct genotype call of missing values and corrects genotyping errors that lead to inflated map size and reduced precision in marker placement. Comparison of SSRs to published R. idaeus maps showed that the linkage maps constructed with GBS and our method of imputation were robust, and marker positioning reliable. The high marker density allowed identification of genomic regions with segregation distortion in R. idaeus, which may help to identify deleterious alleles that are the basis of inbreeding depression in the species.
高度饱和的遗传图谱的快速发展有助于分子育种,这可以加速木本多年生植物如悬钩子属(覆盆子)的每一个育种周期的增益。最近,基于高通量测序的强大基因分型方法已经开发出来,这些方法提供了高标记密度,但会导致一些基因型错误和大量缺失的基因型值。插补可以减少缺失值的数量,并可以纠正基因分型错误,但目前的插补方法需要参考基因组,因此对于大多数物种来说不是一个选择。
测序基因分型(GBS)用于产生悬钩子属假测交后代的高度饱和图谱。虽然低覆盖度和测序的高方差导致一些个体的大量缺失值,但一种基于初始标记分离的最大似然标记排序的新的插补方法克服了缺失值的挑战,并使图谱构建在计算上可行。两个亲本图谱分别包含 4521 个和 2391 个分子标记,分别跨越 7 个连锁群,长度为 462.7 和 376.6cM。由于图谱饱和,可以检测到具有分离失真的精确基因组区域。微卫星(SSR)将这些结果与发表的图谱联系起来进行交叉验证和图谱比较。
GBS 与独立于基因组的插补相结合,为任何假测交后代的遗传图谱构建提供了一种快速方法。我们的缺失值插补方法估计了正确的基因型调用,并纠正了导致图谱大小膨胀和标记位置精度降低的基因分型错误。SSR 与发表的悬钩子属图谱的比较表明,用 GBS 和我们的插补方法构建的连锁图谱是稳健的,标记定位是可靠的。高标记密度允许识别悬钩子属中分离失真的基因组区域,这有助于识别可能导致该物种自交衰退的有害等位基因。