Keeble-Gagnère Gabriel, Pasam Raj, Forrest Kerrie L, Wong Debbie, Robinson Hannah, Godoy Jayfred, Rattey Allan, Moody David, Mullan Daniel, Walmsley Tresslyn, Daetwyler Hans D, Tibbits Josquin, Hayden Matthew J
Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia.
InterGrain, Bibra Lake, WA, Australia.
Front Plant Sci. 2021 Dec 22;12:756877. doi: 10.3389/fpls.2021.756877. eCollection 2021.
Array-based single nucleotide polymorphism (SNP) genotyping platforms have low genotype error and missing data rates compared to genotyping-by-sequencing technologies. However, design decisions used to create array-based SNP genotyping assays for both research and breeding applications are critical to their success. We describe a novel approach applicable to any animal or plant species for the design of cost-effective imputation-enabled SNP genotyping arrays with broad utility and demonstrate its application through the development of the Illumina Infinium Wheat Barley 40K SNP array Version 1.0. We show that the approach delivers high quality and high resolution data for wheat and barley, including when samples are jointly hybridised. The new array aims to maximally capture haplotypic diversity in globally diverse wheat and barley germplasm while minimizing ascertainment bias. Comprising mostly biallelic markers that were designed to be species-specific and single-copy, the array permits highly accurate imputation in diverse germplasm to improve the statistical power of genome-wide association studies (GWAS) and genomic selection. The SNP content captures tetraploid wheat (A- and B-genome) and Coss. (D-genome) diversity and delineates synthetic and tetraploid wheat from other wheat, as well as tetraploid species and subgroups. The content includes SNP tagging key trait loci in wheat and barley, as well as direct connections to other genotyping platforms and legacy datasets. The utility of the array is enhanced through the web-based tool, (https://plantinformatics.io/) which enables the content of the array to be visualized and interrogated interactively in the context of numerous genetic and genomic resources to be connected more seamlessly to research and breeding. The array is available for use by the international wheat and barley community.
与基于测序的基因分型技术相比,基于芯片的单核苷酸多态性(SNP)基因分型平台具有较低的基因型错误率和数据缺失率。然而,用于为研究和育种应用创建基于芯片的SNP基因分型检测的设计决策对其成功至关重要。我们描述了一种适用于任何动植物物种的新颖方法,用于设计具有广泛实用性且经济高效的支持插补的SNP基因分型芯片,并通过开发Illumina Infinium小麦大麦40K SNP芯片版本1.0展示了其应用。我们表明,该方法可为小麦和大麦提供高质量和高分辨率的数据,包括样本联合杂交时的数据。新芯片旨在最大程度地捕获全球多样化小麦和大麦种质中的单倍型多样性,同时将确定偏差降至最低。该芯片主要由设计为物种特异性和单拷贝的双等位基因标记组成,可在不同种质中进行高度准确的插补,以提高全基因组关联研究(GWAS)和基因组选择的统计功效。SNP内容捕获了四倍体小麦(A和B基因组)和波斯小麦(D基因组)的多样性,并将人工合成小麦和四倍体小麦与其他小麦以及四倍体物种和亚组区分开来。其内容包括标记小麦和大麦关键性状位点的SNP,以及与其他基因分型平台和传统数据集的直接联系。通过基于网络的工具(https://plantinformatics.io/)增强了该芯片的实用性,该工具能够在众多遗传和基因组资源的背景下交互式地可视化和查询芯片内容,从而更无缝地连接到研究和育种。该芯片可供国际小麦和大麦研究群体使用。