Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA.
Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA.
Cells. 2019 Aug 5;8(8):835. doi: 10.3390/cells8080835.
Single nucleotide polymorphisms (SNPs) are usually the most frequent genomic variants. Directly pedigree-phased multi-SNP haplotypes provide a more accurate view of polymorphic population genomic structure than individual SNPs. The former are, therefore, more useful in genetic correlation with subject phenotype. We describe a new pedigree-based methodology for generating non-ambiguous SNP haplotypes for genetic study. SNP data for haplotype analysis were extracted from a larger Type 1 Diabetes Genetics Consortium SNP dataset based on minor allele frequency variation and redundancy, coverage rate (the frequency of phased haplotypes in which each SNP is defined) and genomic location. Redundant SNPs were eliminated, overall haplotype polymorphism was optimized and the number of undefined haplotypes was minimized. These edited SNP haplotypes from a region containing (DR) and (DQ) both correlated well with HLA-typed DR,DQ haplotypes and differentiated HLA-DR,DQ fragments shared by three pairs of previously identified megabase-length conserved extended haplotypes. In a pedigree-based genetic association assay for type 1 diabetes, edited SNP haplotypes and HLA-typed HLA-DR,DQ haplotypes from the same families generated essentially identical qualitative and quantitative results. Therefore, this edited SNP haplotype method is useful for both genomic polymorphic architecture and genetic association evaluation using SNP markers with diverse minor allele frequencies.
单核苷酸多态性(SNPs)通常是最常见的基因组变异。直接家系相分的多 SNP 单倍型比单个 SNP 更能准确地反映多态性群体基因组结构。因此,前者在与受试者表型的遗传相关性方面更有用。我们描述了一种新的基于家系的方法,用于生成用于遗传研究的非歧义 SNP 单倍型。根据次要等位基因频率变化和冗余、覆盖率(每个 SNP 定义的相单倍型的频率)和基因组位置,从更大的 1 型糖尿病遗传学联合会(Type 1 Diabetes Genetics Consortium,T1DGC)SNP 数据集提取 SNP 数据用于单倍型分析。消除了冗余的 SNP,优化了整体单倍型多态性,并最小化了未定义的单倍型数量。来自包含 (DR) 和 (DQ) 的区域的这些编辑后的 SNP 单倍型与 HLA 分型的 DR、DQ 单倍型高度相关,并区分了由三个先前鉴定的兆碱基长度的保守扩展单倍型共享的 HLA-DR、DQ 片段。在基于家系的 1 型糖尿病遗传关联分析中,来自同一家庭的编辑后的 SNP 单倍型和 HLA 分型的 HLA-DR、DQ 单倍型产生了基本相同的定性和定量结果。因此,这种编辑后的 SNP 单倍型方法对于使用具有不同次要等位基因频率的 SNP 标记评估基因组多态性结构和遗传关联都非常有用。