Alper Chester A, Larsen Charles E
Program in Cellular and Molecular Medicine, Boston Children's Hospital, CLS_03, 3 Blackfan Circle, Boston, MA, 02115, USA.
Department of Pediatrics, Harvard Medical School, 25 Shattuck Street, Boston, MA, 02115, USA.
Methods Mol Biol. 2017;1551:113-127. doi: 10.1007/978-1-4939-6750-6_6.
A haplotype is a string of nucleotides or alleles at nearby loci on one chromosome, usually inherited as a unit. Within the major histocompatibility complex (MHC) region on human chromosome 6p, independent population studies of multiple families have identified conserved extended haplotypes (CEHs) that segregate as long stretches (≥1 megabase) of essentially identical DNA sequence at relatively high (≥0.5 %) population frequency ("genetic fixity"). CEHs were first identified through segregation analysis in the early 1980s. In European Caucasian populations, the most frequent 30 CEHs account for at least one-third of all MHC haplotypes. These CEHs provide all of the known individual MHC susceptibility and protective genetic markers within those populations for several complex genetic diseases. Haplotypes are rigorously determined directly by sequencing single chromosomes or by Mendelian segregation analysis using families with informative genotypes. Four parental haplotypes are assigned unambiguously using genotypes from the two parents and from two of their haploidentical (to each other) children. However, the most common current technique to phase haplotypes is probabilistic statistical imputation, using unrelated subjects. Such probabilistic techniques have failed to detect CEHs and are thus of questionable value in identifying long-range haplotype structure and, consequently, genetic structure-function relationships. Finally, with haplotypes rigorously defined, association studies can determine frequencies of alleles among unrelated patient haplotypes vs. those among only unaffected family members (i.e., control alleles/haplotypes). Such studies reduce, as much as possible, the confounding effects of population stratification common to all genetic studies.
单倍型是指一条染色体上相邻基因座处的一串核苷酸或等位基因,通常作为一个单位遗传。在人类6号染色体短臂上的主要组织相容性复合体(MHC)区域内,对多个家庭进行的独立群体研究已经确定了保守的扩展单倍型(CEH),这些单倍型以相对较高(≥0.5%)的群体频率(“遗传固定性”)作为基本相同的DNA序列的长片段(≥1兆碱基)进行分离。CEH最早是在20世纪80年代初通过分离分析确定的。在欧洲白种人群体中,最常见的30种CEH占所有MHC单倍型的至少三分之一。这些CEH为这些群体中几种复杂遗传疾病提供了所有已知的个体MHC易感性和保护性遗传标记。单倍型可以通过对单条染色体进行测序或使用具有信息丰富基因型的家庭进行孟德尔分离分析来严格确定。使用来自父母双方及其两个单倍型相同(彼此之间)的孩子的基因型,可以明确地确定四种亲本单倍型。然而,目前最常用的单倍型分型技术是概率统计推断,使用不相关的受试者。这种概率技术未能检测到CEH,因此在识别长程单倍型结构以及由此确定遗传结构-功能关系方面的价值值得怀疑。最后,在严格定义单倍型后,关联研究可以确定不相关患者单倍型中等位基因的频率与仅未受影响家庭成员(即对照等位基因/单倍型)中等位基因的频率。此类研究尽可能减少了所有遗传研究中常见的群体分层的混杂效应。