Patel Anamika, Zhang Xing, Blumenthal Robert M, Cheng Xiaodong
From the Department of Biochemistry, Emory University School of Medicine, Atlanta, Georgia 30322.
the Department of Molecular and Cellular Oncology, University of Texas M.D. Anderson Cancer Center, Houston, Texas 77030, and.
J Biol Chem. 2017 Sep 29;292(39):15994-16002. doi: 10.1074/jbc.M117.805754. Epub 2017 Aug 11.
is the only mammalian gene that has been associated with speciation. The PR/SET domain 9 (PRDM9) protein is a major determinant of meiotic recombination hot spots and acts through sequence-specific DNA binding via its C2H2 zinc finger (ZF) tandem array, which is highly polymorphic within and between species. The most common human variant, PRDM9 allele A (PRDM9a), contains 13 fingers (ZF1-13). Allele C (PRDM9c) is the second-most common among African populations and differs from PRDM9a by an arginine-to-serine change (R764S) in ZF9 and by replacement of ZF11 with two other fingers, yielding 14 fingers in PRDM9c. Here we co-crystallized the six-finger fragment ZF8-13 of PRDM9c, in complex with an oligonucleotide representing a known PRDM9c-specific hot spot sequence, and compared the structure with that of a characterized PRDM9a-specific complex. There are three major differences. First, Ser in ZF9 allows PRDM9c to accommodate a variable base, whereas PRDM9a Arg recognizes a conserved guanine. Second, the two-finger expansion of ZF11 allows PRDM9c to recognize three-base-pair-longer sequences. A tryptophan in the additional ZF interacts with a conserved thymine methyl group. Third, an Arg-Asp dipeptide immediately preceding the ZF helix, conserved in two PRDM9a fingers and three PRDM9c fingers, permits adaptability to variations from a C:G base pair (G-Arg interaction) to a G:C base pair (C-Asp interaction). This Arg-Asp conformational switch allows identical ZF modules to recognize different sequences. Our findings illuminate the molecular mechanisms for flexible and conserved binding of human PRDM9 alleles to their cognate DNA sequences.
是唯一与物种形成相关的哺乳动物基因。PR/SET结构域9(PRDM9)蛋白是减数分裂重组热点的主要决定因素,通过其C2H2锌指(ZF)串联阵列进行序列特异性DNA结合,该阵列在物种内部和物种之间具有高度多态性。最常见的人类变体PRDM9等位基因A(PRDM9a)包含13个锌指(ZF1 - 13)。等位基因C(PRDM9c)在非洲人群中是第二常见的,与PRDM9a的不同之处在于ZF9中精氨酸到丝氨酸的变化(R764S)以及用另外两个锌指取代ZF11,使得PRDM9c中有14个锌指。在这里,我们将PRDM9c的六指片段ZF8 - 13与代表已知PRDM9c特异性热点序列的寡核苷酸共结晶,并将该结构与已表征的PRDM9a特异性复合物的结构进行比较。有三个主要差异。首先,ZF9中的丝氨酸使PRDM9c能够容纳可变碱基,而PRDM9a中的精氨酸识别保守的鸟嘌呤。其次,ZF11的双指扩展使PRDM9c能够识别长三个碱基对的序列。额外锌指中的一个色氨酸与保守的胸腺嘧啶甲基相互作用。第三,在ZF螺旋之前紧邻的精氨酸 - 天冬氨酸二肽,在两个PRDM9a锌指和三个PRDM9c锌指中保守,允许从C:G碱基对(G - 精氨酸相互作用)到G:C碱基对(C - 天冬氨酸相互作用)的变化具有适应性。这种精氨酸 - 天冬氨酸构象开关使相同的ZF模块能够识别不同的序列。我们的研究结果阐明了人类PRDM9等位基因与其同源DNA序列灵活且保守结合的分子机制。