Yuan Chaohui, Chu Charles C, Yan Xiao-Jie, Bagnara Davide, Chiorazzi Nicholas, MacCarthy Thomas
Department of Applied Mathematics and Statistics, Stony Brook University, NY, United States of America.
The Feinstein Institute for Medical Research, Northwell Health, Manhasset, NY, United States of America.
PLoS One. 2017 Jan 26;12(1):e0167602. doi: 10.1371/journal.pone.0167602. eCollection 2017.
The targeting of mutations by Activation-Induced Deaminase (AID) is a key step in generating antibody diversity at the Immunoglobulin (Ig) loci but is also implicated in B-cell malignancies such as chronic lymphocytic leukemia (CLL). AID has previously been shown to preferentially deaminate WRC (W = A/T, R = A/G) hotspots. WGCW sites, which contain an overlapping WRC hotspot on both DNA strands, mutate at much higher frequency than single hotspots. Human Ig heavy chain (IGHV) genes differ in terms of WGCW numbers, ranging from 4 for IGHV3-4803 to as many as 12 in IGHV1-6901. An absence of V-region mutations in CLL patients ("IGHV unmutated", or U-CLL) is associated with a poorer prognosis compared to "IGHV mutated" (M-CLL) patients. The reasons for this difference are still unclear, but it has been noted that particular IGHV genes associate with U-CLL vs M-CLL. For example, patients with IGHV1-69 clones tend to be U-CLL with a poor prognosis, whereas patients with IGHV3-30 tend to be M-CLL and have a better prognosis. Another distinctive feature of CLL is that ~30% of (mostly poor prognosis) patients can be classified into "stereotyped" subsets, each defined by HCDR3 similarity, suggesting selection, possibly for a self-antigen. We analyzed >1000 IGHV genes from CLL patients and found a highly significant statistical relationship between the number of WGCW hotspots in the germline V-region and the observed mutation frequency in patients. However, paradoxically, this correlation was inverse, with V-regions with more WGCW hotspots being less likely to be mutated, i.e., more likely to be U-CLL. The number of WGCW hotspots in particular, are more strongly correlated with mutation frequency than either non-overlapping (WRC) hotspots or more general models of mutability derived from somatic hypermutation data. Furthermore, this correlation is not observed in sequences from the B cell repertoires of normal individuals and those with autoimmune diseases.
激活诱导脱氨酶(AID)对突变的靶向作用是在免疫球蛋白(Ig)基因座产生抗体多样性的关键步骤,但也与慢性淋巴细胞白血病(CLL)等B细胞恶性肿瘤有关。先前已证明AID优先使WRC(W = A/T,R = A/G)热点发生脱氨基作用。WGCW位点在两条DNA链上都含有重叠的WRC热点,其突变频率比单个热点高得多。人类免疫球蛋白重链(IGHV)基因在WGCW数量方面存在差异,从IGHV3-4803的4个到IGHV1-6901的多达12个不等。与“IGHV突变型”(M-CLL)患者相比,CLL患者中缺乏V区突变(“IGHV未突变型”,或U-CLL)与较差的预后相关。这种差异的原因尚不清楚,但已经注意到特定的IGHV基因与U-CLL和M-CLL相关。例如,具有IGHV1-69克隆的患者往往是预后较差的U-CLL,而具有IGHV3-30的患者往往是M-CLL且预后较好。CLL的另一个显著特征是,约30%(大多数预后较差)的患者可被归类为“定型”亚组,每个亚组由重链互补决定区3(HCDR3)相似性定义,这表明可能存在针对自身抗原的选择。我们分析了来自CLL患者的1000多个IGHV基因,发现种系V区中WGCW热点的数量与患者中观察到的突变频率之间存在高度显著的统计关系。然而,矛盾的是,这种相关性是相反的,WGCW热点较多的V区发生突变的可能性较小,即更可能是U-CLL。特别是WGCW热点的数量与突变频率的相关性比非重叠(WRC)热点或从体细胞超突变数据得出的更一般的突变模型更强。此外,在正常个体和自身免疫性疾病患者的B细胞库序列中未观察到这种相关性。