Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06511; and.
Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06511; and
J Immunol. 2020 Dec 15;205(12):3468-3479. doi: 10.4049/jimmunol.2000496. Epub 2020 Nov 13.
Somatic hypermutation (SHM) generates much of the Ab diversity necessary for affinity maturation and effective humoral immunity. The activation-induced cytidine deaminase-induced DNA lesions and error-prone repair that underlie SHM are known to exhibit intrinsic biases when targeting the Ig sequences. Computational models for SHM targeting often model the targeting probability of a nucleotide in a motif-based fashion, assuming that the same DNA motif is equally likely to be targeted regardless of its position along the Ig sequence. The validity of this assumption, however, has not been rigorously studied in vivo. In this study, by analyzing a large collection of 956,157 human Ig sequences while controlling for the confounding influence of selection, we show that the likelihood of a DNA 5-mer motif being targeted by SHM is not the same at different positions in the same Ig sequence. We found position-dependent differential SHM targeting for about three quarters of the 38 and 269 unique motifs from more than half of the 292 and 1912 motif-allele pairs analyzed using productive and nonproductive Ig sequences, respectively. The direction of the differential SHM targeting was largely conserved across individuals with no allele-specific effect within an IgH variable gene family, but was not consistent with general decay of SHM targeting with increasing distance from the transcription start site. However, SHM targeting did correlate positively with the mutability of the wider sequence neighborhood surrounding the motif. These findings provide insights and future directions for computational efforts toward modeling SHM.
体细胞超突变(SHM)产生了大量抗体亲和力成熟和有效体液免疫所必需的 Ab 多样性。激活诱导的胞苷脱氨酶诱导的 DNA 损伤和易错修复是 SHM 的基础,已知其在靶向 Ig 序列时存在内在偏向。SHM 靶向的计算模型通常以基于基序的方式对核苷酸的靶向概率进行建模,假设无论其在 Ig 序列中的位置如何,相同的 DNA 基序都具有相同的靶向可能性。然而,这一假设的有效性尚未在体内进行严格研究。在这项研究中,我们通过分析大量的 956,157 个人类 Ig 序列,同时控制选择的混杂影响,表明 SHM 靶向 DNA 5- 碱基基序的可能性在同一 Ig 序列的不同位置是不同的。我们发现,大约四分之三的 38 个和 269 个独特基序,以及超过一半的 292 个和 1912 个基序-等位基因对,无论是使用有功能的还是无功能的 Ig 序列进行分析,其 SHM 靶向都存在位置依赖性差异。这种差异 SHM 靶向的方向在个体之间基本保持一致,在 IgH 可变基因家族内没有等位基因特异性效应,但与从转录起始位点开始的距离增加导致 SHM 靶向逐渐减少不一致。然而,SHM 靶向确实与围绕基序的更广泛序列邻域的易突变性呈正相关。这些发现为计算模型建模 SHM 的努力提供了新的思路和方向。