Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, United States.
Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, United States.
Front Immunol. 2021 Feb 2;11:618409. doi: 10.3389/fimmu.2020.618409. eCollection 2020.
The somatic hypermutation (SHM) of Immunoglobulin (Ig) genes is a key process during antibody affinity maturation in B cells. The mutagenic enzyme activation induced deaminase (AID) is required for SHM and has a preference for WRC hotspots in DNA. Error-prone repair mechanisms acting downstream of AID introduce further mutations, including DNA polymerase eta (Polη), part of the non-canonical mismatch repair pathway (ncMMR), which preferentially generates mutations at WA hotspots. Previously proposed mechanistic models lead to a variety of predictions concerning interactions between hotspots, for example, how mutations in one hotspot will affect another hotspot. Using a large, high-quality, Ig repertoire sequencing dataset, we evaluated pairwise correlations between mutations site-by-site using an unbiased measure similar to mutual information which we termed "mutational association" (MA). Interactions are dominated by relatively strong correlations between nearby sites (short-range MAs), which can be almost entirely explained by interactions between overlapping hotspots for AID and/or Polη. We also found relatively weak dependencies between almost all sites throughout each gene (longer-range MAs), although these arise mostly as a statistical consequence of high pairwise mutation frequencies. The dominant short-range interactions are also highest within the most highly mutating IGHV sub-regions, such as the complementarity determining regions (CDRs), where there is a high hotspot density. Our results suggest that the hotspot preferences for AID and Polη have themselves evolved to allow for greater interactions between AID and/or Polη induced mutations.
体细胞超突变(SHM)是 B 细胞中抗体亲和力成熟的关键过程。免疫球蛋白(Ig)基因的突变酶激活诱导脱氨酶(AID)是 SHM 所必需的,并且对 DNA 中的 WRC 热点具有偏好性。AID 下游的易错修复机制会引入进一步的突变,包括 DNA 聚合酶 eta(Polη),非经典错配修复途径(ncMMR)的一部分,它优先在 WA 热点处产生突变。以前提出的机制模型导致了各种关于热点之间相互作用的预测,例如,一个热点的突变将如何影响另一个热点。我们使用一个大型的、高质量的 Ig 库测序数据集,使用类似于互信息的无偏测量方法,对每个突变位点进行了逐点的突变相关性评估,我们称之为“突变关联”(MA)。热点之间的相互作用主要由附近位点之间的相对较强的相关性(短程 MA)主导,这些相关性几乎可以完全用 AID 和/或 Polη 重叠热点之间的相互作用来解释。我们还发现了几乎所有基因中几乎所有位点之间的相对较弱的依赖性(长程 MA),尽管这些依赖性主要是由于高的成对突变频率的统计结果。主导的短程相互作用也在最易突变的 IGHV 亚区(如互补决定区(CDR))中最高,这些区域的热点密度很高。我们的结果表明,AID 和 Polη 的热点偏好本身已经进化,以允许 AID 和/或 Polη 诱导的突变之间有更大的相互作用。