Hershberg Uri, Uduman Mohamed, Shlomchik Mark J, Kleinstein Steven H
Department of Laboratory Medicine, Yale University School of Medicine, New Haven, CT 06520, USA.
Int Immunol. 2008 May;20(5):683-94. doi: 10.1093/intimm/dxn026. Epub 2008 Apr 7.
Statistical methods based on the relative frequency of replacement mutations in B lymphocyte Ig V region sequences have been widely used to detect the forces of selection that shape the B cell repertoire. However, current methods produce an unexpectedly high frequency of false positives and are sensitive to intrinsic biases of somatic hypermutation that can give the appearance of selection. The new statistical test proposed here provides a better trade-off between sensitivity and specificity compared with previous approaches. The low specificity of existing methods was shown in silico to result from an interaction between the effects of positive and negative selection. False detection of positive selection was confirmed in vivo through a re-analysis of published sequence data from diffuse large B cell lymphomas, highlighting the need for re-analysis of some existing studies. The sensitivity of the proposed method to detect selection was validated using new Ig transgenic mouse models in which positive selection was expected to be a significant force, as well as with a simulation-based approach. Previous concerns that intrinsic biases of somatic hypermutation could give the appearance of selection were addressed by extending the current mutation models to more fully account for the impact of microsequence on relative mutability and to include transition bias. High specificity was confirmed using a large set of non-productively rearranged Ig sequences. These results show that selection can be detected in vivo with high specificity using the new method proposed here, allowing greater insight into the existence and direction of antigen-driven selection.
基于B淋巴细胞Ig V区序列中替换突变相对频率的统计方法已被广泛用于检测塑造B细胞库的选择力。然而,目前的方法产生了出乎意料的高假阳性频率,并且对体细胞超突变的内在偏差敏感,这些偏差可能会给人一种选择的假象。与先前的方法相比,这里提出的新统计检验在敏感性和特异性之间提供了更好的权衡。现有方法的低特异性在计算机模拟中显示是由正选择和负选择效应之间的相互作用导致的。通过对弥漫性大B细胞淋巴瘤已发表序列数据的重新分析,在体内证实了对正选择的错误检测,突出了对一些现有研究进行重新分析的必要性。使用新的Ig转基因小鼠模型(其中正选择预计是一个重要因素)以及基于模拟的方法,验证了所提出方法检测选择的敏感性。通过扩展当前的突变模型以更充分地考虑微序列对相对突变性的影响并纳入转换偏差,解决了先前关于体细胞超突变的内在偏差可能会给人一种选择假象的担忧。使用大量非生产性重排Ig序列证实了高特异性。这些结果表明,使用这里提出的新方法可以在体内以高特异性检测选择,从而更深入地了解抗原驱动选择的存在和方向。