School of Biomedical Engineering, Science and Health Systems Drexel University, Philadelphia, PA, USA.
Phys Biol. 2013 Jun;10(3):035005. doi: 10.1088/1478-3975/10/3/035005. Epub 2013 Jun 4.
The immune system can detect most invading pathogens. The potential for detection of pathogens is dependent on the somatic diversity of the immune repertoires. While it is known that this somatic diversity is carefully generated, it is unclear how the diversity is distributed in the different genes encoding receptors of immune cells. Utilizing different metrics for richness and diversity at the level of small sequence fragments, we present here an analysis of the entire known human germline repertoire as represented by the sequences from the ImMunoGeneTics database of immune receptors. We have developed a fragment sequence quantification analysis to track variation of repertoires with different degrees of precision. Somatic diversity has previously been functionally characterized mostly by division of the V gene sequences into the more conserved and invariant framework (FR) of the receptor and more varied complementarity determining regions (CDR), that interact with the antigen. We find that CDR and FR can be explicitly identified with our sequence fragment diversity quantification technique. In terms of diversity, CDR and FR are especially distinct in B cell V genes. T cell V genes show less of the CDR/FR periodicity but are more diverse overall. Our analysis further shows that there are other areas of diversity outside the CDR and FR that are found widely dispersed in T cell receptor V genes and more tightly focused in FR1 and FR3 in the B cell receptor V genes. The diversity we observe is not dependent on allelic differences nor is this diversity segregated by individual V gene families. We would thus expect that each individual exhibit a diversity equivalent to that of the entire potential repertoire.
免疫系统可以检测到大多数入侵的病原体。病原体检测的潜力取决于免疫反应的体细胞多样性。虽然已知这种体细胞多样性是精心产生的,但尚不清楚多样性在编码免疫细胞受体的不同基因中是如何分布的。利用小序列片段水平的丰富度和多样性的不同度量标准,我们在此分析了整个已知的人类种系免疫受体序列库(ImMunoGeneTics 数据库)所代表的人类种系免疫受体的全部序列。我们开发了一种片段序列定量分析方法,以跟踪不同精度的受体库的变化。体细胞多样性以前主要通过将 V 基因序列分为受体的更保守和不变的框架 (FR) 和更多变的互补决定区 (CDR) 来进行功能表征,CDR 与抗原相互作用。我们发现,我们的序列片段多样性定量技术可以明确识别 CDR 和 FR。就多样性而言,CDR 和 FR 在 B 细胞 V 基因中尤为独特。T 细胞 V 基因的 CDR/FR 周期性较弱,但总体上更具多样性。我们的分析进一步表明,在 CDR 和 FR 之外,还有其他多样性区域,这些区域在 T 细胞受体 V 基因中广泛分散,而在 B 细胞受体 V 基因中 FR1 和 FR3 更为集中。我们观察到的多样性不依赖于等位基因差异,也不以个体 V 基因家族为单位进行分离。因此,我们预计每个人都表现出与整个潜在受体库相当的多样性。