Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA.
Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA.
Genome Res. 2023 Jun;33(6):923-931. doi: 10.1101/gr.277585.122. Epub 2023 May 11.
Killer cell immunoglobulin like receptor (KIR) genes and human leukocyte antigen (HLA) genes play important roles in innate and adaptive immunity. They are highly polymorphic and cannot be genotyped with standard variant calling pipelines. Compared with HLA genes, many KIR genes are similar to each other in sequences and may be absent in the chromosomes. Therefore, although many tools have been developed to genotype HLA genes using common sequencing data, none of them work for KIR genes. Even specialized KIR genotypers could not resolve all the KIR genes. Here we describe T1K, a novel computational method for the efficient and accurate inference of KIR or HLA alleles from RNA-seq, whole-genome sequencing, or whole-exome sequencing data. T1K jointly considers alleles across all genotyped genes, so it can reliably identify present genes and distinguish homologous genes, including the challenging / genes. This model also benefits HLA genotyping, where T1K achieves high accuracy in benchmarks. Moreover, T1K can call novel single-nucleotide variants and process single-cell data. Applying T1K to tumor single-cell RNA-seq data, we found that expression was enriched in tumor-specific CD8 T cells. T1K may open the opportunity for HLA and KIR genotyping across various sequencing applications.
杀伤细胞免疫球蛋白样受体 (KIR) 基因和人类白细胞抗原 (HLA) 基因在先天和适应性免疫中发挥重要作用。它们高度多态性,无法使用标准变异调用管道进行基因分型。与 HLA 基因相比,许多 KIR 基因在序列上彼此相似,并且可能在染色体中缺失。因此,尽管已经开发了许多工具来使用常见的测序数据对 HLA 基因进行基因分型,但没有一种工具适用于 KIR 基因。即使是专门的 KIR 基因分型器也无法解析所有的 KIR 基因。在这里,我们描述了 T1K,这是一种从 RNA-seq、全基因组测序或全外显子组测序数据中高效准确推断 KIR 或 HLA 等位基因的新型计算方法。T1K 联合考虑了所有已基因分型基因的等位基因,因此它可以可靠地识别存在的基因并区分同源基因,包括具有挑战性的 / 基因。该模型还受益于 HLA 基因分型,其中 T1K 在基准测试中达到了高精度。此外,T1K 可以调用新的单核苷酸变体并处理单细胞数据。将 T1K 应用于肿瘤单细胞 RNA-seq 数据,我们发现 在肿瘤特异性 CD8 T 细胞中富集表达。T1K 可能为各种测序应用中的 HLA 和 KIR 基因分型开辟机会。