DKMS Life Science Lab, Dresden, Germany.
DKMS, Tübingen, Germany.
BMC Bioinformatics. 2021 May 10;22(1):236. doi: 10.1186/s12859-021-04153-0.
High resolution HLA genotyping of donors and recipients is a crucially important prerequisite for haematopoetic stem-cell transplantation and relies heavily on the quality and completeness of immunogenetic reference sequence databases of allelic variation.
Here, we report on DR2S, an R package that leverages the strengths of two sequencing technologies-the accuracy of next-generation sequencing with the read length of third-generation sequencing technologies like PacBio's SMRT sequencing or ONT's nanopore sequencing-to reconstruct fully-phased high-quality full-length haplotype sequences. Although optimised for HLA and KIR genes, DR2S is applicable to all loci with known reference sequences provided that full-length sequencing data is available for analysis. In addition, DR2S integrates supporting tools for easy visualisation and quality control of the reconstructed haplotype to ensure suitability for submission to public allele databases.
DR2S is a largely automated workflow designed to create high-quality fully-phased reference allele sequences for highly polymorphic gene regions such as HLA or KIR. It has been used by biologists to successfully characterise and submit more than 500 HLA alleles and more than 500 KIR alleles to the IPD-IMGT/HLA and IPD-KIR databases.
供者和受者的高分辨率 HLA 基因分型是造血干细胞移植的一个极其重要的前提条件,这严重依赖于等位基因变异免疫遗传参考序列数据库的质量和完整性。
在这里,我们报告了 DR2S,这是一个 R 包,利用了两种测序技术的优势——下一代测序的准确性和第三代测序技术(如 PacBio 的 SMRT 测序或 ONT 的纳米孔测序的读长),以重建完全定相的高质量全长单倍型序列。尽管针对 HLA 和 KIR 基因进行了优化,但 DR2S 适用于所有具有已知参考序列的基因座,前提是有全长测序数据可供分析。此外,DR2S 集成了易于可视化和质量控制的支持工具,以确保适用于提交给公共等位基因数据库。
DR2S 是一个自动化程度很高的工作流程,旨在为 HLA 或 KIR 等高度多态性基因区域创建高质量的完全定相参考等位基因序列。它已被生物学家用于成功地对 500 多个 HLA 等位基因和 500 多个 KIR 等位基因进行特征描述和提交到 IPD-IMGT/HLA 和 IPD-KIR 数据库。