HLA-HD: An accurate HLA typing algorithm for next-generation sequencing data.

Authors

Kawaguchi Shuji, Higasa Koichiro, Shimizu Masakazu, Yamada Ryo, Matsuda Fumihiko

Affiliation

Center for Genomic Medicine, Kyoto University Graduate School of Medicine, Sakyo-ku, Kyoto, Japan.

Publication

Hum Mutat. 2017 Jul;38(7):788-797. doi: 10.1002/humu.23230. Epub 2017 May 12.

Abstract

The accurate typing of human leukocyte antigen (HLA) alleles is critical for a variety of medical applications, such as genomic studies of multifactorial diseases, including immune system and inflammation-related disorders, and donor selection in organ transplantation and regenerative medicine. Here, we developed a new algorithm for determining HLA alleles using next-generation sequencing (NGS) results. The method consists of constructing an extensive dictionary of HLA alleles, precise mapping of the NGS reads, and calculating a score based on weighted read counts to select the most suitable pair of alleles. The developed algorithm compares the score of all allele pairs, taking into account variation not only in the domain for antigen presentation (G-DOMAIN), but also outside this domain. Using this method, HLA alleles could be determined with 6-digit precision. We showed that our method was more accurate than other NGS-based methods and revealed limitations of the conventional HLA typing technologies. Furthermore, we determined the complete genomic sequence of an HLA-A-like pseudogene when we assembled NGS reads that had caused arguable typing, and found its identity with HLA-Y*02:01. The accuracy of the HLA-A allele typing was improved after the HLA-Y*02:01 sequence was included in the HLA allele dictionary.
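
The pair-selection step described in the abstract can be illustrated with a minimal sketch. It is not the HLA-HD implementation: the abstract does not give the scoring formula, so the weighting used here (each read contributes 1 divided by the number of dictionary alleles it maps to) is an assumption chosen for illustration, and the names `READ_HITS`, `score_pair`, and `best_pair` and the toy read data are hypothetical.

```python
from itertools import combinations_with_replacement

# Toy mapping result (hypothetical data): one entry per read, holding the
# set of dictionary alleles that the read maps to.
READ_HITS = [
    {"A*01:01:01"},
    {"A*01:01:01"},
    {"A*01:01:01", "A*02:01:01"},  # ambiguous read, shared by two alleles
    {"A*02:01:01"},
    {"A*02:01:01"},
    {"A*24:02:01"},
]


def score_pair(pair, read_hits):
    """Sum the weights of all reads explained by at least one allele of the pair.

    Assumed weighting (not taken from the paper): a read that maps to k
    alleles contributes 1/k, so ambiguous reads count for less.
    """
    first, second = pair
    total = 0.0
    for alleles in read_hits:
        if first in alleles or second in alleles:
            total += 1.0 / len(alleles)
    return total


def best_pair(read_hits):
    """Score every allele pair, homozygous pairs included, and return the best."""
    candidates = sorted({allele for hits in read_hits for allele in hits})
    pairs = combinations_with_replacement(candidates, 2)
    return max(pairs, key=lambda pair: score_pair(pair, read_hits))


if __name__ == "__main__":
    winner = best_pair(READ_HITS)
    print(winner, score_pair(winner, READ_HITS))
```

Enumerating pairs with `combinations_with_replacement` keeps homozygous calls in the comparison, which matches the abstract's statement that the scores of all allele pairs are compared.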
