• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HIBAG——基于属性装袋法的HLA基因型推算

HIBAG--HLA genotype imputation with attribute bagging.

作者信息

Zheng X, Shen J, Cox C, Wakefield J C, Ehm M G, Nelson M R, Weir B S

机构信息

Department of Biostatistics, University of Washington, Seattle, WA, USA.

Quantitative Sciences, GlaxoSmithKline, Research Triangle Park, NC, USA.

出版信息

Pharmacogenomics J. 2014 Apr;14(2):192-200. doi: 10.1038/tpj.2013.18. Epub 2013 May 28.

DOI:10.1038/tpj.2013.18
PMID:23712092
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3772955/
Abstract

Genotyping of classical human leukocyte antigen (HLA) alleles is an essential tool in the analysis of diseases and adverse drug reactions with associations mapping to the major histocompatibility complex (MHC). However, deriving high-resolution HLA types subsequent to whole-genome single-nucleotide polymorphism (SNP) typing or sequencing is often cost prohibitive for large samples. An alternative approach takes advantage of the extended haplotype structure within the MHC to predict HLA alleles using dense SNP genotypes, such as those available from genome-wide SNP panels. Current methods for HLA imputation are difficult to apply or may require the user to have access to large training data sets with SNP and HLA types. We propose HIBAG, HLA Imputation using attribute BAGging, that makes predictions by averaging HLA-type posterior probabilities over an ensemble of classifiers built on bootstrap samples. We assess the performance of HIBAG using our study data (n=2668 subjects of European ancestry) as a training set and HLA data from the British 1958 birth cohort study (n≈1000 subjects) as independent validation samples. Prediction accuracies for HLA-A, B, C, DRB1 and DQB1 range from 92.2% to 98.1% using a set of SNP markers common to the Illumina 1M Duo, OmniQuad, OmniExpress, 660K and 550K platforms. HIBAG performed well compared with the other two leading methods, HLA*IMP and BEAGLE. This method is implemented in a freely available HIBAG R package that includes pre-fit classifiers for European, Asian, Hispanic and African ancestries, providing a readily available imputation approach without the need to have access to large training data sets.

摘要

经典人类白细胞抗原(HLA)等位基因的基因分型是分析与主要组织相容性复合体(MHC)相关的疾病和药物不良反应的重要工具。然而,在全基因组单核苷酸多态性(SNP)分型或测序之后获得高分辨率HLA类型对于大样本来说成本往往过高。一种替代方法利用MHC内扩展的单倍型结构,使用密集的SNP基因型(如全基因组SNP面板中的那些)来预测HLA等位基因。目前的HLA推断方法难以应用,或者可能要求用户能够获得具有SNP和HLA类型的大型训练数据集。我们提出了HIBAG,即使用属性装袋法进行HLA推断,它通过对基于自助抽样构建的分类器集合上的HLA类型后验概率求平均值来进行预测。我们使用我们的研究数据(n = 2668名欧洲血统受试者)作为训练集,并将来自英国1958年出生队列研究的HLA数据(n≈1000名受试者)作为独立验证样本,评估HIBAG的性能。使用Illumina 1M Duo、OmniQuad、OmniExpress、660K和550K平台共有的一组SNP标记,HLA - A、B、C、DRB1和DQB1的预测准确率在92.2%至98.1%之间。与其他两种领先方法HLA*IMP和BEAGLE相比,HIBAG表现良好。该方法在一个免费提供的HIBAG R包中实现,该包包括针对欧洲、亚洲、西班牙裔和非洲血统的预拟合分类器,提供了一种无需访问大型训练数据集即可随时使用的推断方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3b8/3992870/2f2d4634579f/tpj201318f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3b8/3992870/7f1230969867/tpj201318f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3b8/3992870/ef35aaa7a41e/tpj201318f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3b8/3992870/2f2d4634579f/tpj201318f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3b8/3992870/7f1230969867/tpj201318f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3b8/3992870/ef35aaa7a41e/tpj201318f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3b8/3992870/2f2d4634579f/tpj201318f3.jpg

相似文献

1
HIBAG--HLA genotype imputation with attribute bagging.HIBAG——基于属性装袋法的HLA基因型推算
Pharmacogenomics J. 2014 Apr;14(2):192-200. doi: 10.1038/tpj.2013.18. Epub 2013 May 28.
2
Imputation-Based HLA Typing with GWAS SNPs.基于 GWAS SNPs 的推断性 HLA 分型。
Methods Mol Biol. 2024;2809:127-143. doi: 10.1007/978-1-0716-3874-3_9.
3
Construction and benchmarking of a multi-ethnic reference panel for the imputation of HLA class I and II alleles.构建并基准测试用于 HLA Ⅰ类和Ⅱ类等位基因推断的多民族参考面板。
Hum Mol Genet. 2019 Jun 15;28(12):2078-2092. doi: 10.1093/hmg/ddy443.
4
High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references.基于群体特异性参考的高分辨率单核苷酸多态性数据对HLA I类和II类基因进行高精度填充。
Pharmacogenomics J. 2015 Dec;15(6):530-7. doi: 10.1038/tpj.2015.4. Epub 2015 Feb 24.
5
Performance of HLA allele prediction methods in African Americans for class II genes HLA-DRB1, -DQB1, and -DPB1.HLA等位基因预测方法在非裔美国人中对II类基因HLA-DRB1、-DQB1和-DPB1的性能表现。
BMC Genet. 2014 Jun 16;15:72. doi: 10.1186/1471-2156-15-72.
6
Imputation-Based HLA Typing with SNPs in GWAS Studies.全基因组关联研究中基于插补法利用单核苷酸多态性进行人类白细胞抗原分型
Methods Mol Biol. 2018;1802:163-176. doi: 10.1007/978-1-4939-8546-3_11.
7
Human leucocyte antigen class I and II imputation in a multiracial population.多种族人群中人类白细胞抗原I类和II类的推算
Int J Immunogenet. 2016 Dec;43(6):369-375. doi: 10.1111/iji.12292. Epub 2016 Oct 24.
8
SNP-HLA Reference Consortium (SHLARC): HLA and SNP data sharing for promoting MHC-centric analyses in genomics.单核苷酸多态性-人类白细胞抗原参考联盟 (SHLARC):促进基因组学中 MHC 中心分析的 HLA 和 SNP 数据共享。
Genet Epidemiol. 2020 Oct;44(7):733-740. doi: 10.1002/gepi.22334. Epub 2020 Jul 18.
9
Imputing amino acid polymorphisms in human leukocyte antigens.推断人类白细胞抗原中的氨基酸多态性。
PLoS One. 2013 Jun 6;8(6):e64683. doi: 10.1371/journal.pone.0064683. Print 2013.
10
Comparison of HLA allelic imputation programs.HLA 等位基因推算程序的比较。
PLoS One. 2017 Feb 16;12(2):e0172444. doi: 10.1371/journal.pone.0172444. eCollection 2017.

引用本文的文献

1
Genome-wide association study of pulpal and apical diseases.牙髓和根尖周疾病的全基因组关联研究
Nat Commun. 2025 Jul 23;16(1):6774. doi: 10.1038/s41467-025-61721-1.
2
Optimized multiplex PCR-NGS for comprehensive HLA genotyping in Chinese populations: resolving ambiguities at high resolution.优化的多重PCR-NGS用于中国人群全面的HLA基因分型:高分辨率解析模糊性
Front Immunol. 2025 Jun 26;16:1551173. doi: 10.3389/fimmu.2025.1551173. eCollection 2025.
3
Impact of placental and peripheral blood DNA methylation on celiac disease susceptibility.

本文引用的文献

1
The IMGT/HLA database.IMGT/HLA 数据库。
Nucleic Acids Res. 2013 Jan;41(Database issue):D1222-7. doi: 10.1093/nar/gks949. Epub 2012 Oct 17.
2
Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis.五个氨基酸在三种 HLA 蛋白中解释了 MHC 与血清阳性类风湿关节炎之间的大部分关联。
Nat Genet. 2012 Jan 29;44(3):291-6. doi: 10.1038/ng.1076.
3
Haplotype phasing: existing methods and new developments.单体型相位确定:现有方法和新进展。
胎盘和外周血DNA甲基化对乳糜泻易感性的影响。
J Pediatr Gastroenterol Nutr. 2025 Sep;81(3):587-595. doi: 10.1002/jpn3.70124. Epub 2025 Jun 22.
4
Diversity and longitudinal records: Genetic architecture of disease associations and polygenic risk in the Taiwanese Han population.多样性与纵向记录:台湾汉族人群疾病关联及多基因风险的遗传结构
Sci Adv. 2025 Jun 6;11(23):eadt0539. doi: 10.1126/sciadv.adt0539. Epub 2025 Jun 4.
5
Female-mediated selective sperm activation may remodel major histocompatibility complex-based mate choice decisions in humans.雌性介导的选择性精子激活可能会重塑人类基于主要组织相容性复合体的配偶选择决策。
Heredity (Edinb). 2025 May 9. doi: 10.1038/s41437-025-00759-9.
6
MultiCook: A Tool That Improves Accuracy of HLA Imputation by Combining Probabilities From Multiple Reference Panels and Methods.MultiCook:一种通过结合来自多个参考面板和方法的概率来提高HLA基因分型准确性的工具。
HLA. 2025 May;105(5):e70153. doi: 10.1111/tan.70153.
7
genotype imputation with attribute bagging (LIBAG): leukocyte immunoglobulin-like receptor copy number imputation system.基于属性装袋法的基因型插补(LIBAG):白细胞免疫球蛋白样受体拷贝数插补系统
Front Immunol. 2025 Apr 7;16:1559301. doi: 10.3389/fimmu.2025.1559301. eCollection 2025.
8
Clinical and HLA Associations of Fluoroquinolone-Induced Liver Injury: Results From the Drug-Induced Liver Injury Network.氟喹诺酮类药物所致肝损伤的临床及HLA相关性:药物性肝损伤网络研究结果
Am J Gastroenterol. 2025 Apr 10. doi: 10.14309/ajg.0000000000003457.
9
HLA-DQB1*03:01 and risk of HBV-related HCC.HLA-DQB1*03:01与乙型肝炎病毒相关肝细胞癌的风险
Hepatology. 2025 Mar 14. doi: 10.1097/HEP.0000000000001307.
10
Machine Learning Methods for Classifying Multiple Sclerosis and Alzheimer's Disease Using Genomic Data.使用基因组数据对多发性硬化症和阿尔茨海默病进行分类的机器学习方法
Int J Mol Sci. 2025 Feb 27;26(5):2085. doi: 10.3390/ijms26052085.
Nat Rev Genet. 2011 Sep 16;12(10):703-14. doi: 10.1038/nrg3054.
4
HLA*IMP--an integrated framework for imputing classical HLA alleles from SNP genotypes.HLA*IMP——一种从 SNP 基因型推断经典 HLA 等位基因的集成框架。
Bioinformatics. 2011 Apr 1;27(7):968-72. doi: 10.1093/bioinformatics/btr061. Epub 2011 Feb 7.
5
MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.MaCH:利用序列和基因型数据来估计单倍型和未观测基因型。
Genet Epidemiol. 2010 Dec;34(8):816-34. doi: 10.1002/gepi.20533.
6
Linkage disequilibrium and age of HLA region SNPs in relation to classic HLA gene alleles within Europe.欧洲人群中 HLA 区域单核苷酸多态性的连锁不平衡与经典 HLA 基因等位基因及其与年龄的关系
Eur J Hum Genet. 2010 Aug;18(8):924-32. doi: 10.1038/ejhg.2010.32. Epub 2010 Mar 31.
7
A flexible and accurate genotype imputation method for the next generation of genome-wide association studies.一种用于下一代全基因组关联研究的灵活且准确的基因型填充方法。
PLoS Genet. 2009 Jun;5(6):e1000529. doi: 10.1371/journal.pgen.1000529. Epub 2009 Jun 19.
8
The HLA genomic loci map: expression, interaction, diversity and disease.HLA 基因组座图谱:表达、相互作用、多样性和疾病。
J Hum Genet. 2009 Jan;54(1):15-39. doi: 10.1038/jhg.2008.5. Epub 2009 Jan 9.
9
A statistical method for predicting classical HLA alleles from SNP data.一种从单核苷酸多态性(SNP)数据预测经典人类白细胞抗原(HLA)等位基因的统计方法。
Am J Hum Genet. 2008 Jan;82(1):48-56. doi: 10.1016/j.ajhg.2007.09.001.
10
Localization of type 1 diabetes susceptibility to the MHC class I genes HLA-B and HLA-A.1型糖尿病易感性定位于MHC I类基因HLA - B和HLA - A。
Nature. 2007 Dec 6;450(7171):887-92. doi: 10.1038/nature06406. Epub 2007 Nov 14.