• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用已识别的单倍型和单倍型模式,为聚类基因的存在-缺失数据进行单倍型推断的隐马尔可夫模型。

A hidden Markov model for haplotype inference for present-absent data of clustered genes using identified haplotypes and haplotype patterns.

机构信息

Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham Birmingham, AL, USA.

Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham Birmingham, AL, USA ; Queensland Brain Institute, The University of Queensland St. Lucia, QLD, Australia.

出版信息

Front Genet. 2014 Aug 12;5:267. doi: 10.3389/fgene.2014.00267. eCollection 2014.

DOI:10.3389/fgene.2014.00267
PMID:25161663
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4129397/
Abstract

The majority of killer cell immunoglobin-like receptor (KIR) genes are detected as either present or absent using locus-specific genotyping technology. Ambiguity arises from the presence of a specific KIR gene since the exact copy number (one or two) of that gene is unknown. Therefore, haplotype inference for these genes is becoming more challenging due to such large portion of missing information. Meantime, many haplotypes and partial haplotype patterns have been previously identified due to tight linkage disequilibrium (LD) among these clustered genes thus can be incorporated to facilitate haplotype inference. In this paper, we developed a hidden Markov model (HMM) based method that can incorporate identified haplotypes or partial haplotype patterns for haplotype inference from present-absent data of clustered genes (e.g., KIR genes). We compared its performance with an expectation maximization (EM) based method previously developed in terms of haplotype assignments and haplotype frequency estimation through extensive simulations for KIR genes. The simulation results showed that the new HMM based method outperformed the previous method when some incorrect haplotypes were included as identified haplotypes and/or the standard deviation of haplotype frequencies were small. We also compared the performance of our method with two methods that do not use previously identified haplotypes and haplotype patterns, including an EM based method, HPALORE, and a HMM based method, MaCH. Our simulation results showed that the incorporation of identified haplotypes and partial haplotype patterns can improve accuracy for haplotype inference. The new software package HaploHMM is available and can be downloaded at http://www.soph.uab.edu/ssg/files/People/KZhang/HaploHMM/haplohmm-index.html.

摘要

大多数杀伤细胞免疫球蛋白样受体(KIR)基因使用基因座特异性基因分型技术检测为存在或不存在。由于该基因的确切拷贝数(一个或两个)未知,因此存在特定的 KIR 基因时会出现歧义。因此,由于这种大量缺失信息,这些基因的单体型推断变得更加具有挑战性。同时,由于这些聚集的基因之间存在紧密的连锁不平衡(LD),因此已经确定了许多单体型和部分单体型模式,从而可以将其纳入以促进单体型推断。在本文中,我们开发了一种基于隐马尔可夫模型(HMM)的方法,该方法可以从聚类基因(例如 KIR 基因)的存在-缺失数据中结合已识别的单体型或部分单体型模式进行单体型推断。我们通过对 KIR 基因进行广泛的模拟,比较了其与先前基于期望最大化(EM)的方法在单体型分配和单体型频率估计方面的性能。模拟结果表明,当将一些不正确的单体型作为已识别的单体型包含在内和/或单体型频率的标准偏差较小时,新的基于 HMM 的方法优于先前的方法。我们还比较了我们的方法与不使用先前确定的单体型和单体型模式的两种方法的性能,包括基于 EM 的方法 HPALORE 和基于 HMM 的方法 MaCH。我们的模拟结果表明,结合已识别的单体型和部分单体型模式可以提高单体型推断的准确性。新的软件包 HaploHMM 可在以下网址获得并下载:http://www.soph.uab.edu/ssg/files/People/KZhang/HaploHMM/haplohmm-index.html。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/6cbf5d2e928f/fgene-05-00267-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/0f22cd4a9b10/fgene-05-00267-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/6664f01fc7e4/fgene-05-00267-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/a0bca2453216/fgene-05-00267-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/6cbf5d2e928f/fgene-05-00267-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/0f22cd4a9b10/fgene-05-00267-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/6664f01fc7e4/fgene-05-00267-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/a0bca2453216/fgene-05-00267-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91f7/4129397/6cbf5d2e928f/fgene-05-00267-g0006.jpg

相似文献

1
A hidden Markov model for haplotype inference for present-absent data of clustered genes using identified haplotypes and haplotype patterns.使用已识别的单倍型和单倍型模式,为聚类基因的存在-缺失数据进行单倍型推断的隐马尔可夫模型。
Front Genet. 2014 Aug 12;5:267. doi: 10.3389/fgene.2014.00267. eCollection 2014.
2
Haplotype inference for present-absent genotype data using previously identified haplotypes and haplotype patterns.利用先前确定的单倍型和单倍型模式对存在-缺失基因型数据进行单倍型推断。
Bioinformatics. 2007 Sep 15;23(18):2399-406. doi: 10.1093/bioinformatics/btm371. Epub 2007 Jul 21.
3
Haplotype reconstruction for genetically complex regions with ambiguous genotype calls: Illustration by the KIR gene region.单倍型重建用于基因型模糊的遗传复杂区域:以 KIR 基因区域为例。
Genet Epidemiol. 2024 Feb;48(1):3-26. doi: 10.1002/gepi.22538. Epub 2023 Oct 13.
4
Genotype calling from next-generation sequencing data using haplotype information of reads.基于读段单倍型信息进行下一代测序数据的基因型推断。
Bioinformatics. 2012 Apr 1;28(7):938-46. doi: 10.1093/bioinformatics/bts047. Epub 2012 Jan 27.
5
Estimation of German KIR Allele Group Haplotype Frequencies.德国 KIR 等位基因组单倍型频率的估计。
Front Immunol. 2020 Mar 12;11:429. doi: 10.3389/fimmu.2020.00429. eCollection 2020.
6
Estimating KIR Haplotype Frequencies on a Cohort of 10,000 Individuals: A Comprehensive Study on Population Variations, Typing Resolutions, and Reference Haplotypes.估计一万名个体队列中的杀伤细胞免疫球蛋白样受体单倍型频率:关于群体变异、分型分辨率和参考单倍型的综合研究
PLoS One. 2016 Oct 10;11(10):e0163973. doi: 10.1371/journal.pone.0163973. eCollection 2016.
7
Comparison of the accuracy of methods of computational haplotype inference using a large empirical dataset.使用大型实证数据集对计算单倍型推断方法的准确性进行比较。
BMC Genet. 2004 Aug 3;5:22. doi: 10.1186/1471-2156-5-22.
8
Preliminary analysis of a KIR haplotype estimation algorithm: a simulation study.杀伤细胞免疫球蛋白样受体(KIR)单倍型估计算法的初步分析:一项模拟研究
Tissue Antigens. 2007 Apr;69 Suppl 1:96-100. doi: 10.1111/j.1399-0039.2006.762_4.x.
9
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.HAPLORE:一个用于在无重组的一般家系中进行单倍型重建的程序。
Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1.
10
Accounting for haplotype phase uncertainty in linkage disequilibrium estimation.在连锁不平衡估计中考虑单倍型相位不确定性。
Genet Epidemiol. 2008 Feb;32(2):168-78. doi: 10.1002/gepi.20273.

引用本文的文献

1
Estimating KIR Haplotype Frequencies on a Cohort of 10,000 Individuals: A Comprehensive Study on Population Variations, Typing Resolutions, and Reference Haplotypes.估计一万名个体队列中的杀伤细胞免疫球蛋白样受体单倍型频率:关于群体变异、分型分辨率和参考单倍型的综合研究
PLoS One. 2016 Oct 10;11(10):e0163973. doi: 10.1371/journal.pone.0163973. eCollection 2016.

本文引用的文献

1
A method for calling copy number polymorphism using haplotypes.一种使用单倍型调用拷贝数多态性的方法。
Front Genet. 2013 Sep 23;4:165. doi: 10.3389/fgene.2013.00165. eCollection 2013.
2
Inferring haplotypes of copy number variations from high-throughput data with uncertainty.从具有不确定性的高通量数据推断拷贝数变异的单倍型。
G3 (Bethesda). 2011 Jun;1(1):35-42. doi: 10.1534/g3.111.000174. Epub 2011 Jun 1.
3
Practical Consideration of Genotype Imputation: Sample Size, Window Size, Reference Choice, and Untyped Rate.基因型填充的实际考量:样本量、窗口大小、参考选择及未分型率
Stat Interface. 2011;4(3):339-352. doi: 10.4310/sii.2011.v4.n3.a8.
4
MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.MaCH:利用序列和基因型数据来估计单倍型和未观测基因型。
Genet Epidemiol. 2010 Dec;34(8):816-34. doi: 10.1002/gepi.20533.
5
An NOS3 Haplotype is Protective against Hypertension in a Caucasian Population.一种一氧化氮合酶3单倍型对白种人群的高血压具有保护作用。
Int J Hypertens. 2010 Mar 25;2010:865031. doi: 10.4061/2010/865031.
6
Inferring combined CNV/SNP haplotypes from genotype data.从基因型数据推断 CNV/SNP 单体型。
Bioinformatics. 2010 Jun 1;26(11):1437-45. doi: 10.1093/bioinformatics/btq157. Epub 2010 Apr 20.
7
Genetic epidemiology of glioblastoma multiforme: confirmatory and new findings from analyses of human leukocyte antigen alleles and motifs.胶质母细胞瘤多形性的遗传流行病学:人类白细胞抗原等位基因和基序分析的确认性和新发现。
PLoS One. 2009 Sep 23;4(9):e7157. doi: 10.1371/journal.pone.0007157.
8
Haplotype inference for present-absent genotype data using previously identified haplotypes and haplotype patterns.利用先前确定的单倍型和单倍型模式对存在-缺失基因型数据进行单倍型推断。
Bioinformatics. 2007 Sep 15;23(18):2399-406. doi: 10.1093/bioinformatics/btm371. Epub 2007 Jul 21.
9
A new multipoint method for genome-wide association studies by imputation of genotypes.一种通过基因型插补进行全基因组关联研究的新的多点方法。
Nat Genet. 2007 Jul;39(7):906-13. doi: 10.1038/ng2088. Epub 2007 Jun 17.
10
KIR and disease: a model system or system of models?杀伤细胞免疫球蛋白样受体与疾病:一个模型系统还是多个模型的系统?
Immunol Rev. 2006 Dec;214:186-201. doi: 10.1111/j.1600-065X.2006.00459.x.