• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在家族关联研究中结合基于进化指导的聚类算法和基于单体型的似然比检验。

Combining an evolution-guided clustering algorithm and haplotype-based LRT in family association studies.

机构信息

Department of Public Health and Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei 10055, Taiwan.

出版信息

BMC Genet. 2011 May 19;12:48. doi: 10.1186/1471-2156-12-48.

DOI:10.1186/1471-2156-12-48
PMID:21592403
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3118131/
Abstract

BACKGROUND

With the completion of the international HapMap project, many studies have been conducted to investigate the association between complex diseases and haplotype variants. Such haplotype-based association studies, however, often face two difficulties; one is the large number of haplotype configurations in the chromosome region under study, and the other is the ambiguity in haplotype phase when only genotype data are observed. The latter complexity may be handled based on an EM algorithm with family data incorporated, whereas the former can be more problematic, especially when haplotypes of rare frequencies are involved. Here based on family data we propose to cluster long haplotypes of linked SNPs in a biological sense, so that the number of haplotypes can be reduced and the power of statistical tests of association can be increased.

RESULTS

In this paper we employ family genotype data and combine a clustering scheme with a likelihood ratio statistic to test the association between quantitative phenotypes and haplotype variants. Haplotypes are first grouped based on their evolutionary closeness to establish a set containing core haplotypes. Then, we construct for each family the transmission and non-transmission phase in terms of these core haplotypes, taking into account simultaneously the phase ambiguity as weights. The likelihood ratio test (LRT) is next conducted with these weighted and clustered haplotypes to test for association with disease. This combination of evolution-guided haplotype clustering and weighted assignment in LRT is able, via its core-coding system, to incorporate into analysis both haplotype phase ambiguity and transmission uncertainty. Simulation studies show that this proposed procedure is more informative and powerful than three family-based association tests, FAMHAP, FBAT, and an LRT with a group consisting exclusively of rare haplotypes.

CONCLUSIONS

The proposed procedure takes into account the uncertainty in phase determination and in transmission, utilizes the evolutionary information contained in haplotypes, reduces the dimension in haplotype space and the degrees of freedom in tests, and performs better in association studies. This evolution-guided clustering procedure is particularly useful for long haplotypes containing linked SNPs, and is applicable to other haplotype-based association tests. This procedure is now implemented in R and is free for download.

摘要

背景

随着国际 HapMap 计划的完成,许多研究已经开展,以调查复杂疾病与单倍型变异之间的关联。然而,这种基于单倍型的关联研究通常面临两个困难;一个是研究染色体区域中单倍型构型的数量众多,另一个是当仅观察基因型数据时单倍型相位的不明确性。后者的复杂性可以基于包含家族数据的 EM 算法来处理,而前者则可能更加复杂,特别是当涉及罕见频率的单倍型时。在这里,我们基于家族数据提出了一种方法,即将连锁 SNPs 的长单倍型在生物学意义上进行聚类,从而减少单倍型的数量,并提高关联的统计检验的功效。

结果

在本文中,我们使用家族基因型数据,并结合聚类方案和似然比统计量来检验数量性状与单倍型变异之间的关联。首先,根据单倍型的进化关系对其进行分组,以建立一组包含核心单倍型的集合。然后,我们根据这些核心单倍型为每个家族构建传递和非传递相位,同时考虑相位模糊性作为权重。接下来,我们使用这些加权和聚类的单倍型进行似然比检验(LRT),以检验与疾病的关联。这种基于进化的单倍型聚类和 LRT 中加权分配的组合,通过其核心编码系统,能够将单倍型相位模糊性和传递不确定性纳入分析。模拟研究表明,与三种基于家族的关联检验(FAMHAP、FBAT 和仅包含罕见单倍型的 LRT)相比,该方法更具信息量和更有效。

结论

所提出的方法考虑了相位确定和传递的不确定性,利用了单倍型中包含的进化信息,减少了单倍型空间的维度和检验的自由度,并在关联研究中表现更好。这种基于进化的聚类程序对于包含连锁 SNPs 的长单倍型特别有用,并且适用于其他基于单倍型的关联检验。该程序现在已在 R 中实现,并可免费下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/6e5895214de6/1471-2156-12-48-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/0a3c12f2bf9f/1471-2156-12-48-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/f72fc04d4d85/1471-2156-12-48-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/8e67d1dc4f5d/1471-2156-12-48-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/9fa07838d448/1471-2156-12-48-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/11f0085b777a/1471-2156-12-48-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/6e5895214de6/1471-2156-12-48-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/0a3c12f2bf9f/1471-2156-12-48-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/f72fc04d4d85/1471-2156-12-48-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/8e67d1dc4f5d/1471-2156-12-48-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/9fa07838d448/1471-2156-12-48-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/11f0085b777a/1471-2156-12-48-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ea/3118131/6e5895214de6/1471-2156-12-48-6.jpg

相似文献

1
Combining an evolution-guided clustering algorithm and haplotype-based LRT in family association studies.在家族关联研究中结合基于进化指导的聚类算法和基于单体型的似然比检验。
BMC Genet. 2011 May 19;12:48. doi: 10.1186/1471-2156-12-48.
2
Using an uncertainty-coding matrix in Bayesian regression models for haplotype-specific risk detection in family association studies.在家族关联研究中,使用贝叶斯回归模型中的不确定性编码矩阵进行单体型特异性风险检测。
PLoS One. 2011;6(7):e21890. doi: 10.1371/journal.pone.0021890. Epub 2011 Jul 15.
3
Incorporating single-locus tests into haplotype cladistic analysis in case-control studies.在病例对照研究中,将单基因座检验纳入单倍型分支分析。
PLoS Genet. 2007 Mar 23;3(3):e46. doi: 10.1371/journal.pgen.0030046.
4
Powerful haplotype-based Hardy-Weinberg equilibrium tests for tightly linked loci.基于强关联单倍型的哈迪-温伯格平衡紧密连锁位点检验。
PLoS One. 2013 Oct 22;8(10):e77399. doi: 10.1371/journal.pone.0077399. eCollection 2013.
5
Genome-wide association studies using haplotype clustering with a new haplotype similarity.基于新型单倍型相似性的单倍型聚类的全基因组关联研究
Genet Epidemiol. 2010 Sep;34(6):633-41. doi: 10.1002/gepi.20521.
6
Maximum-likelihood estimation of haplotype frequencies in nuclear families.核心家庭中单体型频率的最大似然估计。
Genet Epidemiol. 2004 Jul;27(1):21-32. doi: 10.1002/gepi.10323.
7
Haplotype reconstruction for genetically complex regions with ambiguous genotype calls: Illustration by the KIR gene region.单倍型重建用于基因型模糊的遗传复杂区域:以 KIR 基因区域为例。
Genet Epidemiol. 2024 Feb;48(1):3-26. doi: 10.1002/gepi.22538. Epub 2023 Oct 13.
8
Transmission/disequilibrium test meets measured haplotype analysis: family-based association analysis guided by evolution of haplotypes.传递/不平衡检验与实测单倍型分析相结合:基于单倍型进化的家系关联分析
Am J Hum Genet. 2001 May;68(5):1250-63. doi: 10.1086/320110. Epub 2001 Apr 10.
9
Family-based tests for associating haplotypes with general phenotype data: Improving the FBAT-haplotype algorithm.用于将单倍型与一般表型数据相关联的基于家系的检验:改进FBAT单倍型算法
Genet Epidemiol. 2018 Feb;42(1):123-126. doi: 10.1002/gepi.22094. Epub 2017 Nov 21.
10
A novel approach for haplotype-based association analysis using family data.基于单体型的家系数据关联分析的新方法。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S45. doi: 10.1186/1471-2105-11-S1-S45.

引用本文的文献

1
Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies.在疾病关联研究中,使用汉明距离作为单核苷酸多态性(SNP)集聚类和检验的信息。
PLoS One. 2015 Aug 24;10(8):e0135918. doi: 10.1371/journal.pone.0135918. eCollection 2015.
2
Using an uncertainty-coding matrix in Bayesian regression models for haplotype-specific risk detection in family association studies.在家族关联研究中,使用贝叶斯回归模型中的不确定性编码矩阵进行单体型特异性风险检测。
PLoS One. 2011;6(7):e21890. doi: 10.1371/journal.pone.0021890. Epub 2011 Jul 15.

本文引用的文献

1
A novel approach for haplotype-based association analysis using family data.基于单体型的家系数据关联分析的新方法。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S45. doi: 10.1186/1471-2105-11-S1-S45.
2
Most parsimonious haplotype allele sharing determination.最简约单倍型等位基因共享判定
BMC Bioinformatics. 2009 Apr 21;10:115. doi: 10.1186/1471-2105-10-115.
3
Genetic association analysis with FAMHAP: a major program update.使用FAMHAP进行基因关联分析:主要程序更新
Bioinformatics. 2009 Jan 1;25(1):134-6. doi: 10.1093/bioinformatics/btn581. Epub 2008 Nov 9.
4
Likelihood-based association analysis for nuclear families and unrelated subjects with missing genotype data.针对有缺失基因型数据的核心家庭和无关个体的基于似然性的关联分析。
Hum Hered. 2008;66(2):87-98. doi: 10.1159/000119108. Epub 2008 Mar 31.
5
A new method for haplotype inference including full-sib information.一种包含全同胞信息的单倍型推断新方法。
Genetics. 2007 Nov;177(3):1929-40. doi: 10.1534/genetics.107.079525. Epub 2007 Oct 18.
6
Detecting haplotype effects in genomewide association studies.在全基因组关联研究中检测单倍型效应
Genet Epidemiol. 2007 Dec;31(8):803-12. doi: 10.1002/gepi.20242.
7
GENOME: a rapid coalescent-based whole genome simulator.基因组:一种基于快速合并的全基因组模拟器。
Bioinformatics. 2007 Jun 15;23(12):1565-7. doi: 10.1093/bioinformatics/btm138. Epub 2007 Apr 25.
8
Haplotype-based quantitative trait mapping using a clustering algorithm.使用聚类算法的基于单倍型的数量性状定位
BMC Bioinformatics. 2006 May 18;7:258. doi: 10.1186/1471-2105-7-258.
9
Haplotype association analysis for late onset diseases using nuclear family data.利用核心家系数据对迟发性疾病进行单倍型关联分析。
Genet Epidemiol. 2006 Apr;30(3):220-30. doi: 10.1002/gepi.20139.
10
Regression-based association analysis with clustered haplotypes through use of genotypes.通过使用基因型对聚类单倍型进行基于回归的关联分析。
Am J Hum Genet. 2006 Feb;78(2):231-42. doi: 10.1086/500025. Epub 2005 Dec 19.