• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于模型的方法,用于捕捉遗传变异以进行未来的关联研究。

A model-based approach to capture genetic variation for future association studies.

作者信息

Eyheramendy Susana, Marchini Jonathan, McVean Gilean, Myers Simon, Donnelly Peter

机构信息

Department of Statistics, University of Oxford, Oxford, OX1 3TG, United Kingdom.

出版信息

Genome Res. 2007 Jan;17(1):88-95. doi: 10.1101/gr.5675406. Epub 2006 Nov 9.

DOI:10.1101/gr.5675406
PMID:17095708
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1716272/
Abstract

Genome-wide association studies are still constrained by the cost of genotyping. For this reason, the selection of a reduced set of markers or tags able to capture a significant proportion of the genetic variation is an important aspect of these studies. Most tagging SNP selection methods have been successful in capturing the genetic variation of the data from which the tags have been chosen. However, when these tags are used in an independent data set, a significant proportion of the remaining SNPs (non-tags) are not captured and, in most cases, there is no information on which SNPs are captured. We propose to use a probabilistic model to predict the non-tags based on a set of tags, as a way to capture genetic variation. An important advantage of this method is that it directly predicts the genotype of the non-tags with which we can test for association with the phenotype and which could help to elucidate the location of genes responsible for increasing disease susceptibility. Additionally, this method provides an estimate of the probabilities with which the predictions are made, which reflects the confidence of the probabilistic model. We also propose new methods to select the tagging SNPs. We empirically show by using HapMap data that our approach is able to capture significantly more genetic variation than methods based solely on a pairwise LD measure.

摘要

全基因组关联研究仍然受到基因分型成本的限制。因此,选择一组能够捕获相当比例遗传变异的简化标记或标签是这些研究的一个重要方面。大多数标签单核苷酸多态性(SNP)选择方法在捕获用于选择标签的数据的遗传变异方面都很成功。然而,当这些标签用于独立数据集时,相当比例的其余SNP(非标签)未被捕获,并且在大多数情况下,没有关于哪些SNP被捕获的信息。我们建议使用概率模型基于一组标签来预测非标签,以此作为捕获遗传变异的一种方法。该方法的一个重要优点是它直接预测非标签的基因型,我们可以用其来测试与表型的关联,这有助于阐明导致疾病易感性增加的基因的位置。此外,该方法提供了预测所基于的概率估计,这反映了概率模型的可信度。我们还提出了选择标签SNP的新方法。通过使用HapMap数据,我们通过实证表明,我们的方法比仅基于成对连锁不平衡(LD)测量的方法能够捕获显著更多的遗传变异。

相似文献

1
A model-based approach to capture genetic variation for future association studies.一种基于模型的方法,用于捕捉遗传变异以进行未来的关联研究。
Genome Res. 2007 Jan;17(1):88-95. doi: 10.1101/gr.5675406. Epub 2006 Nov 9.
2
Transferability of tag SNPs to capture common genetic variation in DNA repair genes across multiple populations.标签单核苷酸多态性在多个群体中捕获DNA修复基因常见遗传变异的可转移性。
Pac Symp Biocomput. 2006:478-86.
3
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
4
Efficiency and power in genetic association studies.基因关联研究中的效率与效能
Nat Genet. 2005 Nov;37(11):1217-23. doi: 10.1038/ng1669. Epub 2005 Oct 23.
5
SNPs, haplotypes, and model selection in a candidate gene region: the SIMPle analysis for multilocus data.候选基因区域中的单核苷酸多态性、单倍型及模型选择:多位点数据的简单分析
Genet Epidemiol. 2004 Dec;27(4):429-41. doi: 10.1002/gepi.20039.
6
A comparison of tagging methods and their tagging space.标签方法及其标签空间的比较。
Hum Mol Genet. 2005 Sep 15;14(18):2757-67. doi: 10.1093/hmg/ddi309. Epub 2005 Aug 15.
7
A tool for selecting SNPs for association studies based on observed linkage disequilibrium patterns.一种基于观察到的连锁不平衡模式选择单核苷酸多态性(SNP)用于关联研究的工具。
Pac Symp Biocomput. 2006:487-98.
8
Genome-wide selection of tag SNPs using multiple-marker correlation.使用多标记相关性进行全基因组标签单核苷酸多态性选择。
Bioinformatics. 2007 Dec 1;23(23):3178-84. doi: 10.1093/bioinformatics/btm496. Epub 2007 Nov 15.
9
Optimal selection of SNP markers for disease association studies.疾病关联研究中SNP标记的最佳选择。
Hum Hered. 2004;58(3-4):190-202. doi: 10.1159/000083546.
10
Singleton SNPs in the human genome and implications for genome-wide association studies.人类基因组中的单核苷酸多态性及其对全基因组关联研究的意义。
Eur J Hum Genet. 2008 Apr;16(4):506-15. doi: 10.1038/sj.ejhg.5201987. Epub 2008 Jan 16.

引用本文的文献

1
A general quantitative genetic model for haplotyping a complex trait in humans.一种用于人类复杂性状单体型分析的通用数量遗传模型。
Curr Genomics. 2007 Aug;8(5):343-50. doi: 10.2174/138920207782446179.
2
A statistical method for predicting classical HLA alleles from SNP data.一种从单核苷酸多态性(SNP)数据预测经典人类白细胞抗原(HLA)等位基因的统计方法。
Am J Hum Genet. 2008 Jan;82(1):48-56. doi: 10.1016/j.ajhg.2007.09.001.
3
Haplotyping a quantitative trait with a high-density map in experimental crosses.利用高密度图谱对实验杂交中的数量性状进行单体型分析。
PLoS One. 2007 Aug 15;2(8):e732. doi: 10.1371/journal.pone.0000732.
4
Comparison of ENCODE region SNPs between Cebu Filipino and Asian HapMap samples.宿务菲律宾人与亚洲HapMap样本中ENCODE区域单核苷酸多态性的比较。
J Hum Genet. 2007;52(9):729-737. doi: 10.1007/s10038-007-0175-9. Epub 2007 Jul 18.

本文引用的文献

1
Common deletion polymorphisms in the human genome.人类基因组中的常见缺失多态性。
Nat Genet. 2006 Jan;38(1):86-92. doi: 10.1038/ng1696.
2
Efficiency and power in genetic association studies.基因关联研究中的效率与效能
Nat Genet. 2005 Nov;37(11):1217-23. doi: 10.1038/ng1669. Epub 2005 Oct 23.
3
Genome-wide association studies: theoretical and practical concerns.全基因组关联研究:理论与实际问题
Nat Rev Genet. 2005 Feb;6(2):109-18. doi: 10.1038/nrg1522.
4
Selecting tagging SNPs for association studies using power calculations from genotype data.利用基因型数据的功效计算为关联研究选择标签单核苷酸多态性。
Hum Hered. 2004;57(3):156-70. doi: 10.1159/000079246.
5
Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.利用基因型数据进行单倍型块划分和标签单核苷酸多态性选择及其在关联研究中的应用。
Genome Res. 2004 May;14(5):908-16. doi: 10.1101/gr.1837404. Epub 2004 Apr 12.
6
The complex interplay among factors that influence allelic association.影响等位基因关联的因素之间复杂的相互作用。
Nat Rev Genet. 2004 Feb;5(2):89-100. doi: 10.1038/nrg1270.
7
Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data.利用单核苷酸多态性数据对连锁不平衡进行建模并识别重组热点。
Genetics. 2003 Dec;165(4):2213-33. doi: 10.1093/genetics/165.4.2213.
8
The International HapMap Project.国际人类基因组单体型图计划
Nature. 2003 Dec 18;426(6968):789-96. doi: 10.1038/nature02168.
9
Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.利用连锁不平衡选择用于关联分析的信息量最大的单核苷酸多态性集合。
Am J Hum Genet. 2004 Jan;74(1):106-20. doi: 10.1086/381000. Epub 2003 Dec 15.
10
Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power.利用单倍型标签检测由连锁不平衡引起的疾病关联:一类检验方法及统计效能的决定因素。
Hum Hered. 2003;56(1-3):18-31. doi: 10.1159/000073729.