• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单倍型关联分析和逻辑回归识别相关易感基因座能力的研究。

Investigation of the ability of haplotype association and logistic regression to identify associated susceptibility loci.

作者信息

North B V, Sham P C, Knight J, Martin E R, Curtis D

机构信息

Institute of Cancer Research, 15 Cotswold Road, Belmont, Sutton, Surrey SM2 5NG, UK.

出版信息

Ann Hum Genet. 2006 Nov;70(Pt 6):893-906. doi: 10.1111/j.1469-1809.2006.00301.x.

DOI:10.1111/j.1469-1809.2006.00301.x
PMID:17044864
Abstract

While finely spaced markers are increasingly being used in case-control association studies in attempts to identify susceptibility loci, not enough is yet known as to the optimal spacing of such markers, their likely power to detect association, the relative merits of single marker versus multimarker analysis, or which methods of analysis may be optimal. Some investigations of these issues have used markers simulated under different theoretical models of population evolution. However the HapMap project and other sources provide real datasets which can be used to obtain a more realistic view of the performance of these approaches. SNPs around APOE and from two HapMap regions were used to obtain information regarding linkage disequilibrium (LD) relationships between polymorphisms, and these real patterns of LD were used to simulate datasets such as would be obtained in case-control studies were these SNPs to influence susceptibility to disease. The datasets obtained were analysed using tests for heterogeneity of estimated haplotype frequencies and using logistic regression analyses in which only main effects from each marker were considered. All markers surrounding the putative susceptibility locus were analysed, using sets of either 1, 2, 3 or 4 markers at a time. Some markers within 150 kb of the susceptibility locus were able to detect association. At distances less than 100 kb there was no correlation between the distance from the susceptibility locus and the strength of evidence for association. When the average inter-locus spacing is 25 kb many loci would not be detected, while when the spacing is as low as 2 kb one can be fairly confident that at least one marker will be in strong enough LD with the susceptibility locus to enable association to be detected, if the susceptibility locus has a strong enough effect relative to the sample size. With an inter-locus spacing of 4 kb some susceptibility loci did not have a marker locus in strong LD, potentially undermining the ability to detect association. There was little difference in the performance of haplotype-based analysis compared with logistic regression considering effects of each marker as separate. Multimarker analysis on occasion produced results which were much more highly significant than single marker analysis, but only very rarely. Our results support the view that if markers are randomly selected then a spacing as low as 2 kb is desirable. Multimarker analysis can sometimes be more powerful than single marker analysis so both should be performed. However, because it is rare for multimarker analysis to be much more highly significant than single marker analysis one should strongly suspect that when such results occur they may be due to mistakes in genotyping or through some other artefact. Haplotype analysis may be more prone to such problems than logistic regression, suggesting that the latter method might be preferred.

摘要

尽管在病例对照关联研究中,人们越来越多地使用间距精细的标记物来试图识别易感基因座,但对于此类标记物的最佳间距、它们检测关联的可能效能、单标记物分析与多标记物分析的相对优点,或者哪种分析方法可能是最优的,目前所知仍不够充分。对这些问题的一些研究使用了在不同种群进化理论模型下模拟的标记物。然而,国际人类基因组单体型图计划(HapMap计划)及其他来源提供了真实数据集,可用于更现实地了解这些方法的性能。使用载脂蛋白E(APOE)周围以及来自两个HapMap区域的单核苷酸多态性(SNP)来获取有关多态性之间连锁不平衡(LD)关系的信息,并利用这些真实的LD模式来模拟数据集,就像在病例对照研究中如果这些SNP影响疾病易感性将会获得的数据集一样。对获得的数据集进行分析时,使用了估计单倍型频率的异质性检验,并使用了逻辑回归分析,其中仅考虑每个标记物的主效应。每次使用1、2、3或4个标记物的组合,对假定易感基因座周围的所有标记物进行分析。易感基因座150 kb范围内的一些标记物能够检测到关联。在距离小于100 kb时,与易感基因座的距离和关联证据强度之间没有相关性。当基因座间平均间距为25 kb时,许多基因座将无法被检测到,而当间距低至2 kb时,如果易感基因座相对于样本量有足够强的效应,人们可以相当有信心地认为至少有一个标记物与易感基因座的LD足够强,从而能够检测到关联。当基因座间间距为4 kb时,一些易感基因座没有处于强LD状态的标记基因座,这可能会削弱检测关联的能力。与将每个标记物的效应分别考虑的逻辑回归相比,基于单倍型的分析性能差异不大。多标记物分析偶尔会产生比单标记物分析显著得多的结果,但这种情况非常罕见。我们的结果支持这样一种观点,即如果随机选择标记物,那么低至2 kb的间距是可取的。多标记物分析有时可能比单标记物分析更有效力,因此两者都应进行。然而,由于多标记物分析比单标记物分析显著得多的情况很少见,所以当出现这样的结果时,人们应该强烈怀疑它们可能是由于基因分型错误或其他人为因素造成的。单倍型分析可能比逻辑回归更容易出现此类问题,这表明后一种方法可能更可取。

相似文献

1
Investigation of the ability of haplotype association and logistic regression to identify associated susceptibility loci.单倍型关联分析和逻辑回归识别相关易感基因座能力的研究。
Ann Hum Genet. 2006 Nov;70(Pt 6):893-906. doi: 10.1111/j.1469-1809.2006.00301.x.
2
Further investigation of linkage disequilibrium SNPs and their ability to identify associated susceptibility loci.对连锁不平衡单核苷酸多态性及其识别相关易感基因座能力的进一步研究。
Ann Hum Genet. 2004 May;68(Pt 3):240-8. doi: 10.1046/j.1529-8817.2004.00086.x.
3
Investigation into the ability of SNP chipsets and microsatellites to detect association with a disease locus.关于单核苷酸多态性(SNP)芯片组和微卫星检测与疾病位点关联能力的研究。
Ann Hum Genet. 2008 Jul;72(Pt 4):547-56. doi: 10.1111/j.1469-1809.2008.00434.x. Epub 2008 Mar 18.
4
Genome-wide association filtering using a highly locus-specific transmission/disequilibrium test.基于高度特异的连锁不平衡传递/不平衡测试的全基因组关联过滤。
Hum Genet. 2010 Sep;128(3):325-44. doi: 10.1007/s00439-010-0854-z. Epub 2010 Jul 6.
5
Comparison of multimarker logistic regression models, with application to a genomewide scan of schizophrenia.多标志物逻辑回归模型的比较及其在精神分裂症全基因组扫描中的应用。
BMC Genet. 2010 Sep 9;11:80. doi: 10.1186/1471-2156-11-80.
6
Detailed analysis of the relative power of direct and indirect association studies and the implications for their interpretation.直接关联研究与间接关联研究相对效力的详细分析及其解读的意义。
Hum Hered. 2007;64(1):63-73. doi: 10.1159/000101424. Epub 2007 Apr 27.
7
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
8
Haplotype and linkage disequilibrium architecture for human cancer-associated genes.人类癌症相关基因的单倍型和连锁不平衡结构
Genome Res. 2002 Dec;12(12):1846-53. doi: 10.1101/gr.483802.
9
Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map.利用高密度共识图谱对优质硬粒小麦育种系中粗粒小麦粉和意大利面颜色进行单标记和单倍型关联分析
PLoS One. 2017 Jan 30;12(1):e0170941. doi: 10.1371/journal.pone.0170941. eCollection 2017.
10
Linkage disequilibrium and haplotype block patterns in popcorn populations.连锁不平衡和爆米花群体中的单倍型块模式。
PLoS One. 2019 Sep 25;14(9):e0219417. doi: 10.1371/journal.pone.0219417. eCollection 2019.

引用本文的文献

1
Genomic prediction of carcass traits using different haplotype block partitioning methods in beef cattle.利用不同单倍型块划分方法对肉牛胴体性状进行基因组预测。
Evol Appl. 2022 Nov 14;15(12):2028-2042. doi: 10.1111/eva.13491. eCollection 2022 Dec.
2
The search for gene-gene interactions in genome-wide association studies: challenges in abundance of methods, practical considerations, and biological interpretation.全基因组关联研究中基因-基因相互作用的探索:方法众多带来的挑战、实际考量及生物学解释
Ann Transl Med. 2018 Apr;6(8):157. doi: 10.21037/atm.2018.04.05.
3
Genomic prediction of genetic merit using LD-based haplotypes in the Nordic Holstein population.
在北欧荷斯坦牛群体中使用基于连锁不平衡的单倍型对遗传价值进行基因组预测。
BMC Genomics. 2014 Dec 23;15(1):1171. doi: 10.1186/1471-2164-15-1171.
4
Detecting the footprints of divergent selection in oaks with linked markers.利用连锁标记检测栎属植物中的分歧选择痕迹。
Heredity (Edinb). 2012 Dec;109(6):361-71. doi: 10.1038/hdy.2012.51. Epub 2012 Sep 19.
5
A simple method for assessing the strength of evidence for association at the level of the whole gene.一种评估全基因水平关联证据强度的简单方法。
Adv Appl Bioinform Chem. 2008;1:115-20. doi: 10.2147/aabc.s4095. Epub 2008 Nov 17.
6
Power comparisons between similarity-based multilocus association methods, logistic regression, and score tests for haplotypes.基于相似性的多位点关联方法、逻辑回归和单倍型得分检验之间的功效比较。
Genet Epidemiol. 2009 Apr;33(3):183-97. doi: 10.1002/gepi.20364.
7
Investigation into the ability of SNP chipsets and microsatellites to detect association with a disease locus.关于单核苷酸多态性(SNP)芯片组和微卫星检测与疾病位点关联能力的研究。
Ann Hum Genet. 2008 Jul;72(Pt 4):547-56. doi: 10.1111/j.1469-1809.2008.00434.x. Epub 2008 Mar 18.
8
Comparison of artificial neural network analysis with other multimarker methods for detecting genetic association.人工神经网络分析与其他多标记物方法在检测基因关联方面的比较。
BMC Genet. 2007 Jul 18;8:49. doi: 10.1186/1471-2156-8-49.