On multi-marker tests for association in case-control studies.

Affiliations

Department of Biostatistics, Johns Hopkins University, Baltimore, MD, USA.

Mathematical Institute, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.

Publication information

Front Genet. 2013 Dec 16;4:252. doi: 10.3389/fgene.2013.00252. eCollection 2013.

DOI: 10.3389/fgene.2013.00252
PMID: 24379823
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3863805/
Abstract

Genome-wide association studies (GWAs) have identified thousands of DNA loci associated with a variety of traits. Statistical inference is almost always based on single marker hypothesis tests of association and the respective p-values with Bonferroni correction. Since commercially available genomic arrays interrogate hundreds of thousands or even millions of loci simultaneously, many causal yet undetected loci are believed to exist because the conditional power to achieve a genome-wide significance level can be low, in particular for markers with small effect sizes and low minor allele frequencies and in studies with modest sample size. However, the correlation between neighboring markers in the human genome due to linkage disequilibrium (LD) resulting in correlated marker test statistics can be incorporated into multi-marker hypothesis tests, thereby increasing power to detect association. Herein, we establish a theoretical benchmark by quantifying the maximum power achievable for multi-marker tests of association in case-control studies, achievable only when the causal marker is known. Using that genotype correlations within an LD block translate into an asymptotically multivariate normal distribution for score test statistics, we develop a set of weights for the markers that maximize the non-centrality parameter, and assess the relative loss of power for other approaches. We find that the method of Conneely and Boehnke (2007) based on the maximum absolute test statistic observed in an LD block is a practical and powerful method in a variety of settings. We also explore the effect on the power that prior biological or functional knowledge used to narrow down the locus of the causal marker can have, and conclude that this prior knowledge has to be very strong and specific for the power to approach the maximum achievable level, or even beat the power observed for methods such as the one proposed by Conneely and Boehnke (2007).
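
The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes the marker z-statistics in an LD block are multivariate normal with a known correlation matrix, and that under the alternative their mean vector `mu` is proportional to each marker's correlation with the causal marker. The helper names (`ar1_corr`, `max_abs_pvalue`, `bonferroni_pvalue`, `optimal_weights`, `noncentrality`) and the AR(1) correlation structure are hypothetical choices made for illustration. The maximum-absolute-statistic p-value is approximated here by Monte Carlo simulation rather than by the numerical integration used in Conneely and Boehnke (2007).

```python
from math import erfc, sqrt

import numpy as np


def ar1_corr(m, rho):
    """Toy LD block (assumption): correlation rho**|i-j| between markers i and j."""
    idx = np.arange(m)
    return rho ** np.abs(idx[:, None] - idx[None, :])


def max_abs_pvalue(z, corr, n_draws=200_000, seed=0):
    """Monte Carlo estimate of P(max_i |Z_i| >= max_i |z_i|) under Z ~ N(0, corr)."""
    rng = np.random.default_rng(seed)
    chol = np.linalg.cholesky(corr)
    # Rows of null_draws have covariance chol @ chol.T == corr.
    null_draws = rng.standard_normal((n_draws, len(z))) @ chol.T
    null_max = np.abs(null_draws).max(axis=1)
    return float(np.mean(null_max >= np.abs(z).max()))


def bonferroni_pvalue(z):
    """Smallest two-sided single-marker p-value times the number of markers."""
    p_min = erfc(np.abs(z).max() / sqrt(2))
    return min(1.0, len(z) * p_min)


def noncentrality(w, mu, corr):
    """Mean of the standardized linear combination w'Z / sqrt(w' corr w)."""
    return float(w @ mu / np.sqrt(w @ corr @ w))


def optimal_weights(mu, corr):
    """Weights proportional to corr^{-1} mu, which maximize noncentrality()."""
    w = np.linalg.solve(corr, mu)
    return w / np.sqrt(w @ corr @ w)


if __name__ == "__main__":
    corr = ar1_corr(5, 0.8)
    z = np.array([0.5, 1.2, 3.0, 1.0, -0.3])  # made-up observed statistics
    print("max-abs p-value:", max_abs_pvalue(z, corr))
    print("Bonferroni p-value:", bonferroni_pvalue(z))

    causal, delta = 2, 3.0           # hypothetical causal marker and effect
    mu = delta * corr[:, causal]     # other markers inherit signal through LD
    w = optimal_weights(mu, corr)
    print("optimal weights:", np.round(w, 6))
    print("noncentrality, optimal:", noncentrality(w, mu, corr))
    print("noncentrality, equal weights:", noncentrality(np.ones(5), mu, corr))
```

Under these assumptions the optimal weights put all their mass on the causal marker, which is consistent with the abstract's remark that the maximum power is achievable only when the causal marker is known. The maximum-absolute-statistic p-value accounts for the correlation between markers, so in a correlated block it is less conservative than the Bonferroni correction.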

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a8a/3863805/dd81be17ab0f/fgene-04-00252-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a8a/3863805/ab88e8849cc3/fgene-04-00252-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a8a/3863805/5395464f6d55/fgene-04-00252-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a8a/3863805/eb2dbe1e8470/fgene-04-00252-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a8a/3863805/dd2af69afba0/fgene-04-00252-g0005.jpg
