• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强大的多标记关联测试:基于基因组距离的回归和逻辑回归的统一。

Powerful multi-marker association tests: unifying genomic distance-based regression and logistic regression.

机构信息

Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota 55455–0392, USA.

出版信息

Genet Epidemiol. 2010 Nov;34(7):680-8. doi: 10.1002/gepi.20529.

DOI:10.1002/gepi.20529
PMID:20976795
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3345567/
Abstract

To detect genetic association with common and complex diseases, many statistical tests have been proposed for candidate gene or genome-wide association studies with the case-control design. Due to linkage disequilibrium (LD), multi-marker association tests can gain power over single-marker tests with a Bonferroni multiple testing adjustment. Among many existing multi-marker association tests, most target to detect only one of many possible aspects in distributional differences between the genotypes of cases and controls, such as allele frequency differences, while a few new ones aim to target two or three aspects, all of which can be implemented in logistic regression. In contrast to logistic regression, a genomic distance-based regression (GDBR) approach aims to detect some high-order genotypic differences between cases and controls. A recent study has confirmed the high power of GDBR tests. At this moment, the popular logistic regression and the emerging GDBR approaches are completely unrelated; for example, one has to choose between the two. In this article, we reformulate GDBR as logistic regression, opening a venue to constructing other powerful tests while overcoming some limitations of GDBR. For example, asymptotic distributions can replace time-consuming permutations for deriving P-values and covariates, including gene-gene interactions, can be easily incorporated. Importantly, this reformulation facilitates combining GDBR with other existing methods in a unified framework of logistic regression. In particular, we show that Fisher's P-value combining method can boost statistical power by incorporating information from allele frequencies, Hardy-Weinberg disequilibrium, LD patterns, and other higher-order interactions among multi-markers as captured by GDBR.

摘要

为了检测常见和复杂疾病的遗传关联,已经提出了许多统计测试方法,用于基于病例对照设计的候选基因或全基因组关联研究。由于连锁不平衡(LD),多标记关联测试可以通过 Bonferroni 多重测试调整获得比单标记测试更强的功效。在许多现有的多标记关联测试中,大多数旨在检测病例和对照组基因型分布差异的许多可能方面之一,例如等位基因频率差异,而少数新方法旨在针对两个或三个方面,所有这些都可以在逻辑回归中实现。与逻辑回归相反,基于基因组距离的回归(GDBR)方法旨在检测病例和对照组之间某些高阶基因型差异。最近的一项研究证实了 GDBR 测试的高功效。目前,流行的逻辑回归和新兴的 GDBR 方法完全没有关系;例如,人们必须在两者之间做出选择。在本文中,我们将 GDBR 重新表述为逻辑回归,为构建其他强大的测试开辟了途径,同时克服了 GDBR 的一些限制。例如,可以用耗时的置换来代替渐近分布来推导出 P 值,并且可以轻松地包含协变量,包括基因-基因相互作用。重要的是,这种重新表述便于将 GDBR 与逻辑回归的统一框架中的其他现有方法相结合。特别是,我们表明,Fisher 的 P 值组合方法可以通过整合 GDBR 捕获的等位基因频率、Hardy-Weinberg 不平衡、LD 模式和其他多标记之间的高阶相互作用等信息来提高统计功效。

相似文献

1
Powerful multi-marker association tests: unifying genomic distance-based regression and logistic regression.强大的多标记关联测试:基于基因组距离的回归和逻辑回归的统一。
Genet Epidemiol. 2010 Nov;34(7):680-8. doi: 10.1002/gepi.20529.
2
Single-marker and two-marker association tests for unphased case-control genotype data, with a power comparison.单标记和双标记关联检验用于非相位病例对照基因型数据,并进行了效能比较。
Genet Epidemiol. 2010 Jan;34(1):67-77. doi: 10.1002/gepi.20436.
3
Relationship between genomic distance-based regression and kernel machine regression for multi-marker association testing.基于基因组距离的回归与核机器回归在多标记关联测试中的关系。
Genet Epidemiol. 2011 May;35(4):211-6. doi: 10.1002/gepi.20567.
4
A unified framework for detecting genetic association with multiple SNPs in a candidate gene or region: contrasting genotype scores and LD patterns between cases and controls.一种用于检测候选基因或区域中多个单核苷酸多态性(SNP)与遗传关联的统一框架:对比病例组和对照组之间的基因型得分和连锁不平衡模式。
Hum Hered. 2010;69(1):1-13. doi: 10.1159/000243149. Epub 2009 Oct 2.
5
Comparison of multimarker logistic regression models, with application to a genomewide scan of schizophrenia.多标志物逻辑回归模型的比较及其在精神分裂症全基因组扫描中的应用。
BMC Genet. 2010 Sep 9;11:80. doi: 10.1186/1471-2156-11-80.
6
The expected power of genome-wide linkage disequilibrium testing using single nucleotide polymorphism markers for detecting a low-frequency disease variant.使用单核苷酸多态性标记进行全基因组连锁不平衡检测以发现低频疾病变异的预期效能。
Ann Hum Genet. 2002 Jul;66(Pt 4):297-306. doi: 10.1017/S0003480002001197.
7
A new association test to test multiple-marker association.一种用于测试多标记关联的新关联测试。
Genet Epidemiol. 2009 Feb;33(2):164-71. doi: 10.1002/gepi.20369.
8
Evaluation of linkage disequilibrium measures between multi-allelic markers as predictors of linkage disequilibrium between markers and QTL.评估多等位基因标记之间的连锁不平衡度量,作为标记与数量性状基因座之间连锁不平衡的预测指标。
Genet Res. 2005 Aug;86(1):77-87. doi: 10.1017/S001667230500769X.
9
A powerful score test to detect positive selection in genome-wide scans.一种强大的评分检验方法,用于检测全基因组扫描中的正选择。
Eur J Hum Genet. 2010 Oct;18(10):1148-59. doi: 10.1038/ejhg.2010.60. Epub 2010 May 12.
10
Semiparametric Allelic Tests for Mapping Multiple Phenotypes: Binomial Regression and Mahalanobis Distance.用于定位多种表型的半参数等位基因检验:二项式回归和马氏距离
Genet Epidemiol. 2015 Dec;39(8):635-50. doi: 10.1002/gepi.21930. Epub 2015 Oct 23.

引用本文的文献

1
Selecting Genetic Variants and Interactions Associated with Amyotrophic Lateral Sclerosis: A Group LASSO Approach.选择与肌萎缩侧索硬化相关的基因变异和相互作用:一种分组套索方法。
J Pers Med. 2022 Aug 19;12(8):1330. doi: 10.3390/jpm12081330.
2
Detection of epigenetic field defects using a weighted epigenetic distance-based method.利用加权基于表观遗传距离的方法检测表观遗传场缺陷。
Nucleic Acids Res. 2019 Jan 10;47(1):e6. doi: 10.1093/nar/gky882.
3
Rare variants analysis using penalization methods for whole genome sequence data.使用惩罚方法对全基因组序列数据进行罕见变异分析。

本文引用的文献

1
Statistical tests of genetic association in the presence of gene-gene and gene-environment interactions.在存在基因-基因和基因-环境相互作用的情况下进行基因关联的统计检验。
Hum Hered. 2010;69(2):131-42. doi: 10.1159/000264450. Epub 2009 Dec 4.
2
Test selection with application to detecting disease association with multiple SNPs.应用于检测疾病与多个单核苷酸多态性(SNP)关联的测试选择。
Hum Hered. 2010;69(2):120-30. doi: 10.1159/000264449. Epub 2009 Dec 4.
3
Comparisons of multi-marker association methods to detect association between a candidate region and disease.
BMC Bioinformatics. 2015 Dec 4;16:405. doi: 10.1186/s12859-015-0825-4.
4
A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families.一种在核心家庭中纳入最优P值阈值算法的复杂疾病多单核苷酸多态性关联测试。
BMC Genomics. 2015 May 15;16(1):381. doi: 10.1186/s12864-015-1620-3.
5
Family-based association analysis: a fast and efficient method of multivariate association analysis with multiple variants.基于家系的关联分析:一种针对多个变异体进行多变量关联分析的快速有效方法。
BMC Bioinformatics. 2015 Feb 15;16:46. doi: 10.1186/s12859-015-0484-5.
6
GEE-based SNP set association test for continuous and discrete traits in family-based association studies.基于广义估计方程的单核苷酸多态性集合关联检验,用于家族关联研究中的连续和离散性状。
Genet Epidemiol. 2013 Dec;37(8):778-86. doi: 10.1002/gepi.21763. Epub 2013 Oct 25.
7
A fast multilocus test with adaptive SNP selection for large-scale genetic-association studies.一种用于大规模基因关联研究的具有适应性单核苷酸多态性选择的快速多位点检测方法。
Eur J Hum Genet. 2014 May;22(5):696-702. doi: 10.1038/ejhg.2013.201. Epub 2013 Sep 11.
8
Analysis of rare, exonic variation amongst subjects with autism spectrum disorders and population controls.分析自闭症谱系障碍患者和人口对照个体中罕见的外显子变异。
PLoS Genet. 2013 Apr;9(4):e1003443. doi: 10.1371/journal.pgen.1003443. Epub 2013 Apr 11.
9
SNP set association analysis for familial data.家族数据的单核苷酸多态性集合关联分析。
Genet Epidemiol. 2012 Dec;36(8):797-810. doi: 10.1002/gepi.21676. Epub 2012 Sep 11.
10
Similarity-based multimarker association tests for continuous traits.基于相似性的连续性状多标记关联测试。
Ann Hum Genet. 2012 May;76(3):246-60. doi: 10.1111/j.1469-1809.2012.00706.x.
比较多种标记物关联方法,以检测候选区域与疾病之间的关联。
Genet Epidemiol. 2010 Apr;34(3):201-12. doi: 10.1002/gepi.20448.
4
A unified framework for detecting genetic association with multiple SNPs in a candidate gene or region: contrasting genotype scores and LD patterns between cases and controls.一种用于检测候选基因或区域中多个单核苷酸多态性(SNP)与遗传关联的统一框架:对比病例组和对照组之间的基因型得分和连锁不平衡模式。
Hum Hered. 2010;69(1):1-13. doi: 10.1159/000243149. Epub 2009 Oct 2.
5
Single-marker and two-marker association tests for unphased case-control genotype data, with a power comparison.单标记和双标记关联检验用于非相位病例对照基因型数据,并进行了效能比较。
Genet Epidemiol. 2010 Jan;34(1):67-77. doi: 10.1002/gepi.20436.
6
Discovering genetic ancestry using spectral graph theory.利用谱图理论探寻遗传渊源。
Genet Epidemiol. 2010 Jan;34(1):51-9. doi: 10.1002/gepi.20434.
7
Genetic architecture of quantitative traits in mice, flies, and humans.小鼠、果蝇和人类数量性状的遗传结构。
Genome Res. 2009 May;19(5):723-33. doi: 10.1101/gr.086660.108.
8
Phase uncertainty in case-control association studies.病例对照关联研究中的相位不确定性。
Genet Epidemiol. 2009 Sep;33(6):463-78. doi: 10.1002/gepi.20399.
9
Asymptotic tests of association with multiple SNPs in linkage disequilibrium.与处于连锁不平衡状态的多个单核苷酸多态性(SNP)相关联的渐近检验。
Genet Epidemiol. 2009 Sep;33(6):497-507. doi: 10.1002/gepi.20402.
10
Genetic background comparison using distance-based regression, with applications in population stratification evaluation and adjustment.使用基于距离的回归进行遗传背景比较及其在群体分层评估与调整中的应用。
Genet Epidemiol. 2009 Jul;33(5):432-41. doi: 10.1002/gepi.20396.