• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中比值比的偏差校正估计量和置信区间

Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies.

作者信息

Zhong Hua, Prentice Ross L

机构信息

Department of Biostatistics, University of Washington, Seattle, WA 98105, USA.

出版信息

Biostatistics. 2008 Oct;9(4):621-34. doi: 10.1093/biostatistics/kxn001. Epub 2008 Feb 28.

DOI:10.1093/biostatistics/kxn001
PMID:18310059
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2536726/
Abstract

Genome-wide association studies (GWAS) provide an important approach to identifying common genetic variants that predispose to human disease. A typical GWAS may genotype hundreds of thousands of single nucleotide polymorphisms (SNPs) located throughout the human genome in a set of cases and controls. Logistic regression is often used to test for association between a SNP genotype and case versus control status, with corresponding odds ratios (ORs) typically reported only for those SNPs meeting selection criteria. However, when these estimates are based on the original data used to detect the variant, the results are affected by a selection bias sometimes referred to the "winner's curse" (Capen and others, 1971). The actual genetic association is typically overestimated. We show that such selection bias may be severe in the sense that the conditional expectation of the standard OR estimator may be quite far away from the underlying parameter. Also standard confidence intervals (CIs) may have far from the desired coverage rate for the selected ORs. We propose and evaluate 3 bias-reduced estimators, and also corresponding weighted estimators that combine corrected and uncorrected estimators, to reduce selection bias. Their corresponding CIs are also proposed. We study the performance of these estimators using simulated data sets and show that they reduce the bias and give CI coverage close to the desired level under various scenarios, even for associations having only small statistical power.

摘要

全基因组关联研究(GWAS)为识别易患人类疾病的常见基因变异提供了一种重要方法。典型的GWAS可能会对一组病例和对照中位于整个人类基因组中的数十万个单核苷酸多态性(SNP)进行基因分型。逻辑回归通常用于检验SNP基因型与病例对照状态之间的关联性,通常仅报告那些符合选择标准的SNP的相应优势比(OR)。然而,当这些估计基于用于检测变异的原始数据时,结果会受到一种有时被称为“胜者之咒”的选择偏倚的影响(卡彭等人,1971年)。实际的基因关联性通常被高估。我们表明,这种选择偏倚可能很严重,因为标准OR估计量的条件期望可能与基础参数相差甚远。而且标准置信区间(CI)对于所选OR的覆盖率可能远低于期望水平。我们提出并评估了3种偏差减少估计量,以及将校正估计量和未校正估计量结合起来的相应加权估计量,以减少选择偏倚。还提出了它们相应的CI。我们使用模拟数据集研究了这些估计量的性能,结果表明它们在各种情况下都能减少偏差,并使CI覆盖率接近期望水平,即使对于统计功效较小的关联性也是如此。

相似文献

1
Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies.全基因组关联研究中比值比的偏差校正估计量和置信区间
Biostatistics. 2008 Oct;9(4):621-34. doi: 10.1093/biostatistics/kxn001. Epub 2008 Feb 28.
2
Quantifying and correcting for the winner's curse in genetic association studies.在基因关联研究中对胜者之咒进行量化与校正。
Genet Epidemiol. 2009 Jul;33(5):453-62. doi: 10.1002/gepi.20398.
3
A flexible genome-wide bootstrap method that accounts for ranking and threshold-selection bias in GWAS interpretation and replication study design.一种灵活的全基因组引导方法,可解释 GWAS 解释和复制研究设计中的排名和阈值选择偏差。
Stat Med. 2011 Jul 10;30(15):1898-912. doi: 10.1002/sim.4228. Epub 2011 May 3.
4
Correcting "winner's curse" in odds ratios from genomewide association findings for major complex human diseases.纠正主要复杂人类疾病全基因组关联研究中优势比的“胜者的诅咒”。
Genet Epidemiol. 2010 Jan;34(1):78-91. doi: 10.1002/gepi.20437.
5
Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies.从全基因组关联研究中估计复杂疾病潜在的易感变异总数。
PLoS One. 2010 Nov 17;5(11):e13898. doi: 10.1371/journal.pone.0013898.
6
Review and further developments in statistical corrections for Winner's Curse in genetic association studies.遗传关联研究中赢家诅咒的统计校正方法的回顾与进一步发展。
PLoS Genet. 2023 Sep 18;19(9):e1010546. doi: 10.1371/journal.pgen.1010546. eCollection 2023 Sep.
7
Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data.胜者之咒校正和可变阈值法可提高基于全基因组关联研究汇总水平数据的多基因风险建模性能。
PLoS Genet. 2016 Dec 30;12(12):e1006493. doi: 10.1371/journal.pgen.1006493. eCollection 2016 Dec.
8
Empirical Bayes correction for the Winner's Curse in genetic association studies.遗传关联研究中赢家诅咒的经验贝叶斯校正。
Genet Epidemiol. 2013 Jan;37(1):60-8. doi: 10.1002/gepi.21683. Epub 2012 Sep 25.
9
Robust logistic regression to narrow down the winner's curse for rare and recessive susceptibility variants.稳健逻辑回归缩小罕见和隐性易感性变异的赢家诅咒。
Brief Bioinform. 2017 Nov 1;18(6):962-972. doi: 10.1093/bib/bbw074.
10
Analysis of untyped SNPs: maximum likelihood and imputation methods.非分型单核苷酸多态性分析:最大似然法和推断方法。
Genet Epidemiol. 2010 Dec;34(8):803-15. doi: 10.1002/gepi.20527.

引用本文的文献

1
Winner's curse in rare variant analysis: effect size estimation bias depends on effect direction and the association method used.罕见变异分析中的胜者诅咒:效应大小估计偏差取决于效应方向和所使用的关联方法。
Front Genet. 2025 Aug 8;16:1416673. doi: 10.3389/fgene.2025.1416673. eCollection 2025.
2
Whole exome sequencing identifies FANCM as a susceptibility gene for estrogen-receptor-negative breast cancer in Hispanic/Latina women.全外显子组测序确定FANCM是西班牙裔/拉丁裔女性雌激素受体阴性乳腺癌的一个易感基因。
Nat Commun. 2025 Aug 21;16(1):7816. doi: 10.1038/s41467-025-60564-0.
3
Adjustment of Conditional Bias in Hazard Ratios for Group Sequential Testing of Progression-Free Survival and Overall Survival.无进展生存期和总生存期成组序贯检验风险比中条件偏倚的调整
Stat Med. 2025 May;44(10-12):e70112. doi: 10.1002/sim.70112.
4
Characterization of Additive Gene-environment Interactions For Colorectal Cancer Risk.结直肠癌风险的加性基因-环境交互作用的特征。
Epidemiology. 2025 Jan 1;36(1):126-138. doi: 10.1097/EDE.0000000000001795. Epub 2024 Sep 24.
5
SumVg: Total Heritability Explained by All Variants in Genome-Wide Association Studies Based on Summary Statistics with Standard Error Estimates.SumVg:基于具有标准误差估计的汇总统计数据的全基因组关联研究中所有变异解释的总遗传力。
Int J Mol Sci. 2024 Jan 22;25(2):1347. doi: 10.3390/ijms25021347.
6
Genetic architecture and biology of youth-onset type 2 diabetes.青年起病 2 型糖尿病的遗传结构和生物学特性。
Nat Metab. 2024 Feb;6(2):226-237. doi: 10.1038/s42255-023-00970-0. Epub 2024 Jan 26.
7
East Asian-specific and cross-ancestry genome-wide meta-analyses provide mechanistic insights into peptic ulcer disease.东亚人群特异性及跨种族全基因组荟萃分析为消化性溃疡病提供了机制见解。
Nat Genet. 2023 Dec;55(12):2129-2138. doi: 10.1038/s41588-023-01569-7. Epub 2023 Nov 30.
8
Review and further developments in statistical corrections for Winner's Curse in genetic association studies.遗传关联研究中赢家诅咒的统计校正方法的回顾与进一步发展。
PLoS Genet. 2023 Sep 18;19(9):e1010546. doi: 10.1371/journal.pgen.1010546. eCollection 2023 Sep.
9
Evaluating significance of European-associated index SNPs in the East Asian population for 31 complex phenotypes.评估欧洲相关索引单核苷酸多态性在东亚人群中 31 种复杂表型的意义。
BMC Genomics. 2023 Jun 13;24(1):324. doi: 10.1186/s12864-023-09425-y.
10
Point estimation following a two-stage group sequential trial.两阶段分组序贯试验后的点估计。
Stat Methods Med Res. 2023 Feb;32(2):287-304. doi: 10.1177/09622802221137745. Epub 2022 Nov 16.

本文引用的文献

1
Flexible design for following up positive findings.针对阳性结果随访的灵活设计。
Am J Hum Genet. 2007 Sep;81(3):540-51. doi: 10.1086/520678. Epub 2007 Aug 3.
2
Genomewide association analysis of coronary artery disease.冠状动脉疾病的全基因组关联分析。
N Engl J Med. 2007 Aug 2;357(5):443-53. doi: 10.1056/NEJMoa072366. Epub 2007 Jul 18.
3
Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.对14000例七种常见疾病患者及3000例共享对照进行全基因组关联研究。
Nature. 2007 Jun 7;447(7145):661-78. doi: 10.1038/nature05911.
4
A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer.一项全基因组关联研究确定了FGFR2基因中的等位基因与散发性绝经后乳腺癌风险相关。
Nat Genet. 2007 Jul;39(7):870-4. doi: 10.1038/ng2075. Epub 2007 May 27.
5
Genome-wide association study identifies novel breast cancer susceptibility loci.全基因组关联研究确定了新的乳腺癌易感基因座。
Nature. 2007 Jun 28;447(7148):1087-93. doi: 10.1038/nature05887.
6
Overcoming the winner's curse: estimating penetrance parameters from case-control data.克服胜者的诅咒:从病例对照数据估计外显率参数。
Am J Hum Genet. 2007 Apr;80(4):605-15. doi: 10.1086/512821. Epub 2007 Feb 16.
7
Upward bias in odds ratio estimates from genome-wide association studies.全基因组关联研究中比值比估计的向上偏差。
Genet Epidemiol. 2007 May;31(4):288-95. doi: 10.1002/gepi.20209.
8
Locus-specific heritability estimation via the bootstrap in linkage scans for quantitative trait loci.在数量性状基因座连锁扫描中通过自助法进行基因座特异性遗传力估计。
Hum Hered. 2006;62(2):84-96. doi: 10.1159/000096096. Epub 2006 Oct 12.
9
Aspects of the design and analysis of high-dimensional SNP studies for disease risk estimation.用于疾病风险评估的高维单核苷酸多态性研究的设计与分析方面
Biostatistics. 2006 Jul;7(3):339-54. doi: 10.1093/biostatistics/kxj020. Epub 2006 Jan 27.
10
Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies.对于两阶段全基因组关联研究,联合分析比基于重复的分析更有效。
Nat Genet. 2006 Feb;38(2):209-13. doi: 10.1038/ng1706. Epub 2006 Jan 15.