• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

稳健逻辑回归缩小罕见和隐性易感性变异的赢家诅咒。

Robust logistic regression to narrow down the winner's curse for rare and recessive susceptibility variants.

出版信息

Brief Bioinform. 2017 Nov 1;18(6):962-972. doi: 10.1093/bib/bbw074.

DOI:10.1093/bib/bbw074
PMID:27543791
Abstract

Logistic regression is the most common technique used for genetic case-control association studies. A disadvantage of standard maximum likelihood estimators of the genotype relative risk (GRR) is their strong dependence on outlier subjects, for example, patients diagnosed at unusually young age. Robust methods are available to constrain outlier influence, but they are scarcely used in genetic studies. This article provides a non-intimidating introduction to robust logistic regression, and investigates its benefits and limitations in genetic association studies. We applied the bounded Huber and extended the R package 'robustbase' with the re-descending Hampel functions to down-weight outlier influence. Computer simulations were carried out to assess the type I error rate, mean squared error (MSE) and statistical power according to major characteristics of the genetic study and investigated markers. Simulations were complemented with the analysis of real data. Both standard and robust estimation controlled type I error rates. Standard logistic regression showed the highest power but standard GRR estimates also showed the largest bias and MSE, in particular for associated rare and recessive variants. For illustration, a recessive variant with a true GRR=6.32 and a minor allele frequency=0.05 investigated in a 1000 case/1000 control study by standard logistic regression resulted in power=0.60 and MSE=16.5. The corresponding figures for Huber-based estimation were power=0.51 and MSE=0.53. Overall, Hampel- and Huber-based GRR estimates did not differ much. Robust logistic regression may represent a valuable alternative to standard maximum likelihood estimation when the focus lies on risk prediction rather than identification of susceptibility variants.

摘要

逻辑回归是用于遗传病例对照关联研究的最常用技术。基因型相对风险(GRR)的标准极大似然估计的一个缺点是它们对异常值个体的强烈依赖,例如,在异常年轻的年龄被诊断出的患者。有可用的稳健方法来约束异常值的影响,但它们在遗传研究中很少使用。本文提供了对稳健逻辑回归的一种不令人生畏的介绍,并研究了其在遗传关联研究中的优势和局限性。我们应用了有界 Huber 和扩展的 R 包'robustbase',使用重新下降的 Hampel 函数来降低异常值的影响。进行了计算机模拟,以根据遗传研究和调查标记的主要特征评估Ⅰ型错误率、均方误差(MSE)和统计功效。模拟结果补充了真实数据的分析。标准和稳健估计都控制了Ⅰ型错误率。标准逻辑回归显示出最高的功效,但标准 GRR 估计也显示出最大的偏差和 MSE,特别是对于相关的罕见和隐性变体。为了说明问题,在一项 1000 例病例/1000 例对照研究中,对真实数据的分析中,对一个隐性变体进行了研究,该变体的真实 GRR=6.32,次要等位基因频率=0.05,标准逻辑回归的结果是功效=0.60,MSE=16.5。基于 Huber 的估计的相应数字为功效=0.51,MSE=0.53。总体而言,Hampel 和 Huber 基于的 GRR 估计值差异不大。当重点在于风险预测而不是识别易感性变体时,稳健逻辑回归可能是标准最大似然估计的一种有价值的替代方法。

相似文献

1
Robust logistic regression to narrow down the winner's curse for rare and recessive susceptibility variants.稳健逻辑回归缩小罕见和隐性易感性变异的赢家诅咒。
Brief Bioinform. 2017 Nov 1;18(6):962-972. doi: 10.1093/bib/bbw074.
2
Quantifying and correcting for the winner's curse in genetic association studies.在基因关联研究中对胜者之咒进行量化与校正。
Genet Epidemiol. 2009 Jul;33(5):453-62. doi: 10.1002/gepi.20398.
3
Review and further developments in statistical corrections for Winner's Curse in genetic association studies.遗传关联研究中赢家诅咒的统计校正方法的回顾与进一步发展。
PLoS Genet. 2023 Sep 18;19(9):e1010546. doi: 10.1371/journal.pgen.1010546. eCollection 2023 Sep.
4
Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies.全基因组关联研究中比值比的偏差校正估计量和置信区间
Biostatistics. 2008 Oct;9(4):621-34. doi: 10.1093/biostatistics/kxn001. Epub 2008 Feb 28.
5
An empirical investigation into the impact of winner's curse on estimates from Mendelian randomization.孟德尔随机化中赢家诅咒对估计值影响的实证研究
Int J Epidemiol. 2023 Aug 2;52(4):1209-1219. doi: 10.1093/ije/dyac233.
6
Power, false discovery rate and Winner's Curse in eQTL studies.基因表达数量性状位点(eQTL)研究中的权力、假发现率和赢家诅咒。
Nucleic Acids Res. 2018 Dec 14;46(22):e133. doi: 10.1093/nar/gky780.
7
Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data.胜者之咒校正和可变阈值法可提高基于全基因组关联研究汇总水平数据的多基因风险建模性能。
PLoS Genet. 2016 Dec 30;12(12):e1006493. doi: 10.1371/journal.pgen.1006493. eCollection 2016 Dec.
8
Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association.基于基因的罕见变异关联检验事后分析中偏差的说明、量化与校正
Front Genet. 2017 Sep 14;8:117. doi: 10.3389/fgene.2017.00117. eCollection 2017.
9
Robust Huber-LASSO for improved prediction of protein, metabolite and gene expression levels relying on individual genotype data.基于个体基因型数据,利用稳健 Huber-LASSO 提高蛋白质、代谢物和基因表达水平的预测能力。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa230.
10
Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies.从全基因组关联研究中估计复杂疾病潜在的易感变异总数。
PLoS One. 2010 Nov 17;5(11):e13898. doi: 10.1371/journal.pone.0013898.

引用本文的文献

1
Quick assessment for systematic test statistic inflation/deflation due to null model misspecifications in genome-wide environment interaction studies.全基因组环境互作研究中由于零模型误设导致系统检验统计膨胀/收缩的快速评估。
PLoS One. 2019 Jul 18;14(7):e0219825. doi: 10.1371/journal.pone.0219825. eCollection 2019.
2
A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets.基于模拟和真实数据集的全基因组关联研究中细胞类型调整方法的比较分析。
Brief Bioinform. 2019 Nov 27;20(6):2055-2065. doi: 10.1093/bib/bby068.