• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于基因关联研究中秩归一化的完全调整两阶段程序。

A fully adjusted two-stage procedure for rank-normalization in genetic association studies.

作者信息

Sofer Tamar, Zheng Xiuwen, Gogarten Stephanie M, Laurie Cecelia A, Grinde Kelsey, Shaffer John R, Shungin Dmitry, O'Connell Jeffrey R, Durazo-Arvizo Ramon A, Raffield Laura, Lange Leslie, Musani Solomon, Vasan Ramachandran S, Cupples L Adrienne, Reiner Alexander P, Laurie Cathy C, Rice Kenneth M

机构信息

Department of Medicine, Harvard Medical School, Boston, Massachusetts.

Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, Massachusetts.

出版信息

Genet Epidemiol. 2019 Apr;43(3):263-275. doi: 10.1002/gepi.22188. Epub 2019 Jan 17.

DOI:10.1002/gepi.22188
PMID:30653739
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6416071/
Abstract

When testing genotype-phenotype associations using linear regression, departure of the trait distribution from normality can impact both Type I error rate control and statistical power, with worse consequences for rarer variants. Because genotypes are expected to have small effects (if any) investigators now routinely use a two-stage method, in which they first regress the trait on covariates, obtain residuals, rank-normalize them, and then use the rank-normalized residuals in association analysis with the genotypes. Potential confounding signals are assumed to be removed at the first stage, so in practice, no further adjustment is done in the second stage. Here, we show that this widely used approach can lead to tests with undesirable statistical properties, due to both combination of a mis-specified mean-variance relationship and remaining covariate associations between the rank-normalized residuals and genotypes. We demonstrate these properties theoretically, and also in applications to genome-wide and whole-genome sequencing association studies. We further propose and evaluate an alternative fully adjusted two-stage approach that adjusts for covariates both when residuals are obtained and in the subsequent association test. This method can reduce excess Type I errors and improve statistical power.

摘要

在使用线性回归测试基因型与表型的关联时,性状分布偏离正态性会影响I型错误率控制和统计功效,对罕见变异的影响更为严重。由于基因型预计具有较小的效应(如果有),研究人员现在通常使用两阶段方法,即首先将性状对协变量进行回归,获得残差,对其进行秩归一化,然后在与基因型的关联分析中使用秩归一化后的残差。潜在的混杂信号被认为在第一阶段已被去除,因此在实际操作中,第二阶段不再进行进一步调整。在这里,我们表明,由于错误指定的均值 - 方差关系以及秩归一化残差与基因型之间剩余的协变量关联的组合,这种广泛使用的方法可能导致具有不良统计特性的检验。我们从理论上证明了这些特性,并在全基因组和全基因组测序关联研究的应用中也进行了证明。我们进一步提出并评估了一种替代的完全调整两阶段方法,该方法在获得残差时以及在随后的关联检验中都对协变量进行调整。这种方法可以减少过多的I型错误并提高统计功效。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/2373909a2d61/nihms-1004041-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/d0e0f01fd171/nihms-1004041-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/ae5ccd280480/nihms-1004041-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/d4312dc17007/nihms-1004041-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/2373909a2d61/nihms-1004041-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/d0e0f01fd171/nihms-1004041-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/ae5ccd280480/nihms-1004041-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/d4312dc17007/nihms-1004041-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f41/6416071/2373909a2d61/nihms-1004041-f0004.jpg

相似文献

1
A fully adjusted two-stage procedure for rank-normalization in genetic association studies.一种用于基因关联研究中秩归一化的完全调整两阶段程序。
Genet Epidemiol. 2019 Apr;43(3):263-275. doi: 10.1002/gepi.22188. Epub 2019 Jan 17.
2
A rank-based normalization method with the fully adjusted full-stage procedure in genetic association studies.基于秩的标准化方法与全调整全阶段程序在遗传关联研究中的应用。
PLoS One. 2020 Jun 19;15(6):e0233847. doi: 10.1371/journal.pone.0233847. eCollection 2020.
3
Are your covariates under control? How normalization can re-introduce covariate effects.你的协变量得到控制了吗?标准化如何重新引入协变量效应。
Eur J Hum Genet. 2018 Aug;26(8):1194-1201. doi: 10.1038/s41431-018-0159-6. Epub 2018 Apr 30.
4
A novel application of data-consistent inversion to overcome spurious inference in genome-wide association studies.一种新颖的数据一致性反演方法在全基因组关联研究中克服虚假推断的应用。
Genet Epidemiol. 2024 Sep;48(6):270-288. doi: 10.1002/gepi.22563. Epub 2024 Apr 21.
5
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.
6
Power loss due to testing association between covariate-adjusted traits and genetic variants.由于测试协变量调整后的特征与遗传变异之间的关联而导致的功率损失。
Genet Epidemiol. 2020 Sep;44(6):579-588. doi: 10.1002/gepi.22325. Epub 2020 Jun 8.
7
Adjustment for covariates using summary statistics of genome-wide association studies.使用全基因组关联研究的汇总统计量对协变量进行调整。
Genet Epidemiol. 2018 Dec;42(8):812-825. doi: 10.1002/gepi.22148. Epub 2018 Sep 20.
8
Nonparametric evaluation of quantitative traits in population-based association studies when the genetic model is unknown.基于人群的关联研究中遗传模型未知时数量性状的非参数评估。
PLoS One. 2012;7(2):e31242. doi: 10.1371/journal.pone.0031242. Epub 2012 Feb 21.
9
A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype.基于统一模型的多因素降维方法识别与生存表型相关的基因-基因相互作用的比较研究。
BioData Min. 2021 Mar 1;14(1):17. doi: 10.1186/s13040-021-00248-9.
10
Two-stage extreme phenotype sequencing design for discovering and testing common and rare genetic variants: efficiency and power.用于发现和测试常见及罕见遗传变异的两阶段极端表型测序设计:效率与效能
Hum Hered. 2012;73(3):139-47. doi: 10.1159/000337300. Epub 2012 Jun 7.

引用本文的文献

1
Disentangling multivariate relationships between cognition, language and social traits: structures of G, E, and .理清认知、语言和社会特质之间的多变量关系:G、E和……的结构
bioRxiv. 2025 Jul 29:2025.07.26.666154. doi: 10.1101/2025.07.26.666154.
2
A Robust Association Test Leveraging Unknown Genetic Interactions: Application to Cystic Fibrosis Lung Disease.一种利用未知基因相互作用的稳健关联测试:在囊性纤维化肺病中的应用。
Genet Epidemiol. 2025 Jul;49(5):e70013. doi: 10.1002/gepi.70013.
3
Whole genome sequencing analysis of body mass index identifies novel African ancestry-specific risk allele.

本文引用的文献

1
Are your covariates under control? How normalization can re-introduce covariate effects.你的协变量得到控制了吗?标准化如何重新引入协变量效应。
Eur J Hum Genet. 2018 Aug;26(8):1194-1201. doi: 10.1038/s41431-018-0159-6. Epub 2018 Apr 30.
2
Genome-wide association analyses using electronic health records identify new loci influencing blood pressure variation.利用电子健康记录进行的全基因组关联分析确定了影响血压变异的新基因座。
Nat Genet. 2017 Jan;49(1):54-64. doi: 10.1038/ng.3715. Epub 2016 Nov 14.
3
Large-Scale Exome-wide Association Analysis Identifies Loci for White Blood Cell Traits and Pleiotropy with Immune-Mediated Diseases.
体重指数的全基因组测序分析确定了新的非洲血统特异性风险等位基因。
Nat Commun. 2025 Apr 11;16(1):3470. doi: 10.1038/s41467-025-58420-2.
4
On the analysis of metabolite quantitative trait loci: Impact of different data transformations and study designs.代谢物数量性状位点分析:不同数据转换和研究设计的影响
Sci Adv. 2025 Apr 11;11(15):eadp4532. doi: 10.1126/sciadv.adp4532.
5
Consumer Product Chemical Mixtures and Oxylipin-Mediated Inflammation and Oxidative Stress during Early Pregnancy: Findings from a Large US Pregnancy Cohort.消费品化学混合物与妊娠早期氧化脂质介导的炎症和氧化应激:来自美国一个大型妊娠队列的研究结果。
Environ Sci Technol. 2025 Feb 18;59(6):2987-2999. doi: 10.1021/acs.est.4c10390. Epub 2025 Feb 6.
6
Modifies Chronic Airway Infection Risk in Cystic Fibrosis.改变囊性纤维化患者慢性气道感染风险。
Ann Am Thorac Soc. 2025 May;22(5):715-723. doi: 10.1513/AnnalsATS.202408-868OC.
7
Gene-Excessive Sleepiness Interactions Suggest Treatment Targets for Obstructive Sleep Apnea Subtype.基因与过度嗜睡的相互作用提示阻塞性睡眠呼吸暂停亚型的治疗靶点。
medRxiv. 2024 Oct 28:2024.10.25.24316158. doi: 10.1101/2024.10.25.24316158.
8
Whole-genome Sequencing Association Analysis of Quantitative Platelet Traits in A Large Cohort of β-thalassemia.一大群β地中海贫血患者血小板定量性状的全基因组测序关联分析
Genomics Proteomics Bioinformatics. 2025 May 30;23(2). doi: 10.1093/gpbjnl/qzae065.
9
Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans.基于电子健康记录数据的蛋白质基因组分析揭示了美国黑人与疾病相关的变异。
J Clin Invest. 2024 Sep 24;134(21):e181802. doi: 10.1172/JCI181802.
10
Whole genome sequencing based analysis of inflammation biomarkers in the Trans-Omics for Precision Medicine (TOPMed) consortium.基于全基因组测序的精准医学转化研究联盟(TOPMed)炎症生物标志物分析。
Hum Mol Genet. 2024 Aug 6;33(16):1429-1441. doi: 10.1093/hmg/ddae050.
大规模外显子组全关联分析确定白细胞性状位点及与免疫介导疾病的多效性。
Am J Hum Genet. 2016 Jul 7;99(1):22-39. doi: 10.1016/j.ajhg.2016.05.003. Epub 2016 Jun 23.
4
The effect of phenotypic outliers and non-normality on rare-variant association testing.表型异常值和非正态性对罕见变异关联检验的影响。
Eur J Hum Genet. 2016 Aug;24(8):1188-94. doi: 10.1038/ejhg.2015.270. Epub 2016 Jan 6.
5
Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs.用于发现罕见变异关联的荟萃分析:统计方法与软件程序
Am J Hum Genet. 2015 Jul 2;97(1):35-53. doi: 10.1016/j.ajhg.2015.05.001. Epub 2015 Jun 18.
6
New genetic loci link adipose and insulin biology to body fat distribution.新的遗传位点将脂肪和胰岛素生物学与体脂肪分布联系起来。
Nature. 2015 Feb 12;518(7538):187-196. doi: 10.1038/nature14132.
7
Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol.全外显子组测序鉴定出与 LDL 胆固醇相关的罕见和低频编码变异。
Am J Hum Genet. 2014 Feb 6;94(2):233-45. doi: 10.1016/j.ajhg.2014.01.010.
8
Genome-wide complex trait analysis (GCTA): methods, data analyses, and interpretations.全基因组复杂性状分析(GCTA):方法、数据分析及解读
Methods Mol Biol. 2013;1019:215-36. doi: 10.1007/978-1-62703-447-0_9.
9
Loss of power in two-stage residual-outcome regression analysis in genetic association studies.基因关联研究中两阶段残差-结果回归分析的效能损失
Genet Epidemiol. 2012 Dec;36(8):890-4. doi: 10.1002/gepi.21671. Epub 2012 Aug 31.
10
Meta-analysis identifies common variants associated with body mass index in east Asians.荟萃分析鉴定出与东亚人体重指数相关的常见变异。
Nat Genet. 2012 Feb 19;44(3):307-11. doi: 10.1038/ng.1087.