• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于全基因组关联研究中泛化测试的强大统计框架,并应用于西班牙裔社区健康研究/拉丁裔研究(HCHS/SOL)。

A powerful statistical framework for generalization testing in GWAS, with application to the HCHS/SOL.

作者信息

Sofer Tamar, Heller Ruth, Bogomolov Marina, Avery Christy L, Graff Mariaelisa, North Kari E, Reiner Alex P, Thornton Timothy A, Rice Kenneth, Benjamini Yoav, Laurie Cathy C, Kerr Kathleen F

机构信息

Department of Biostatistics, University of Washington, Seattle, WA, USA.

Department of Statistics and Operations Research, Tel-Aviv University, Tel-Aviv, Israel.

出版信息

Genet Epidemiol. 2017 Apr;41(3):251-258. doi: 10.1002/gepi.22029. Epub 2017 Jan 15.

DOI:10.1002/gepi.22029
PMID:28090672
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5340573/
Abstract

In genome-wide association studies (GWAS), "generalization" is the replication of genotype-phenotype association in a population with different ancestry than the population in which it was first identified. Current practices for declaring generalizations rely on testing associations while controlling the family-wise error rate (FWER) in the discovery study, then separately controlling error measures in the follow-up study. This approach does not guarantee control over the FWER or false discovery rate (FDR) of the generalization null hypotheses. It also fails to leverage the two-stage design to increase power for detecting generalized associations. We provide a formal statistical framework for quantifying the evidence of generalization that accounts for the (in)consistency between the directions of associations in the discovery and follow-up studies. We develop the directional generalization FWER (FWER ) and FDR (FDR ) controlling r-values, which are used to declare associations as generalized. This framework extends to generalization testing when applied to a published list of Single Nucleotide Polymorphism-(SNP)-trait associations. Our methods control FWER or FDR under various SNP selection rules based on P-values in the discovery study. We find that it is often beneficial to use a more lenient P-value threshold than the genome-wide significance threshold. In a GWAS of total cholesterol in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), when testing all SNPs with P-values <5×10-8 (15 genomic regions) for generalization in a large GWAS of whites, we generalized SNPs from 15 regions. But when testing all SNPs with P-values <6.6×10-5 (89 regions), we generalized SNPs from 27 regions.

摘要

在全基因组关联研究(GWAS)中,“泛化”是指在与首次发现基因型-表型关联的人群具有不同祖先的人群中对该关联进行复制。目前宣布泛化的做法依赖于在发现研究中控制家族性错误率(FWER)的同时测试关联,然后在后续研究中分别控制错误度量。这种方法不能保证对泛化无效假设的FWER或错误发现率(FDR)进行控制。它也未能利用两阶段设计来提高检测泛化关联的功效。我们提供了一个正式的统计框架,用于量化泛化的证据,该框架考虑了发现研究和后续研究中关联方向之间的(不)一致性。我们开发了用于控制r值的方向泛化FWER(FWER )和FDR(FDR ),这些r值用于将关联声明为泛化。当应用于已发表的单核苷酸多态性-(SNP)-性状关联列表时,该框架扩展到泛化测试。我们的方法在基于发现研究中的P值的各种SNP选择规则下控制FWER或FDR。我们发现,使用比全基因组显著性阈值更宽松的P值阈值通常是有益的。在西班牙裔社区健康研究/拉丁裔研究(HCHS/SOL)中对总胆固醇进行的GWAS中,当在一项针对白人进行的大型GWAS中测试所有P值<5×10-8(15个基因组区域)的SNP的泛化情况时,我们从15个区域泛化了SNP。但是,当测试所有P值<6.6×10-5(89个区域)的SNP时,我们从27个区域泛化了SNP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee1/5340573/7049d0227f4f/nihms829142f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee1/5340573/7049d0227f4f/nihms829142f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee1/5340573/7049d0227f4f/nihms829142f1.jpg

相似文献

1
A powerful statistical framework for generalization testing in GWAS, with application to the HCHS/SOL.一种用于全基因组关联研究中泛化测试的强大统计框架,并应用于西班牙裔社区健康研究/拉丁裔研究(HCHS/SOL)。
Genet Epidemiol. 2017 Apr;41(3):251-258. doi: 10.1002/gepi.22029. Epub 2017 Jan 15.
2
Genetic architecture of lipid traits in the Hispanic community health study/study of Latinos.西班牙裔社区健康研究/拉丁裔研究中的脂质特征的遗传结构。
Lipids Health Dis. 2017 Oct 12;16(1):200. doi: 10.1186/s12944-017-0591-6.
3
Genetics of Type 2 Diabetes in U.S. Hispanic/Latino Individuals: Results From the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).美国西班牙裔/拉丁裔个体2型糖尿病的遗传学:西班牙裔社区健康研究/拉丁裔研究(HCHS/SOL)的结果
Diabetes. 2017 May;66(5):1419-1425. doi: 10.2337/db16-1150. Epub 2017 Mar 2.
4
Hidden Markov models for controlling false discovery rate in genome-wide association analysis.用于全基因组关联分析中控制错误发现率的隐马尔可夫模型
Methods Mol Biol. 2012;802:337-44. doi: 10.1007/978-1-61779-400-1_22.
5
Ancestry-specific associations identified in genome-wide combined-phenotype study of red blood cell traits emphasize benefits of diversity in genomics.全基因组综合表型研究中鉴定的与祖先相关的关联强调了基因组多样性的益处,这些关联与红细胞特征有关。
BMC Genomics. 2020 Mar 14;21(1):228. doi: 10.1186/s12864-020-6626-9.
6
Meta-Analysis of Genome-Wide Association Studies with Correlated Individuals: Application to the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).对具有相关性个体的全基因组关联研究的荟萃分析:应用于西班牙裔社区健康研究/拉丁裔研究(HCHS/SOL)
Genet Epidemiol. 2016 Sep;40(6):492-501. doi: 10.1002/gepi.21981. Epub 2016 Jun 3.
7
Generalizing polygenic risk scores from Europeans to Hispanics/Latinos.将多基因风险评分从欧洲人推广到西班牙裔/拉丁裔。
Genet Epidemiol. 2019 Feb;43(1):50-62. doi: 10.1002/gepi.22166. Epub 2018 Oct 15.
8
A powerful method for combining P-values in genomic studies.一种强大的基因组研究中结合 P 值的方法。
Genet Epidemiol. 2013 Dec;37(8):814-9. doi: 10.1002/gepi.21755. Epub 2013 Aug 19.
9
Exploiting Linkage Disequilibrium for Ultrahigh-Dimensional Genome-Wide Data with an Integrated Statistical Approach.利用连锁不平衡和综合统计方法处理超高维全基因组数据
Genetics. 2016 Feb;202(2):411-26. doi: 10.1534/genetics.115.179507. Epub 2015 Dec 12.
10
Control procedures and estimators of the false discovery rate and their application in low-dimensional settings: an empirical investigation.控制程序和虚假发现率的估计及其在低维环境中的应用:实证研究。
BMC Bioinformatics. 2018 Mar 2;19(1):78. doi: 10.1186/s12859-018-2081-x.

引用本文的文献

1
Methodological opportunities in genomic data analysis to advance health equity.基因组数据分析中促进健康公平的方法学机遇。
Nat Rev Genet. 2025 May 15. doi: 10.1038/s41576-025-00839-w.
2
Advancements in genetic research by the Hispanic Community Health Study/Study of Latinos: A 10-year retrospective review.西班牙裔社区健康研究/拉丁裔研究在基因研究方面的进展:十年回顾
HGG Adv. 2025 Jan 9;6(1):100376. doi: 10.1016/j.xhgg.2024.100376. Epub 2024 Oct 29.
3
Intracranial Volume Is Driven by Both Genetics and Early Life Exposures: The SOL-INCA-MRI Study.颅内容积受遗传因素和早期生活暴露的双重影响:SOL-INCA-MRI 研究。
Ethn Dis. 2024 Jul 2;34(2):103-112. doi: 10.18865/ed.34.2.103. eCollection 2024 Feb.
4
Metabolomic profiles of sleep-disordered breathing are associated with hypertension and diabetes mellitus development.睡眠呼吸障碍的代谢组学特征与高血压和糖尿病的发展有关。
Nat Commun. 2024 Feb 28;15(1):1845. doi: 10.1038/s41467-024-46019-y.
5
A polygenic risk score for Alzheimer's disease constructed using APOE-region variants has stronger association than APOE alleles with mild cognitive impairment in Hispanic/Latino adults in the U.S.使用 APOE 区域变异构建的阿尔茨海默病多基因风险评分与 APOE 等位基因相比,与美国西班牙裔/拉丁裔成年人的轻度认知障碍的相关性更强。
Alzheimers Res Ther. 2023 Aug 30;15(1):146. doi: 10.1186/s13195-023-01298-3.
6
Genome-wide association study of obstructive sleep apnoea in the Million Veteran Program uncovers genetic heterogeneity by sex.在百万退伍军人计划中进行的全基因组关联研究揭示了性别导致的阻塞性睡眠呼吸暂停的遗传异质性。
EBioMedicine. 2023 Apr;90:104536. doi: 10.1016/j.ebiom.2023.104536. Epub 2023 Mar 28.
7
Development and validation of a metabolite index for obstructive sleep apnea across race/ethnicities.跨种族/族裔的阻塞性睡眠呼吸暂停代谢物指数的开发和验证。
Sci Rep. 2022 Dec 16;12(1):21805. doi: 10.1038/s41598-022-26321-9.
8
Discerning asthma endotypes through comorbidity mapping.通过共病映射辨别哮喘表型。
Nat Commun. 2022 Nov 7;13(1):6712. doi: 10.1038/s41467-022-33628-8.
9
Proteomics of Coagulopathy Following Injury Reveals Limitations of Using Laboratory Assessment to Define Trauma-Induced Coagulopathy to Predict Massive Transfusion.创伤后凝血病的蛋白质组学揭示了使用实验室评估来定义创伤性凝血病以预测大量输血的局限性。
Ann Surg Open. 2022 Jun;3(2). doi: 10.1097/as9.0000000000000167. Epub 2022 May 25.
10
Mendelian randomization analysis of arsenic metabolism and pulmonary function within the Hispanic Community Health Study/Study of Latinos.基于西班牙裔社区健康研究/拉丁裔研究中的砷代谢与肺功能开展的孟德尔随机化分析。
Sci Rep. 2021 Jun 29;11(1):13470. doi: 10.1038/s41598-021-92911-8.

本文引用的文献

1
Genetic Diversity and Association Studies in US Hispanic/Latino Populations: Applications in the Hispanic Community Health Study/Study of Latinos.美国西班牙裔/拉丁裔人群的遗传多样性与关联研究:在西班牙裔社区健康研究/拉丁裔研究中的应用。
Am J Hum Genet. 2016 Jan 7;98(1):165-84. doi: 10.1016/j.ajhg.2015.12.001.
2
Deciding whether follow-up studies have replicated findings in a preliminary large-scale omics study.确定后续研究是否在初步的大规模组学研究中重复了研究结果。
Proc Natl Acad Sci U S A. 2014 Nov 18;111(46):16262-7. doi: 10.1073/pnas.1314814111. Epub 2014 Nov 3.
3
Discovery and refinement of loci associated with lipid levels.发现和完善与脂质水平相关的基因座。
Nat Genet. 2013 Nov;45(11):1274-1283. doi: 10.1038/ng.2797. Epub 2013 Oct 6.
4
Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma.全基因组关联研究鉴定出影响血浆中肝酶浓度的基因座。
Nat Genet. 2011 Oct 16;43(11):1131-8. doi: 10.1038/ng.970.
5
The N342S MYLIP polymorphism is associated with high total cholesterol and increased LDL receptor degradation in humans.N342S MYLIP 多态性与人类总胆固醇升高和 LDL 受体降解增加有关。
J Clin Invest. 2011 Aug;121(8):3062-71. doi: 10.1172/JCI45504. Epub 2011 Jul 18.
6
Biological, clinical and population relevance of 95 loci for blood lipids.95 个与血脂相关的生物学、临床和人群相关性位点。
Nature. 2010 Aug 5;466(7307):707-13. doi: 10.1038/nature09270.
7
Replication in genome-wide association studies.全基因组关联研究中的复制
Stat Sci. 2009 Nov 1;24(4):561-573. doi: 10.1214/09-STS290.
8
Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies.对于两阶段全基因组关联研究,联合分析比基于重复的分析更有效。
Nat Genet. 2006 Feb;38(2):209-13. doi: 10.1038/ng1706. Epub 2006 Jan 15.
9
Genomic control for association studies.关联研究的基因组控制
Biometrics. 1999 Dec;55(4):997-1004. doi: 10.1111/j.0006-341x.1999.00997.x.
10
Freely associating.自由联想。
Nat Genet. 1999 May;22(1):1-2. doi: 10.1038/8702.