• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 GWAS 汇总统计数据估计复杂性状遗传相关性的方法比较。

Comparison of methods for estimating genetic correlation between complex traits using GWAS summary statistics.

机构信息

Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA.

Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT 06510, USA.

出版信息

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa442.

DOI:10.1093/bib/bbaa442
PMID:33497438
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8425307/
Abstract

Genetic correlation is the correlation of phenotypic effects by genetic variants across the genome on two phenotypes. It is an informative metric to quantify the overall genetic similarity between complex traits, which provides insights into their polygenic genetic architecture. Several methods have been proposed to estimate genetic correlation based on data collected from genome-wide association studies (GWAS). Due to the easy access of GWAS summary statistics and computational efficiency, methods only requiring GWAS summary statistics as input have become more popular than methods utilizing individual-level genotype data. Here, we present a benchmark study for different summary-statistics-based genetic correlation estimation methods through simulation and real data applications. We focus on two major technical challenges in estimating genetic correlation: marker dependency caused by linkage disequilibrium (LD) and sample overlap between different studies. To assess the performance of different methods in the presence of these two challenges, we first conducted comprehensive simulations with diverse LD patterns and sample overlaps. Then we applied these methods to real GWAS summary statistics for a wide spectrum of complex traits. Based on these experiments, we conclude that methods relying on accurate LD estimation are less robust in real data applications due to the imprecision of LD obtained from reference panels. Our findings offer guidance on how to choose appropriate methods for genetic correlation estimation in post-GWAS analysis.

摘要

遗传相关是指基因组中遗传变异对两种表型的表型效应的相关性。它是量化复杂性状之间整体遗传相似性的一种有用指标,为其多基因遗传结构提供了深入了解。已经提出了几种基于从全基因组关联研究(GWAS)收集的数据估计遗传相关的方法。由于 GWAS 汇总统计数据易于获取和计算效率高,因此仅需要 GWAS 汇总统计数据作为输入的方法比利用个体水平基因型数据的方法更受欢迎。在这里,我们通过模拟和真实数据应用对不同基于汇总统计数据的遗传相关估计方法进行了基准研究。我们重点关注估计遗传相关时面临的两个主要技术挑战:由连锁不平衡(LD)引起的标记依赖性和不同研究之间的样本重叠。为了评估在存在这两个挑战的情况下不同方法的性能,我们首先进行了具有不同 LD 模式和样本重叠的综合模拟。然后,我们将这些方法应用于广泛的复杂性状的真实 GWAS 汇总统计数据。基于这些实验,我们得出结论,由于参考面板中获得的 LD 不精确,依赖于准确 LD 估计的方法在真实数据应用中不太稳健。我们的研究结果为 GWAS 后分析中遗传相关估计提供了如何选择适当方法的指导。

相似文献

1
Comparison of methods for estimating genetic correlation between complex traits using GWAS summary statistics.利用 GWAS 汇总统计数据估计复杂性状遗传相关性的方法比较。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa442.
2
Benchmarking of local genetic correlation estimation methods using summary statistics from genome-wide association studies.使用全基因组关联研究的汇总统计数据对局部遗传相关估计方法进行基准测试。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad407.
3
Cross-trait prediction accuracy of summary statistics in genome-wide association studies.全基因组关联研究中汇总统计数据的跨性状预测准确性。
Biometrics. 2023 Jun;79(2):841-853. doi: 10.1111/biom.13661. Epub 2022 Mar 30.
4
PhenoSpD: an integrated toolkit for phenotypic correlation estimation and multiple testing correction using GWAS summary statistics.PhenoSpD:一个整合的工具包,用于使用 GWAS 汇总统计数据进行表型相关性估计和多重检验校正。
Gigascience. 2018 Aug 1;7(8):giy090. doi: 10.1093/gigascience/giy090.
5
Estimating colocalization probability from limited summary statistics.从有限的汇总统计数据中估计共定位概率。
BMC Bioinformatics. 2021 May 17;22(1):254. doi: 10.1186/s12859-021-04170-z.
6
Increasing the resolution and precision of psychiatric genome-wide association studies by re-imputing summary statistics using a large, diverse reference panel.通过使用大型、多样化的参考面板重新估算汇总统计数据,提高精神疾病全基因组关联研究的分辨率和精度。
Am J Med Genet B Neuropsychiatr Genet. 2021 Jan;186(1):16-27. doi: 10.1002/ajmg.b.32834. Epub 2021 Feb 11.
7
Non-parametric Polygenic Risk Prediction via Partitioned GWAS Summary Statistics.基于分区 GWAS 汇总统计量的非参数多基因风险预测。
Am J Hum Genet. 2020 Jul 2;107(1):46-59. doi: 10.1016/j.ajhg.2020.05.004. Epub 2020 May 28.
8
An atlas of genetic correlations across human diseases and traits.人类疾病与性状的遗传相关性图谱。
Nat Genet. 2015 Nov;47(11):1236-41. doi: 10.1038/ng.3406. Epub 2015 Sep 28.
9
GAUSS: a summary-statistics-based R package for accurate estimation of linkage disequilibrium for variants, Gaussian imputation, and TWAS analysis of cosmopolitan cohorts.GAUSS:一个基于汇总统计的 R 包,用于准确估计变体的连锁不平衡、高斯插补以及世界性队列的 TWAS 分析。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae203.
10
Assumptions about frequency-dependent architectures of complex traits bias measures of functional enrichment.关于复杂性状的频率相关结构的假设会影响功能富集的度量。
Genet Epidemiol. 2021 Sep;45(6):621-632. doi: 10.1002/gepi.22388. Epub 2021 Jun 22.

引用本文的文献

1
Genome-Wide Association Studies of Delay Discounting and Impulsive Personality Traits in Children From the Adolescent Behavior and Cognitive Development Study.基于青少年行为与认知发展研究的儿童延迟折扣与冲动人格特质的全基因组关联研究
Genes Brain Behav. 2025 Aug;24(4):e70033. doi: 10.1111/gbb.70033.
2
The Use of AI for Phenotype-Genotype Mapping.人工智能在表型-基因型映射中的应用。
Methods Mol Biol. 2025;2952:369-410. doi: 10.1007/978-1-0716-4690-8_21.
3
Genetic Correlation and Mendelian Randomization Analyses Support Causal Relationships Between Instant Coffee and Age-Related Macular Degeneration.遗传相关性和孟德尔随机化分析支持速溶咖啡与年龄相关性黄斑变性之间的因果关系。
Food Sci Nutr. 2025 Jun 14;13(6):e70439. doi: 10.1002/fsn3.70439. eCollection 2025 Jun.
4
MUTATE: a human genetic atlas of multiorgan artificial intelligence endophenotypes using genome-wide association summary statistics.MUTATE:一个利用全基因组关联汇总统计数据构建的多器官人工智能内表型人类遗传图谱。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf125.
5
Elucidating shared genetic association between female body mass index and preeclampsia.阐明女性体重指数与先兆子痫之间的共同遗传关联。
Commun Biol. 2025 Feb 27;8(1):322. doi: 10.1038/s42003-025-07726-4.
6
Investigating shared risk variants and genetic etiology between Alzheimer's disease and three stress-related psychiatric disorders: a large-scale genome-wide cross-trait analysis.探究阿尔茨海默病与三种应激相关精神障碍之间的共享风险变异和遗传病因:一项大规模全基因组跨性状分析。
Front Aging. 2025 Feb 5;6:1488528. doi: 10.3389/fragi.2025.1488528. eCollection 2025.
7
Association of Metabolic Diseases and Moderate Fat Intake with Myocardial Infarction Risk.代谢疾病及适度脂肪摄入与心肌梗死风险的关联
Nutrients. 2024 Dec 11;16(24):4273. doi: 10.3390/nu16244273.
8
GWAShug: a comprehensive platform for decoding the shared genetic basis between complex traits based on summary statistics.GWAShug:一个基于汇总统计数据解码复杂性状之间共享遗传基础的综合平台。
Nucleic Acids Res. 2025 Jan 6;53(D1):D1006-D1015. doi: 10.1093/nar/gkae873.
9
The goldmine of GWAS summary statistics: a systematic review of methods and tools.全基因组关联研究汇总统计数据的宝库:方法与工具的系统综述
BioData Min. 2024 Sep 5;17(1):31. doi: 10.1186/s13040-024-00385-x.
10
Using Genetics to Investigate Relationships between Phenotypes: Application to Endometrial Cancer.利用遗传学研究表型之间的关系:以内膜癌为例。
Genes (Basel). 2024 Jul 18;15(7):939. doi: 10.3390/genes15070939.

本文引用的文献

1
Optimal Estimation of Genetic Relatedness in High-dimensional Linear Models.高维线性模型中遗传相关性的最优估计
J Am Stat Assoc. 2019;114(525):358-369. doi: 10.1080/01621459.2017.1407774. Epub 2018 Nov 19.
2
On Genetic Correlation Estimation With Summary Statistics From Genome-Wide Association Studies.基于全基因组关联研究汇总统计量的遗传相关性估计
J Am Stat Assoc. 2022;117(537):1-11. doi: 10.1080/01621459.2021.1906684. Epub 2021 May 19.
3
SUPERGNOVA: local genetic correlation analysis reveals heterogeneous etiologic sharing of complex traits.超新星:局部遗传相关分析揭示了复杂性状病因异质性共享。
Genome Biol. 2021 Sep 7;22(1):262. doi: 10.1186/s13059-021-02478-w.
4
Detecting local genetic correlations with scan statistics.利用扫描统计检测局部遗传相关性。
Nat Commun. 2021 Apr 1;12(1):2033. doi: 10.1038/s41467-021-22334-6.
5
High-definition likelihood inference of genetic correlations across human complex traits.人类复杂特征遗传相关性的高分辨率似然推断。
Nat Genet. 2020 Aug;52(8):859-864. doi: 10.1038/s41588-020-0653-y. Epub 2020 Jun 29.
6
2. Classification and Diagnosis of Diabetes: .2. 糖尿病的分类和诊断: 。
Diabetes Care. 2020 Jan;43(Suppl 1):S14-S31. doi: 10.2337/dc20-S002.
7
Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits.基因组结构方程模型为复杂性状的多变量遗传结构提供了深入的了解。
Nat Hum Behav. 2019 May;3(5):513-525. doi: 10.1038/s41562-019-0566-x. Epub 2019 Apr 8.
8
Identification of common genetic risk variants for autism spectrum disorder.孤独症谱系障碍常见遗传风险变异的鉴定。
Nat Genet. 2019 Mar;51(3):431-444. doi: 10.1038/s41588-019-0344-8. Epub 2019 Feb 25.
9
SumHer better estimates the SNP heritability of complex traits from summary statistics.SumHer 可以更好地从汇总统计数据估计复杂性状的 SNP 遗传力。
Nat Genet. 2019 Feb;51(2):277-284. doi: 10.1038/s41588-018-0279-5. Epub 2018 Dec 3.
10
Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder.发现首个与注意缺陷多动障碍全基因组显著相关的风险位点。
Nat Genet. 2019 Jan;51(1):63-75. doi: 10.1038/s41588-018-0269-7. Epub 2018 Nov 26.