• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 GWAS 汇总统计数据的多表型关联研究的聚类线性组合方法。

A clustering linear combination method for multiple phenotype association studies based on GWAS summary statistics.

机构信息

Mathematical Sciences, Michigan Technological University, Houghton, MI, USA.

出版信息

Sci Rep. 2023 Feb 28;13(1):3389. doi: 10.1038/s41598-023-30415-3.

DOI:10.1038/s41598-023-30415-3
PMID:36854754
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9975197/
Abstract

There is strong evidence showing that joint analysis of multiple phenotypes in genome-wide association studies (GWAS) can increase statistical power when detecting the association between genetic variants and human complex diseases. We previously developed the Clustering Linear Combination (CLC) method and a computationally efficient CLC (ceCLC) method to test the association between multiple phenotypes and a genetic variant, which perform very well. However, both of these methods require individual-level genotypes and phenotypes that are often not easily accessible. In this research, we develop a novel method called sCLC for association studies of multiple phenotypes and a genetic variant based on GWAS summary statistics. We use the LD score regression to estimate the correlation matrix among phenotypes. The test statistic of sCLC is constructed by GWAS summary statistics and has an approximate Cauchy distribution. We perform a variety of simulation studies and compare sCLC with other commonly used methods for multiple phenotype association studies using GWAS summary statistics. Simulation results show that sCLC can control Type I error rates well and has the highest power in most scenarios. Moreover, we apply the newly developed method to the UK Biobank GWAS summary statistics from the XIII category with 70 related musculoskeletal system and connective tissue phenotypes. The results demonstrate that sCLC detects the most number of significant SNPs, and most of these identified SNPs can be matched to genes that have been reported in the GWAS catalog to be associated with those phenotypes. Furthermore, sCLC also identifies some novel signals that were missed by standard GWAS, which provide new insight into the potential genetic factors of the musculoskeletal system and connective tissue phenotypes.

摘要

有强有力的证据表明,在全基因组关联研究(GWAS)中联合分析多个表型可以提高检测遗传变异与人类复杂疾病之间关联的统计能力。我们之前开发了聚类线性组合(CLC)方法和一种计算效率高的 CLC(ceCLC)方法来检验多个表型与遗传变异之间的关联,这两种方法表现都非常出色。然而,这两种方法都需要个体水平的基因型和表型,而这些通常不容易获得。在这项研究中,我们开发了一种新的方法,称为 sCLC,用于基于 GWAS 汇总统计数据的多个表型和遗传变异的关联研究。我们使用 LD 得分回归来估计表型之间的相关矩阵。sCLC 的检验统计量由 GWAS 汇总统计数据构建,具有近似的 Cauchy 分布。我们进行了各种模拟研究,并使用 GWAS 汇总统计数据比较了 sCLC 与其他常用的多表型关联研究方法。模拟结果表明,sCLC 可以很好地控制第一类错误率,并且在大多数情况下具有最高的功效。此外,我们将新开发的方法应用于 UK Biobank GWAS 汇总统计数据的第十三类,其中有 70 个相关的肌肉骨骼系统和结缔组织表型。结果表明,sCLC 检测到了最多数量的显著 SNPs,并且这些鉴定出的 SNPs 中的大多数可以与 GWAS 目录中报告的与这些表型相关的基因相匹配。此外,sCLC 还鉴定出了一些标准 GWAS 错过的新信号,这为肌肉骨骼系统和结缔组织表型的潜在遗传因素提供了新的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/b24a860ada08/41598_2023_30415_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/236e17f69190/41598_2023_30415_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/52e6f85e018a/41598_2023_30415_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/9870dba20143/41598_2023_30415_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/b24a860ada08/41598_2023_30415_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/236e17f69190/41598_2023_30415_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/52e6f85e018a/41598_2023_30415_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/9870dba20143/41598_2023_30415_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9466/9975197/b24a860ada08/41598_2023_30415_Fig4_HTML.jpg

相似文献

1
A clustering linear combination method for multiple phenotype association studies based on GWAS summary statistics.基于 GWAS 汇总统计数据的多表型关联研究的聚类线性组合方法。
Sci Rep. 2023 Feb 28;13(1):3389. doi: 10.1038/s41598-023-30415-3.
2
A computationally efficient clustering linear combination approach to jointly analyze multiple phenotypes for GWAS.一种计算效率高的聚类线性组合方法,用于联合分析 GWAS 中的多种表型。
PLoS One. 2022 Apr 28;17(4):e0260911. doi: 10.1371/journal.pone.0260911. eCollection 2022.
3
Meta-analysis of set-based multiple phenotype association test based on GWAS summary statistics from different cohorts.基于不同队列的全基因组关联研究(GWAS)汇总统计数据的基于集合的多表型关联测试的荟萃分析。
Front Genet. 2024 Sep 5;15:1359591. doi: 10.3389/fgene.2024.1359591. eCollection 2024.
4
An adaptive test based on principal components for detecting multiple phenotype associations using GWAS summary data.一种基于主成分的适应性检验,用于利用全基因组关联研究(GWAS)汇总数据检测多种表型关联。
Genetica. 2023 Apr;151(2):97-104. doi: 10.1007/s10709-023-00179-9. Epub 2023 Jan 19.
5
A clustering linear combination approach to jointly analyze multiple phenotypes for GWAS.一种聚类线性组合方法,用于联合分析 GWAS 中的多种表型。
Bioinformatics. 2019 Apr 15;35(8):1373-1379. doi: 10.1093/bioinformatics/bty810.
6
Integrate multiple traits to detect novel trait-gene association using GWAS summary data with an adaptive test approach.利用 GWAS 汇总数据和自适应检验方法整合多种性状,以检测新的性状-基因关联。
Bioinformatics. 2019 Jul 1;35(13):2251-2257. doi: 10.1093/bioinformatics/bty961.
7
Methods for meta-analysis of multiple traits using GWAS summary statistics.使用全基因组关联研究(GWAS)汇总统计量进行多性状荟萃分析的方法。
Genet Epidemiol. 2018 Mar;42(2):134-145. doi: 10.1002/gepi.22105. Epub 2017 Dec 10.
8
A summary-statistics-based approach to examine the role of serotonin transporter promoter tandem repeat polymorphism in psychiatric phenotypes.基于汇总统计的方法来检验 5-羟色胺转运体启动子串联重复多态性在精神表型中的作用。
Eur J Hum Genet. 2022 May;30(5):547-554. doi: 10.1038/s41431-021-00996-6. Epub 2021 Dec 23.
9
Gene-based association tests in family samples using GWAS summary statistics.基于基因的家系样本关联分析,使用 GWAS 汇总统计数据。
Genet Epidemiol. 2024 Apr;48(3):103-113. doi: 10.1002/gepi.22548. Epub 2024 Feb 5.
10
Joint analysis of multiple phenotypes using a clustering linear combination method based on hierarchical clustering.基于层次聚类的聚类线性组合方法对多个表型进行联合分析。
Genet Epidemiol. 2020 Jan;44(1):67-78. doi: 10.1002/gepi.22263. Epub 2019 Sep 20.

引用本文的文献

1
Trait selection strategy in multi-trait GWAS: Boosting SNP discoverability.多性状 GWAS 中的性状选择策略:提高 SNP 可发现性。
HGG Adv. 2024 Jul 18;5(3):100319. doi: 10.1016/j.xhgg.2024.100319. Epub 2024 Jun 13.
2
Trait selection strategy in multi-trait GWAS: Boosting SNPs discoverability.多性状全基因组关联研究中的性状选择策略:提高单核苷酸多态性的可发现性
bioRxiv. 2023 Oct 27:2023.10.27.564319. doi: 10.1101/2023.10.27.564319.

本文引用的文献

1
Gene selection by incorporating genetic networks into case-control association studies.通过将基因网络纳入病例对照关联研究进行基因选择。
Eur J Hum Genet. 2024 Mar;32(3):270-277. doi: 10.1038/s41431-022-01264-x. Epub 2022 Dec 19.
2
HCLC-FC: A novel statistical method for phenome-wide association studies.HCLC-FC:一种用于全基因组关联研究的新统计方法。
PLoS One. 2022 Nov 9;17(11):e0276646. doi: 10.1371/journal.pone.0276646. eCollection 2022.
3
A Novel Framework for Analysis of the Shared Genetic Background of Correlated Traits.
一种分析相关性状共享遗传背景的新框架。
Genes (Basel). 2022 Sep 21;13(10):1694. doi: 10.3390/genes13101694.
4
PolarMorphism enables discovery of shared genetic variants across multiple traits from GWAS summary statistics.多态性分析能够从 GWAS 汇总统计数据中发现多个性状共有的遗传变异。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i212-i219. doi: 10.1093/bioinformatics/btac228.
5
A computationally efficient clustering linear combination approach to jointly analyze multiple phenotypes for GWAS.一种计算效率高的聚类线性组合方法,用于联合分析 GWAS 中的多种表型。
PLoS One. 2022 Apr 28;17(4):e0260911. doi: 10.1371/journal.pone.0260911. eCollection 2022.
6
PLEIO: a method to map and interpret pleiotropic loci with GWAS summary statistics.PLEIO:一种使用 GWAS 汇总统计数据进行基因多效性位点映射和解释的方法。
Am J Hum Genet. 2021 Jan 7;108(1):36-48. doi: 10.1016/j.ajhg.2020.11.017. Epub 2020 Dec 21.
7
Morphological signs of connective tissue dysplasia as predictors of frequent post-exercise musculoskeletal disorders.结缔组织发育异常的形态学特征作为运动后频繁发生肌肉骨骼疾病的预测指标。
BMC Musculoskelet Disord. 2020 Oct 8;21(1):660. doi: 10.1186/s12891-020-03698-0.
8
Cauchy combination test: a powerful test with analytic -value calculation under arbitrary dependency structures.柯西组合检验:一种在任意相依结构下具有解析值计算功能的强大检验。
J Am Stat Assoc. 2020;115(529):393-402. doi: 10.1080/01621459.2018.1554485. Epub 2019 Apr 25.
9
Multi-trait Genome-Wide Analyses of the Brain Imaging Phenotypes in UK Biobank.多特征全基因组分析 UK Biobank 中的脑影像表型。
Genetics. 2020 Aug;215(4):947-958. doi: 10.1534/genetics.120.303242. Epub 2020 Jun 15.
10
Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics.通过基因水平关联统计量的层次聚类检测多种表型之间的共享遗传结构。
Genetics. 2020 Jun;215(2):511-529. doi: 10.1534/genetics.120.303096. Epub 2020 Apr 3.