• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中基于汇总统计量与多位点关联分析方法的全面比较。

A comprehensive comparison of multilocus association methods with summary statistics in genome-wide association studies.

机构信息

Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.

Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.

出版信息

BMC Bioinformatics. 2022 Aug 30;23(1):359. doi: 10.1186/s12859-022-04897-3.

DOI:10.1186/s12859-022-04897-3
PMID:36042399
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9429742/
Abstract

BACKGROUND

Multilocus analysis on a set of single nucleotide polymorphisms (SNPs) pre-assigned within a gene constitutes a valuable complement to single-marker analysis by aggregating data on complex traits in a biologically meaningful way. However, despite the existence of a wide variety of SNP-set methods, few comprehensive comparison studies have been previously performed to evaluate the effectiveness of these methods.

RESULTS

We herein sought to fill this knowledge gap by conducting a comprehensive empirical comparison for 22 commonly-used summary-statistics based SNP-set methods. We showed that only seven methods could effectively control the type I error, and that these well-calibrated approaches had varying power performance under the simulation scenarios. Overall, we confirmed that the burden test was generally underpowered and score-based variance component tests (e.g., sequence kernel association test) were much powerful under the polygenic genetic architecture in both common and rare variant association analyses. We further revealed that two linkage-disequilibrium-free P value combination methods (e.g., harmonic mean P value method and aggregated Cauchy association test) behaved very well under the sparse genetic architecture in simulations and real-data applications to common and rare variant association analyses as well as in expression quantitative trait loci weighted integrative analysis. We also assessed the scalability of these approaches by recording computational time and found that all these methods can be scalable to biobank-scale data although some might be relatively slow.

CONCLUSION

In conclusion, we hope that our findings can offer an important guidance on how to choose appropriate multilocus association analysis methods in post-GWAS era. All the SNP-set methods are implemented in the R package called MCA, which is freely available at https://github.com/biostatpzeng/ .

摘要

背景

在基因内预先分配的一组单核苷酸多态性(SNP)的多位点分析通过以生物学上有意义的方式聚合复杂性状的数据,是对单标记分析的宝贵补充。然而,尽管存在各种各样的 SNP 集方法,但以前很少进行全面的比较研究来评估这些方法的有效性。

结果

我们旨在通过对 22 种常用的基于汇总统计量的 SNP 集方法进行全面的实证比较来填补这一知识空白。我们表明,只有七种方法能够有效地控制 I 型错误,并且这些校准良好的方法在模拟场景下具有不同的功效表现。总体而言,我们证实了负担检验通常功效不足,基于分数的方差分量检验(例如序列核关联检验)在常见和罕见变异关联分析中的多基因遗传结构下具有更强的功效。我们进一步揭示了两种无连锁不平衡的 P 值组合方法(例如调和平均值 P 值方法和聚合柯西关联检验)在模拟和真实数据应用中常见和罕见变异关联分析以及表达数量性状基因座加权综合分析中的稀疏遗传结构下表现非常好。我们还通过记录计算时间来评估这些方法的可扩展性,发现尽管有些方法可能相对较慢,但所有这些方法都可以扩展到生物库规模的数据。

结论

总之,我们希望我们的研究结果能够为在后 GWAS 时代如何选择适当的多基因关联分析方法提供重要指导。所有的 SNP 集方法都在名为 MCA 的 R 包中实现,可在 https://github.com/biostatpzeng/ 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/a095792a473e/12859_2022_4897_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/d03243b502cc/12859_2022_4897_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/23cc55002312/12859_2022_4897_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/0c7e26aabb40/12859_2022_4897_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/78208a176d26/12859_2022_4897_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/a095792a473e/12859_2022_4897_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/d03243b502cc/12859_2022_4897_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/23cc55002312/12859_2022_4897_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/0c7e26aabb40/12859_2022_4897_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/78208a176d26/12859_2022_4897_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e807/9429742/a095792a473e/12859_2022_4897_Fig5_HTML.jpg

相似文献

1
A comprehensive comparison of multilocus association methods with summary statistics in genome-wide association studies.全基因组关联研究中基于汇总统计量与多位点关联分析方法的全面比较。
BMC Bioinformatics. 2022 Aug 30;23(1):359. doi: 10.1186/s12859-022-04897-3.
2
Estimating colocalization probability from limited summary statistics.从有限的汇总统计数据中估计共定位概率。
BMC Bioinformatics. 2021 May 17;22(1):254. doi: 10.1186/s12859-021-04170-z.
3
A gene based combination test using GWAS summary data.基于 GWAS 汇总数据的基因组合测试。
BMC Bioinformatics. 2023 Jan 3;24(1):2. doi: 10.1186/s12859-022-05114-x.
4
SumVg: Total Heritability Explained by All Variants in Genome-Wide Association Studies Based on Summary Statistics with Standard Error Estimates.SumVg:基于具有标准误差估计的汇总统计数据的全基因组关联研究中所有变异解释的总遗传力。
Int J Mol Sci. 2024 Jan 22;25(2):1347. doi: 10.3390/ijms25021347.
5
Comparison of methods for estimating genetic correlation between complex traits using GWAS summary statistics.利用 GWAS 汇总统计数据估计复杂性状遗传相关性的方法比较。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa442.
6
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
7
Integrate multiple traits to detect novel trait-gene association using GWAS summary data with an adaptive test approach.利用 GWAS 汇总数据和自适应检验方法整合多种性状,以检测新的性状-基因关联。
Bioinformatics. 2019 Jul 1;35(13):2251-2257. doi: 10.1093/bioinformatics/bty961.
8
POLARIS: Polygenic LD-adjusted risk score approach for set-based analysis of GWAS data.POLARIS:用于全基因组关联研究(GWAS)数据基于集合分析的多基因连锁不平衡调整风险评分方法。
Genet Epidemiol. 2018 Jun;42(4):366-377. doi: 10.1002/gepi.22117. Epub 2018 Mar 12.
9
Genomic prediction based on selective linkage disequilibrium pruning of low-coverage whole-genome sequence variants in a pure Duroc population.基于在纯杜洛克群体中对低覆盖度全基因组序列变异体进行选择性连锁不平衡修剪的基因组预测。
Genet Sel Evol. 2023 Oct 18;55(1):72. doi: 10.1186/s12711-023-00843-w.
10
Non-parametric Polygenic Risk Prediction via Partitioned GWAS Summary Statistics.基于分区 GWAS 汇总统计量的非参数多基因风险预测。
Am J Hum Genet. 2020 Jul 2;107(1):46-59. doi: 10.1016/j.ajhg.2020.05.004. Epub 2020 May 28.

引用本文的文献

1
Integrative Multi-Omics Approach for Improving Causal Gene Identification.用于改进因果基因识别的整合多组学方法
Genet Epidemiol. 2025 Jan;49(1):e22601. doi: 10.1002/gepi.22601. Epub 2024 Oct 23.
2
The goldmine of GWAS summary statistics: a systematic review of methods and tools.全基因组关联研究汇总统计数据的宝库:方法与工具的系统综述
BioData Min. 2024 Sep 5;17(1):31. doi: 10.1186/s13040-024-00385-x.
3
Incorporating genetic similarity of auxiliary samples into eGene identification under the transfer learning framework.在迁移学习框架下,将辅助样本的遗传相似性纳入 eGene 识别。

本文引用的文献

1
Polygenic enrichment distinguishes disease associations of individual cells in single-cell RNA-seq data.多基因富集可区分单细胞 RNA-seq 数据中单个细胞的疾病关联。
Nat Genet. 2022 Oct;54(10):1572-1580. doi: 10.1038/s41588-022-01167-z. Epub 2022 Sep 1.
2
EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.EPIC:通过整合全基因组关联研究和单细胞 RNA 测序来推断复杂性状的相关细胞类型。
PLoS Genet. 2022 Jun 16;18(6):e1010251. doi: 10.1371/journal.pgen.1010251. eCollection 2022 Jun.
3
Transcriptome-wide association studies: a view from Mendelian randomization.
J Transl Med. 2024 Mar 9;22(1):258. doi: 10.1186/s12967-024-05053-6.
4
Detecting associated genes for complex traits shared across East Asian and European populations under the framework of composite null hypothesis testing.在复合零假设检验框架下检测东亚和欧洲人群中共享的复杂性状的相关基因。
J Transl Med. 2022 Sep 23;20(1):424. doi: 10.1186/s12967-022-03637-8.
全转录组关联研究:孟德尔随机化视角
Quant Biol. 2021 Jun;9(2):107-121. doi: 10.1007/s40484-020-0207-4.
4
Simultaneous test and estimation of total genetic effect in eQTL integrative analysis through mixed models.通过混合模型实现 eQTL 整合分析中总遗传效应的同时检验和估计。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac038.
5
Analysis of rare genetic variation underlying cardiometabolic diseases and traits among 200,000 individuals in the UK Biobank.在英国生物银行的 20 万名个体中分析心血管代谢疾病和特征的罕见遗传变异。
Nat Genet. 2022 Mar;54(3):240-250. doi: 10.1038/s41588-021-01011-w. Epub 2022 Feb 17.
6
The Prognostic Value of FoxP3+ Tumour-Infiltrating Lymphocytes in Rectal Cancer Depends on Immune Phenotypes Defined by CD8+ Cytotoxic T Cell Density.FoxP3+ 肿瘤浸润淋巴细胞对直肠癌的预后价值取决于 CD8+ 细胞毒性 T 细胞密度定义的免疫表型。
Front Immunol. 2022 Jan 24;13:781222. doi: 10.3389/fimmu.2022.781222. eCollection 2022.
7
MicroRNA-379-5p targets MAP3K2 to reduce autophagy and alleviate neuronal injury following cerebral ischemia via the JNK/c-Jun signaling pathway.miRNA-379-5p 通过 JNK/c-Jun 信号通路靶向 MAP3K2 减少自噬并减轻脑缺血后的神经元损伤。
Kaohsiung J Med Sci. 2022 Mar;38(3):230-243. doi: 10.1002/kjm2.12488. Epub 2021 Dec 21.
8
A comprehensive gene-centric pleiotropic association analysis for 14 psychiatric disorders with GWAS summary statistics.一项针对 14 种精神障碍的全基因关联荟萃分析,使用 GWAS 汇总统计数据进行。
BMC Med. 2021 Dec 13;19(1):314. doi: 10.1186/s12916-021-02186-z.
9
Integrative eQTL-weighted hierarchical Cox models for SNP-set based time-to-event association studies.基于 SNP 集的整合 eQTL 加权分层 Cox 模型用于时间事件关联研究。
J Transl Med. 2021 Oct 9;19(1):418. doi: 10.1186/s12967-021-03090-z.
10
Identifying pleiotropic genes for complex phenotypes with summary statistics from a perspective of composite null hypothesis testing.从复合零假设检验的角度,利用汇总统计信息鉴定复杂表型的多效基因。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab389.