• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PRED-LD:全基因组关联研究汇总统计数据的高效估算

PRED-LD: efficient imputation of GWAS summary statistics.

作者信息

Manios Georgios A, Michailidi Aikaterini, Kontou Panagiota I, Bagos Pantelis G

机构信息

Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131, Lamia, Greece.

Department of Mathematics, University of Thessaly, 35131, Lamia, Greece.

出版信息

BMC Bioinformatics. 2025 Apr 16;26(1):107. doi: 10.1186/s12859-025-06119-y.

DOI:10.1186/s12859-025-06119-y
PMID:40240925
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12004831/
Abstract

BACKGROUND

Genome-wide association studies have identified connections between genetic variations and diseases, but they only examine a small portion of single nucleotide polymorphisms. To enhance genetic findings, researchers suggest imputing genotypes for unmeasured SNPs to improve coverage and statistical power. When this is not possible, summary statistics imputation can be used as an alternative. The available summary statistics imputation tools rely on reference panels, such as the 1000 Genomes Project, to estimate linkage disequilibrium (LD) between variants for accurate imputation. Tools like FAPI and SSIMP use these reference panels in variant call format (VCF) for this purpose, though this process can be time-consuming. A more effective approach for processing reference panels in summary statistics imputation was proposed in RAISS. In this approach, the LD among the variants is precomputed from the reference panel, prior to imputation, thereby reducing computational time.

RESULTS

We present PRED-LD, an imputation method for GWAS summary statistics that aims to enhance the resolution of genetic association analyses. The proposed method uses precomputed linkage disequilibrium statistics from HapMap, Pheno Scanner and TOP-LD to impute summary statistics, given beta coefficients and standard errors. The single-point approach that we describe provides a fast and accurate way to estimate associations for untyped single nucleotide polymorphisms that exhibit high linkage disequilibrium (LD). The proposed method is faster, provides accurate imputation compared to existing tools, and has been implemented in both a web service ( https://compgen.dib.uth.gr/PRED-LD/ ) and a command-line tool ( https://github.com/pbagos/PRED-LD ), making it a useful resource for the research community.

CONCLUSIONS

PRED-LD offers an efficient and accurate method for GWAS summary statistics imputation, providing faster performance, direct result interpretation, and the ability to use multiple reference panels. Also, the online version of PRED-LD simplifies obtaining LD information and performing imputation tasks without downloading reference panels and will be continuously updated to support tools for meta-analysis and fine-mapping in GWAS.

摘要

背景

全基因组关联研究已确定了基因变异与疾病之间的联系,但它们仅检测了一小部分单核苷酸多态性。为了增强基因研究结果,研究人员建议对未测量的单核苷酸多态性进行基因型填充,以提高覆盖范围和统计功效。当无法进行这种填充时,汇总统计量填充可作为一种替代方法。现有的汇总统计量填充工具依赖于参考面板,如千人基因组计划,来估计变异之间的连锁不平衡(LD),以进行准确的填充。像FAPI和SSIMP这样的工具为此使用变体调用格式(VCF)的这些参考面板,不过这个过程可能很耗时。RAISS中提出了一种在汇总统计量填充中处理参考面板的更有效方法。在这种方法中,在填充之前从参考面板预先计算变异之间的LD,从而减少计算时间。

结果

我们提出了PRED-LD,一种用于全基因组关联研究汇总统计量的填充方法,旨在提高基因关联分析的分辨率。所提出的方法使用来自HapMap、Pheno Scanner和TOP-LD的预先计算的连锁不平衡统计量,在给定β系数和标准误的情况下对汇总统计量进行填充。我们描述的单点方法提供了一种快速准确的方法来估计与表现出高连锁不平衡(LD)的未分型单核苷酸多态性的关联。所提出的方法更快,与现有工具相比提供了准确的填充,并且已在网络服务(https://compgen.dib.uth.gr/PRED-LD/)和命令行工具(https://github.com/pbagos/PRED-LD)中实现,使其成为研究社区的有用资源。

结论

PRED-LD为全基因组关联研究汇总统计量填充提供了一种高效准确的方法,具有更快的性能、直接的结果解释以及使用多个参考面板的能力。此外,PRED-LD的在线版本简化了获取LD信息和执行填充任务的过程,无需下载参考面板,并将不断更新以支持全基因组关联研究中的荟萃分析和精细定位工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/5c08fb702283/12859_2025_6119_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/7012af462946/12859_2025_6119_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/3230a1d51611/12859_2025_6119_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/dafb982422ca/12859_2025_6119_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/a522b304c172/12859_2025_6119_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/7b4a94129f73/12859_2025_6119_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/5c08fb702283/12859_2025_6119_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/7012af462946/12859_2025_6119_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/3230a1d51611/12859_2025_6119_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/dafb982422ca/12859_2025_6119_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/a522b304c172/12859_2025_6119_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/7b4a94129f73/12859_2025_6119_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/845a/12004831/5c08fb702283/12859_2025_6119_Fig6_HTML.jpg

相似文献

1
PRED-LD: efficient imputation of GWAS summary statistics.PRED-LD:全基因组关联研究汇总统计数据的高效估算
BMC Bioinformatics. 2025 Apr 16;26(1):107. doi: 10.1186/s12859-025-06119-y.
2
GAUSS: a summary-statistics-based R package for accurate estimation of linkage disequilibrium for variants, Gaussian imputation, and TWAS analysis of cosmopolitan cohorts.GAUSS:一个基于汇总统计的 R 包,用于准确估计变体的连锁不平衡、高斯插补以及世界性队列的 TWAS 分析。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae203.
3
Increasing the resolution and precision of psychiatric genome-wide association studies by re-imputing summary statistics using a large, diverse reference panel.通过使用大型、多样化的参考面板重新估算汇总统计数据,提高精神疾病全基因组关联研究的分辨率和精度。
Am J Med Genet B Neuropsychiatr Genet. 2021 Jan;186(1):16-27. doi: 10.1002/ajmg.b.32834. Epub 2021 Feb 11.
4
FAPI: Fast and accurate P-value Imputation for genome-wide association study.FAPI:用于全基因组关联研究的快速准确P值估算
Eur J Hum Genet. 2016 May;24(5):761-6. doi: 10.1038/ejhg.2015.190. Epub 2015 Aug 26.
5
Fast and accurate imputation of summary statistics enhances evidence of functional enrichment.快速准确地推断汇总统计数据可增强功能富集的证据。
Bioinformatics. 2014 Oct 15;30(20):2906-14. doi: 10.1093/bioinformatics/btu416. Epub 2014 Jul 1.
6
Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies.未分型标记的全基因组推断准确性及其对关联研究统计效能的影响。
BMC Genet. 2009 Jun 16;10:27. doi: 10.1186/1471-2156-10-27.
7
Evaluation and application of summary statistic imputation to discover new height-associated loci.评估和应用汇总统计推断发现新的身高相关位点。
PLoS Genet. 2018 May 21;14(5):e1007371. doi: 10.1371/journal.pgen.1007371. eCollection 2018 May.
8
JEPEGMIX: gene-level joint analysis of functional SNPs in cosmopolitan cohorts.JEPEGMIX:全球人群队列中功能性单核苷酸多态性的基因水平联合分析。
Bioinformatics. 2016 Jan 15;32(2):295-7. doi: 10.1093/bioinformatics/btv567. Epub 2015 Oct 1.
9
Improved Use of Small Reference Panels for Conditional and Joint Analysis with GWAS Summary Statistics.利用小参考面板改进条件和联合分析与 GWAS 汇总统计数据。
Genetics. 2018 Jun;209(2):401-408. doi: 10.1534/genetics.118.300813. Epub 2018 Apr 19.
10
DISTMIX: direct imputation of summary statistics for unmeasured SNPs from mixed ethnicity cohorts.DISTMIX:从混合种族队列中直接推算未测量单核苷酸多态性的汇总统计量。
Bioinformatics. 2015 Oct 1;31(19):3099-104. doi: 10.1093/bioinformatics/btv348. Epub 2015 Jun 9.

本文引用的文献

1
GAUSS: a summary-statistics-based R package for accurate estimation of linkage disequilibrium for variants, Gaussian imputation, and TWAS analysis of cosmopolitan cohorts.GAUSS:一个基于汇总统计的 R 包,用于准确估计变体的连锁不平衡、高斯插补以及世界性队列的 TWAS 分析。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae203.
2
TOP-LD: A tool to explore linkage disequilibrium with TOPMed whole-genome sequence data.TOP-LD:一种利用 TOPMed 全基因组序列数据探索连锁不平衡的工具。
Am J Hum Genet. 2022 Jun 2;109(6):1175-1181. doi: 10.1016/j.ajhg.2022.04.006. Epub 2022 May 2.
3
Increasing the resolution and precision of psychiatric genome-wide association studies by re-imputing summary statistics using a large, diverse reference panel.
通过使用大型、多样化的参考面板重新估算汇总统计数据,提高精神疾病全基因组关联研究的分辨率和精度。
Am J Med Genet B Neuropsychiatr Genet. 2021 Jan;186(1):16-27. doi: 10.1002/ajmg.b.32834. Epub 2021 Feb 11.
4
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.
5
A global overview of pleiotropy and genetic architecture in complex traits.复杂性状中的多效性和遗传结构的全球概述。
Nat Genet. 2019 Sep;51(9):1339-1348. doi: 10.1038/s41588-019-0481-0. Epub 2019 Aug 19.
6
RAISS: robust and accurate imputation from summary statistics.RAISS:从汇总统计数据中进行稳健且准确的推断。
Bioinformatics. 2019 Nov 1;35(22):4837-4839. doi: 10.1093/bioinformatics/btz466.
7
Genome-wide mega-analysis identifies 16 loci and highlights diverse biological mechanisms in the common epilepsies.全基因组大规模分析确定了 16 个基因座,并强调了常见癫痫中的多种生物学机制。
Nat Commun. 2018 Dec 10;9(1):5269. doi: 10.1038/s41467-018-07524-z.
8
Neural correlates of cognitive function and symptoms in attention-deficit/hyperactivity disorder in adults.成人注意缺陷多动障碍认知功能和症状的神经相关因素。
Neuroimage Clin. 2018 May 2;19:374-383. doi: 10.1016/j.nicl.2018.04.035. eCollection 2018.
9
Genome-wide association study in Japanese females identifies fifteen novel skin-related trait associations.全基因组关联研究在日本女性中确定了 15 个与皮肤相关的新特征关联。
Sci Rep. 2018 Jun 12;8(1):8974. doi: 10.1038/s41598-018-27145-2.
10
GWAS identifies two novel colorectal cancer loci at 16q24.1 and 20q13.12.GWAS 鉴定出两个新的结直肠癌风险位点,位于 16q24.1 和 20q13.12。
Carcinogenesis. 2018 May 3;39(5):652-660. doi: 10.1093/carcin/bgy026.