• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SNP-PRAGE:基于单核苷酸多态性的基因集富集参数稳健分析

SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment.

作者信息

Lee Jaehoon, Ahn Soyeon, Oh Sohee, Weir Bruce, Park Taesung

机构信息

Department of Statistics, Seoul National University, San 56-1, Shilim-dong, Seoul, Korea.

出版信息

BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S11. doi: 10.1186/1752-0509-5-S2-S11. Epub 2011 Dec 14.

DOI:10.1186/1752-0509-5-S2-S11
PMID:22784568
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3287477/
Abstract

BACKGROUND

The current genome-wide association (GWA) analysis mainly focuses on the single genetic variant, which may not reveal some the genetic variants that have small individual effects but large joint effects. Considering the multiple SNPs jointly in Genome-wide association (GWA) analysis can increase power. When multiple SNPs are jointly considered, the corresponding SNP-level association measures are likely to be correlated due to the linkage disequilibrium (LD) among SNPs.

METHODS

We propose SNP-based parametric robust analysis of gene-set enrichment (SNP-PRAGE) method which handles correlation adequately among association measures of SNPs, and minimizes computing effort by the parametric assumption. SNP-PRAGE first obtains gene-level association measures from SNP-level association measures by incorporating the size of corresponding (or nearby) genes and the LD structure among SNPs. Afterward, SNP-PRAGE acquires the gene-set level summary of genes that undergo the same biological knowledge. This two-step summarization makes the within-set association measures to be independent from each other, and therefore the central limit theorem can be adequately applied for the parametric model.

RESULTS & CONCLUSIONS: We applied SNP-PRAGE to two GWA data sets: hypertension data of 8,842 samples from the Korean population and bipolar disorder data of 4,806 samples from the Wellcome Trust Case Control Consortium (WTCCC). We found two enriched gene sets for hypertension and three enriched gene sets for bipolar disorder. By a simulation study, we compared our method to other gene set methods, and we found SNP-PRAGE reduced many false positives notably while requiring much less computational efforts than other permutation-based gene set approaches.

摘要

背景

当前的全基因组关联(GWA)分析主要聚焦于单个基因变异,这可能无法揭示一些个体效应小但联合效应大的基因变异。在全基因组关联分析中联合考虑多个单核苷酸多态性(SNP)可提高检验效能。当联合考虑多个SNP时,由于SNP之间的连锁不平衡(LD),相应的SNP水平关联度量可能会相关。

方法

我们提出了基于SNP的基因集富集参数稳健分析(SNP-PRAGE)方法,该方法能充分处理SNP关联度量之间的相关性,并通过参数假设将计算量降至最低。SNP-PRAGE首先通过纳入相应(或附近)基因的大小以及SNP之间的LD结构,从SNP水平关联度量中获得基因水平关联度量。之后,SNP-PRAGE获取具有相同生物学知识的基因的基因集水平汇总。这种两步汇总使得集合内的关联度量相互独立,因此中心极限定理可充分应用于参数模型。

结果与结论

我们将SNP-PRAGE应用于两个GWA数据集:来自韩国人群的8842个样本的高血压数据和来自威康信托病例对照协会(WTCCC)的4806个样本的双相情感障碍数据。我们发现了两个高血压富集基因集和三个双相情感障碍富集基因集。通过模拟研究,我们将我们的方法与其他基因集方法进行了比较,发现SNP-PRAGE显著减少了许多假阳性,同时与其他基于置换的基因集方法相比,所需的计算量要少得多。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baf1/3287477/37c7d6421ca0/1752-0509-5-S2-S11-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baf1/3287477/f231031231ce/1752-0509-5-S2-S11-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baf1/3287477/c5f59e39723d/1752-0509-5-S2-S11-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baf1/3287477/37c7d6421ca0/1752-0509-5-S2-S11-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baf1/3287477/f231031231ce/1752-0509-5-S2-S11-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baf1/3287477/c5f59e39723d/1752-0509-5-S2-S11-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baf1/3287477/37c7d6421ca0/1752-0509-5-S2-S11-3.jpg

相似文献

1
SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment.SNP-PRAGE:基于单核苷酸多态性的基因集富集参数稳健分析
BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S11. doi: 10.1186/1752-0509-5-S2-S11. Epub 2011 Dec 14.
2
RS-SNP: a random-set method for genome-wide association studies.RS-SNP:一种用于全基因组关联研究的随机集方法。
BMC Genomics. 2011 Mar 30;12:166. doi: 10.1186/1471-2164-12-166.
3
Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data.从全基因组关联数据中找到 2 型糖尿病因果单核苷酸多态性组合和功能模块。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S3. doi: 10.1186/1472-6947-13-S1-S3. Epub 2013 Apr 5.
4
Convergent genome wide association results for bipolar disorder and substance dependence.双相情感障碍和物质依赖的全基因组关联结果趋同。
Am J Med Genet B Neuropsychiatr Genet. 2009 Mar 5;150B(2):182-90. doi: 10.1002/ajmg.b.30900.
5
SNP-based pathway enrichment analysis for genome-wide association studies.基于 SNP 的通路富集分析在全基因组关联研究中的应用。
BMC Bioinformatics. 2011 Apr 15;12:99. doi: 10.1186/1471-2105-12-99.
6
Tagging SNP-set selection with maximum information based on linkage disequilibrium structure in genome-wide association studies.基于全基因组关联研究中连锁不平衡结构的最大信息进行 SNP 集选择标记。
Bioinformatics. 2017 Jul 15;33(14):2078-2081. doi: 10.1093/bioinformatics/btx151.
7
Where is the causal variant? On the advantage of the family design over the case-control design in genetic association studies.因果变异在哪里?论家族设计在基因关联研究中相对于病例对照设计的优势。
Eur J Hum Genet. 2015 Oct;23(10):1357-63. doi: 10.1038/ejhg.2014.284. Epub 2015 Jan 14.
8
An examination of single nucleotide polymorphism selection prioritization strategies for tests of gene-gene interaction.单核苷酸多态性选择优先级策略在基因-基因相互作用检验中的应用研究
Biol Psychiatry. 2011 Jul 15;70(2):198-203. doi: 10.1016/j.biopsych.2011.01.034. Epub 2011 Apr 9.
9
Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder.全基因组关联研究数据集的基因本体分析为双相情感障碍的生物学研究提供了见解。
Am J Hum Genet. 2009 Jul;85(1):13-24. doi: 10.1016/j.ajhg.2009.05.011. Epub 2009 Jun 18.
10
LDGIdb: a database of gene interactions inferred from long-range strong linkage disequilibrium between pairs of SNPs.LDGIdb:一个从单核苷酸多态性(SNP)对之间的长程强连锁不平衡推断出的基因相互作用数据库。
BMC Res Notes. 2012 May 2;5:212. doi: 10.1186/1756-0500-5-212.

引用本文的文献

1
Enhanced adaptive permutation test with negative binomial distribution in genome-wide omics datasets.全基因组组学数据集中具有负二项分布的增强自适应排列检验
Genes Genomics. 2025 Jan;47(1):59-70. doi: 10.1007/s13258-024-01584-w. Epub 2024 Nov 6.
2
Comprehensive detection of genes causing a phenotype using phenotype sequencing and pathway analysis.使用表型测序和通路分析对导致某一表型的基因进行全面检测。
PLoS One. 2014 Feb 26;9(2):e88072. doi: 10.1371/journal.pone.0088072. eCollection 2014.

本文引用的文献

1
Protein kinases CK1 and CK2 as new targets for neurodegenerative diseases.蛋白激酶 CK1 和 CK2 作为神经退行性疾病的新靶点。
Med Res Rev. 2011 Nov;31(6):924-54. doi: 10.1002/med.20207. Epub 2010 Jun 23.
2
GSA-SNP: a general approach for gene set analysis of polymorphisms.GSA-SNP:一种用于多态性基因集分析的通用方法。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W749-54. doi: 10.1093/nar/gkq428. Epub 2010 May 25.
3
The generalisation of student's problems when several different population variances are involved.当涉及几个不同总体方差时学生问题的推广。
Biometrika. 1947;34(1-2):28-35. doi: 10.1093/biomet/34.1-2.28.
4
Induction of mitogen-activated protein kinases is proportional to the amount of pressure overload.有丝分裂原激活的蛋白激酶的诱导与压力超负荷的量成正比。
Hypertension. 2010 Jan;55(1):137-43. doi: 10.1161/HYPERTENSIONAHA.109.135467. Epub 2009 Nov 9.
5
Gene and pathway-based second-wave analysis of genome-wide association studies.基于基因和通路的全基因组关联研究的二次分析。
Eur J Hum Genet. 2010 Jan;18(1):111-7. doi: 10.1038/ejhg.2009.115.
6
A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits.一项针对亚洲人群的大规模全基因组关联研究揭示了影响八个数量性状的遗传因素。
Nat Genet. 2009 May;41(5):527-34. doi: 10.1038/ng.357. Epub 2009 Apr 26.
7
GLOSSI: a method to assess the association of genetic loci-sets with complex diseases.GLOSSI:一种评估基因座集与复杂疾病关联的方法。
BMC Bioinformatics. 2009 Apr 3;10:102. doi: 10.1186/1471-2105-10-102.
8
Pathway analysis by adaptive combination of P-values.基于 P 值自适应组合的通路分析。
Genet Epidemiol. 2009 Dec;33(8):700-9. doi: 10.1002/gepi.20422.
9
Prioritizing risk pathways: a novel association approach to searching for disease pathways fusing SNPs and pathways.对风险途径进行优先级排序:一种融合单核苷酸多态性(SNPs)和途径来寻找疾病途径的新型关联方法。
Bioinformatics. 2009 Jan 15;25(2):237-42. doi: 10.1093/bioinformatics/btn613. Epub 2008 Nov 24.
10
GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies.GSEA-SNP:将基因集富集分析应用于全基因组关联研究的SNP数据。
Bioinformatics. 2008 Dec 1;24(23):2784-5. doi: 10.1093/bioinformatics/btn516. Epub 2008 Oct 14.