• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于遗传分析研讨会16的单核苷酸多态性数据比较基因集分析方法。

Comparing gene set analysis methods on single-nucleotide polymorphism data from Genetic Analysis Workshop 16.

作者信息

Tintle Nathan L, Borchers Bryce, Brown Marshall, Bekmetjev Airat

机构信息

Department of Mathematics, Hope College, 27 Graves Place, Holland, Michigan 49423, USA.

出版信息

BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S96. doi: 10.1186/1753-6561-3-s7-s96.

DOI:10.1186/1753-6561-3-s7-s96
PMID:20018093
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2796000/
Abstract

Recently, gene set analysis (GSA) has been extended from use on gene expression data to use on single-nucleotide polymorphism (SNP) data in genome-wide association studies. When GSA has been demonstrated on SNP data, two popular statistics from gene expression data analysis (gene set enrichment analysis [GSEA] and Fisher's exact test [FET]) have been used. However, GSEA and FET have shown a lack of power and robustness in the analysis of gene expression data. The purpose of this work is to investigate whether the same issues are also true for the analysis of SNP data. Ultimately, we conclude that GSEA and FET are not optimal for the analysis of SNP data when compared with the SUMSTAT method. In analysis of real SNP data from the Framingham Heart Study, we find that SUMSTAT finds many more gene sets to be significant when compared with other methods. In an analysis of simulated data, SUMSTAT demonstrates high power and better control of the type I error rate. GSA is a promising approach to the analysis of SNP data in GWAS and use of the SUMSTAT statistic instead of GSEA or FET may increase power and robustness.

摘要

最近,基因集分析(GSA)已从用于基因表达数据扩展到用于全基因组关联研究中的单核苷酸多态性(SNP)数据。当在SNP数据上进行GSA验证时,人们使用了来自基因表达数据分析的两种常用统计方法(基因集富集分析[GSEA]和Fisher精确检验[FET])。然而,GSEA和FET在基因表达数据分析中已显示出缺乏效力和稳健性。这项工作的目的是研究在SNP数据分析中是否也存在同样的问题。最终,我们得出结论,与SUMSTAT方法相比,GSEA和FET在SNP数据分析中并非最优。在对弗雷明汉心脏研究的真实SNP数据进行分析时,我们发现与其他方法相比,SUMSTAT发现的显著基因集更多。在对模拟数据的分析中,SUMSTAT显示出高效力和对I型错误率的更好控制。GSA是全基因组关联研究中SNP数据分析的一种有前景的方法,使用SUMSTAT统计量而非GSEA或FET可能会提高效力和稳健性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3054/2796000/2a2aba2b6a1f/1753-6561-3-S7-S96-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3054/2796000/2a2aba2b6a1f/1753-6561-3-S7-S96-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3054/2796000/2a2aba2b6a1f/1753-6561-3-S7-S96-1.jpg

相似文献

1
Comparing gene set analysis methods on single-nucleotide polymorphism data from Genetic Analysis Workshop 16.基于遗传分析研讨会16的单核苷酸多态性数据比较基因集分析方法。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S96. doi: 10.1186/1753-6561-3-s7-s96.
2
Importance of SNP Dependency Correction and Association Integration for Gene Set Analysis in Genome-Wide Association Studies.单核苷酸多态性(SNP)依赖性校正和关联整合在全基因组关联研究基因集分析中的重要性。
Front Genet. 2021 Dec 9;12:767358. doi: 10.3389/fgene.2021.767358. eCollection 2021.
3
Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data.从全基因组关联数据中找到 2 型糖尿病因果单核苷酸多态性组合和功能模块。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S3. doi: 10.1186/1472-6947-13-S1-S3. Epub 2013 Apr 5.
4
Comparative study of gene set enrichment methods.基因集富集方法的比较研究。
BMC Bioinformatics. 2009 Sep 2;10:275. doi: 10.1186/1471-2105-10-275.
5
Gene set analyses for interpreting microarray experiments on prokaryotic organisms.用于解释原核生物微阵列实验的基因集分析。
BMC Bioinformatics. 2008 Nov 5;9:469. doi: 10.1186/1471-2105-9-469.
6
GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies.GSEA-SNP:将基因集富集分析应用于全基因组关联研究的SNP数据。
Bioinformatics. 2008 Dec 1;24(23):2784-5. doi: 10.1093/bioinformatics/btn516. Epub 2008 Oct 14.
7
Comparison of pathway analysis approaches using lung cancer GWAS data sets.使用肺癌 GWAS 数据集的通路分析方法比较。
PLoS One. 2012;7(2):e31816. doi: 10.1371/journal.pone.0031816. Epub 2012 Feb 21.
8
Efficient pathway enrichment and network analysis of GWAS summary data using GSA-SNP2.利用 GSA-SNP2 对 GWAS 汇总数据进行有效的通路富集和网络分析。
Nucleic Acids Res. 2018 Jun 1;46(10):e60. doi: 10.1093/nar/gky175.
9
Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS.两种用于全基因组关联研究结果的基因集分析方法的性能比较:GSA-SNP与i-GSEA4GWAS
Genomics Inform. 2012 Jun;10(2):123-7. doi: 10.5808/GI.2012.10.2.123. Epub 2012 Jun 30.
10
GSEA-SNP identifies genes associated with Johne's disease in cattle.GSEA-SNP 鉴定与牛副结核病相关的基因。
Mamm Genome. 2010 Aug;21(7-8):419-25. doi: 10.1007/s00335-010-9278-2. Epub 2010 Aug 13.

引用本文的文献

1
Live-bearing cockroach genome reveals convergent evolutionary mechanisms linked to viviparity in insects and beyond.胎生蟑螂基因组揭示了与昆虫及其他生物胎生相关的趋同进化机制。
iScience. 2023 Sep 9;26(10):107832. doi: 10.1016/j.isci.2023.107832. eCollection 2023 Oct 20.
2
Integrative pathway and network analysis provide insights on flooding-tolerance genes in soybean.综合通路和网络分析为大豆耐涝基因提供了新见解。
Sci Rep. 2023 Feb 3;13(1):1980. doi: 10.1038/s41598-023-28593-1.
3
Convergent genomics of longevity in rockfishes highlights the genetics of human life span variation.

本文引用的文献

1
Gene set analyses for interpreting microarray experiments on prokaryotic organisms.用于解释原核生物微阵列实验的基因集分析。
BMC Bioinformatics. 2008 Nov 5;9:469. doi: 10.1186/1471-2105-9-469.
2
On the utility of gene set methods in genomewide association studies of quantitative traits.基因集方法在数量性状全基因组关联研究中的效用
Genet Epidemiol. 2008 Nov;32(7):658-68. doi: 10.1002/gepi.20334.
3
Gene-set approach for expression pattern analysis.用于表达模式分析的基因集方法。
长寿石斑鱼的趋同基因组学强调了人类寿命变化的遗传学基础。
Sci Adv. 2023 Jan 13;9(2):eadd2743. doi: 10.1126/sciadv.add2743. Epub 2023 Jan 11.
4
Signatures of Positive Selection in the Genome of : A Subspecies of European Honeybees.欧洲蜜蜂一个亚种基因组中的正选择特征
Life (Basel). 2022 Oct 19;12(10):1642. doi: 10.3390/life12101642.
5
Understanding signatures of positive natural selection in human zinc transporter genes.理解人类锌转运基因中正向自然选择的特征。
Sci Rep. 2022 Mar 12;12(1):4320. doi: 10.1038/s41598-022-08439-y.
6
Refining Convergent Rate Analysis with Topology in Mammalian Longevity and Marine Transitions.利用拓扑结构在哺乳动物寿命和海洋过渡中的收敛率分析的改进。
Mol Biol Evol. 2021 Oct 27;38(11):5190-5203. doi: 10.1093/molbev/msab226.
7
Prioritization and Evaluation of Flooding Tolerance Genes in Soybean [ (L.) Merr.].大豆[(L.)Merr.]耐涝性基因的优先级排序与评估
Front Genet. 2021 Jan 27;11:612131. doi: 10.3389/fgene.2020.612131. eCollection 2020.
8
Multi-Set Testing Strategies Show Good Behavior When Applied to Very Large Sets of Rare Variants.多集合测试策略在应用于非常大的稀有变异集合时表现良好。
Front Genet. 2020 Nov 9;11:591606. doi: 10.3389/fgene.2020.591606. eCollection 2020.
9
A multiscale approach to detect selection in nonmodel tree species: Widespread adaptation despite population decline in L.一种检测非模式树种选择的多尺度方法:尽管L.种群数量下降,但仍存在广泛适应性
Evol Appl. 2019 Jul 19;13(1):143-160. doi: 10.1111/eva.12838. eCollection 2020 Jan.
10
Adaptive Evolution of Animal Proteins over Development: Support for the Darwin Selection Opportunity Hypothesis of Evo-Devo.动物蛋白在发育过程中的适应性进化:对发育进化的达尔文选择机会假说的支持。
Mol Biol Evol. 2018 Dec 1;35(12):2862-2872. doi: 10.1093/molbev/msy175.
Brief Bioinform. 2008 May;9(3):189-97. doi: 10.1093/bib/bbn001. Epub 2008 Jan 17.
4
Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis.使用非参数多变量分析在微阵列研究中鉴定差异表达的基因类别。
Bioinformatics. 2008 Jan 15;24(2):192-201. doi: 10.1093/bioinformatics/btm583. Epub 2007 Nov 27.
5
Ensembl 2008.Ensembl 2008。
Nucleic Acids Res. 2008 Jan;36(Database issue):D707-14. doi: 10.1093/nar/gkm988. Epub 2007 Nov 13.
6
Pathway-based approaches for analysis of genomewide association studies.基于通路的全基因组关联研究分析方法。
Am J Hum Genet. 2007 Dec;81(6):1278-83. doi: 10.1086/522374.
7
Improving gene set analysis of microarray data by SAM-GS.通过SAM-GS改进微阵列数据的基因集分析
BMC Bioinformatics. 2007 Jul 5;8:242. doi: 10.1186/1471-2105-8-242.
8
Successful design and conduct of genome-wide association studies.全基因组关联研究的成功设计与实施。
Hum Mol Genet. 2007 Oct 15;16 Spec No. 2(SPEC):R220-5. doi: 10.1093/hmg/ddm161. Epub 2007 Jun 27.
9
Analyzing gene expression data in terms of gene sets: methodological issues.从基因集角度分析基因表达数据:方法学问题。
Bioinformatics. 2007 Apr 15;23(8):980-7. doi: 10.1093/bioinformatics/btm051. Epub 2007 Feb 15.
10
Microarray data analysis: from disarray to consolidation and consensus.微阵列数据分析:从混乱到整合与共识。
Nat Rev Genet. 2006 Jan;7(1):55-65. doi: 10.1038/nrg1749.