• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用加权假发现率控制和优先子集分析提高全基因组关联研究的效能。

Improving power of genome-wide association studies with weighted false discovery rate control and prioritized subset analysis.

机构信息

Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.

出版信息

PLoS One. 2012;7(4):e33716. doi: 10.1371/journal.pone.0033716. Epub 2012 Apr 9.

DOI:10.1371/journal.pone.0033716
PMID:22496761
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3322139/
Abstract

The issue of large-scale testing has caught much attention with the advent of high-throughput technologies. In genomic studies, researchers are often confronted with a large number of tests. To make simultaneous inference for the many tests, the false discovery rate (FDR) control provides a practical balance between the number of true positives and the number of false positives. However, when few hypotheses are truly non-null, controlling the FDR may not provide additional advantages over controlling the family-wise error rate (e.g., the Bonferroni correction). To facilitate discoveries from a study, weighting tests according to prior information is a promising strategy. A 'weighted FDR control' (WEI) and a 'prioritized subset analysis' (PSA) have caught much attention. In this work, we compare the two weighting schemes with systematic simulation studies and demonstrate their use with a genome-wide association study (GWAS) on type 1 diabetes provided by the Wellcome Trust Case Control Consortium. The PSA and the WEI both can increase power when the prior is informative. With accurate and precise prioritization, the PSA can especially create substantial power improvements over the commonly-used whole-genome single-step FDR adjustment (i.e., the traditional un-weighted FDR control). When the prior is uninformative (true disease susceptibility regions are not prioritized), the power loss of the PSA and the WEI is almost negligible. However, a caution is that the overall FDR of the PSA can be slightly inflated if the prioritization is not accurate and precise. Our study highlights the merits of using information from mounting genetic studies, and provides insights to choose an appropriate weighting scheme to FDR control on GWAS.

摘要

随着高通量技术的出现,大规模测试的问题引起了广泛关注。在基因组学研究中,研究人员经常面临大量的测试。为了对许多测试进行同时推断,错误发现率(FDR)控制在真阳性和假阳性的数量之间提供了实际的平衡。然而,当很少的假设真正是非零假设时,控制 FDR 可能不会比控制总体错误率(例如 Bonferroni 校正)提供额外的优势。为了促进研究中的发现,根据先验信息对测试进行加权是一种很有前途的策略。一种“加权 FDR 控制”(WEI)和一种“优先子集分析”(PSA)引起了广泛关注。在这项工作中,我们通过系统模拟研究比较了这两种加权方案,并使用来自 Wellcome Trust Case Control Consortium 的 1 型糖尿病全基因组关联研究(GWAS)展示了它们的使用。当先验信息丰富时,PSA 和 WEI 都可以提高功效。通过准确和精确的优先级排序,PSA 可以在常用的全基因组单步 FDR 调整(即传统的无加权 FDR 控制)的基础上,特别是可以显著提高功效。当先验信息不丰富时(真正的疾病易感性区域没有被优先排序),PSA 和 WEI 的功效损失几乎可以忽略不计。然而,需要注意的是,如果优先级排序不准确和精确,PSA 的总体 FDR 可能会略微膨胀。我们的研究强调了利用越来越多的遗传研究信息的优点,并提供了选择适当的加权方案来控制 GWAS 中 FDR 的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/4408acb7410d/pone.0033716.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/75b6f79dc080/pone.0033716.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/3663e5de05cd/pone.0033716.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/7ae4ff61056e/pone.0033716.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/f8ae2f4e284f/pone.0033716.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/4408acb7410d/pone.0033716.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/75b6f79dc080/pone.0033716.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/3663e5de05cd/pone.0033716.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/7ae4ff61056e/pone.0033716.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/f8ae2f4e284f/pone.0033716.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff96/3322139/4408acb7410d/pone.0033716.g005.jpg

相似文献

1
Improving power of genome-wide association studies with weighted false discovery rate control and prioritized subset analysis.利用加权假发现率控制和优先子集分析提高全基因组关联研究的效能。
PLoS One. 2012;7(4):e33716. doi: 10.1371/journal.pone.0033716. Epub 2012 Apr 9.
2
Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies.用于大规模假设检验的分层错误发现控制及其在全基因组关联研究中的应用。
Genet Epidemiol. 2006 Sep;30(6):519-30. doi: 10.1002/gepi.20164.
3
Prioritized subset analysis: improving power in genome-wide association studies.优先子集分析:提高全基因组关联研究的效能
Hum Hered. 2008;65(3):129-41. doi: 10.1159/000109730. Epub 2007 Oct 12.
4
Controlling the Rate of GWAS False Discoveries.控制全基因组关联研究的错误发现率
Genetics. 2017 Jan;205(1):61-75. doi: 10.1534/genetics.116.193987. Epub 2016 Oct 26.
5
Power and type I error rate of false discovery rate approaches in genome-wide association studies.全基因组关联研究中错误发现率方法的功效和Ⅰ型错误率。
BMC Genet. 2005 Dec 30;6 Suppl 1(Suppl 1):S134. doi: 10.1186/1471-2156-6-S1-S134.
6
Were genome-wide linkage studies a waste of time? Exploiting candidate regions within genome-wide association studies.全基因组连锁研究是否浪费时间?在全基因组关联研究中利用候选区域。
Genet Epidemiol. 2010 Feb;34(2):107-18. doi: 10.1002/gepi.20438.
7
Hidden Markov models for controlling false discovery rate in genome-wide association analysis.用于全基因组关联分析中控制错误发现率的隐马尔可夫模型
Methods Mol Biol. 2012;802:337-44. doi: 10.1007/978-1-61779-400-1_22.
8
Control procedures and estimators of the false discovery rate and their application in low-dimensional settings: an empirical investigation.控制程序和虚假发现率的估计及其在低维环境中的应用:实证研究。
BMC Bioinformatics. 2018 Mar 2;19(1):78. doi: 10.1186/s12859-018-2081-x.
9
A powerful statistical framework for generalization testing in GWAS, with application to the HCHS/SOL.一种用于全基因组关联研究中泛化测试的强大统计框架,并应用于西班牙裔社区健康研究/拉丁裔研究(HCHS/SOL)。
Genet Epidemiol. 2017 Apr;41(3):251-258. doi: 10.1002/gepi.22029. Epub 2017 Jan 15.
10
A pleiotropy-informed Bayesian false discovery rate adapted to a shared control design finds new disease associations from GWAS summary statistics.一种适应共享对照设计的多效性信息贝叶斯错误发现率可从全基因组关联研究汇总统计数据中发现新的疾病关联。
PLoS Genet. 2015 Feb 6;11(2):e1004926. doi: 10.1371/journal.pgen.1004926. eCollection 2015 Feb.

引用本文的文献

1
Weighted False Discovery Rate Control in Large-Scale Multiple Testing.大规模多重检验中的加权错误发现率控制
J Am Stat Assoc. 2018;113(523):1172-1183. doi: 10.1080/01621459.2017.1336443. Epub 2018 Jun 12.
2
Hierarchical investigation of genetic influences on response inhibition in healthy young adults.对健康年轻成年人反应抑制中遗传影响的分层研究。
Exp Clin Psychopharmacol. 2017 Dec;25(6):512-520. doi: 10.1037/pha0000156.
3
CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data.芝加哥:在捕获Hi-C数据中对DNA环化相互作用进行稳健检测。

本文引用的文献

1
Polymorphic variants of the IL2RA gene and susceptibility to type 1 diabetes in the Polish population.波兰人群中IL2RA基因的多态性变异与1型糖尿病易感性
Tissue Antigens. 2012 Mar;79(3):198-203. doi: 10.1111/j.1399-0039.2011.01828.x. Epub 2011 Dec 30.
2
Association between age, IL-10, IFNγ, stimulated C-peptide and disease progression in children with newly diagnosed Type 1 diabetes.新诊断 1 型糖尿病患儿年龄、IL-10、IFNγ、刺激 C 肽与疾病进展的关系。
Diabet Med. 2012 Jun;29(6):734-41. doi: 10.1111/j.1464-5491.2011.03544.x.
3
HLA-DQ A1, -DQB1 and -DRB1 gene polymorphism--in Malay type 1 diabetes mellitus patients and their use for risk prediction.
Genome Biol. 2016 Jun 15;17(1):127. doi: 10.1186/s13059-016-0992-2.
4
Exome sequencing of oral squamous cell carcinoma in users of Arabian snuff reveals novel candidates for driver genes.对使用阿拉伯鼻烟者的口腔鳞状细胞癌进行外显子组测序,发现了驱动基因的新候选基因。
Int J Cancer. 2016 Jul 15;139(2):363-72. doi: 10.1002/ijc.30068. Epub 2016 Mar 18.
5
Multiscale modeling of the causal functional roles of nsSNPs in a genome-wide association study: application to hypoxia.在全基因组关联研究中对 nsSNPs 的因果功能作用进行多尺度建模:在缺氧中的应用。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S9. doi: 10.1186/1471-2164-14-S3-S9. Epub 2013 May 28.
6
Transcriptome and proteome exploration to provide a resource for the study of Agrocybe aegerita.转录组和蛋白质组学探索为研究草菇提供资源。
PLoS One. 2013;8(2):e56686. doi: 10.1371/journal.pone.0056686. Epub 2013 Feb 13.
HLA-DQ A1、-DQB1和-DRB1基因多态性——在马来西亚1型糖尿病患者中的情况及其用于风险预测的应用
Med J Malaysia. 2011 Jun;66(2):133-7.
4
Increased risk of type 1 diabetes in Polish children - association with INS-IGF2 5'VNTR and lack of association with HLA haplotype.波兰儿童 1 型糖尿病风险增加——与 INS-IGF2 5'VNTR 相关,与 HLA 单倍型无关。
Endokrynol Pol. 2011;62(5):436-42.
5
Pancreatic autoantibodies, HLA DR and PTPN22 polymorphisms in first degree relatives of patients with type 1 diabetes and multiethnic background.1型糖尿病患者及多民族背景的一级亲属中的胰腺自身抗体、HLA DR和PTPN22基因多态性
Exp Clin Endocrinol Diabetes. 2011 Nov;119(10):618-20. doi: 10.1055/s-0031-1280799. Epub 2011 Nov 8.
6
Healthy first-degree relatives of patients with type 1 diabetes exhibit significant differences in basal gene expression pattern of immunocompetent cells compared to controls: expression pattern as predeterminant of autoimmune diabetes.与对照组相比,1型糖尿病患者的健康一级亲属在免疫活性细胞的基础基因表达模式上表现出显著差异:表达模式作为自身免疫性糖尿病的决定因素。
Scand J Immunol. 2012 Feb;75(2):210-9. doi: 10.1111/j.1365-3083.2011.02637.x.
7
Study of polymorphisms in 4q27, 10p15, and 22q13 regions in autoantibodies stratified type 1 diabetes patients.对 1 型糖尿病患者分层的自身抗体中 4q27、10p15 和 22q13 区域的多态性研究。
Autoimmunity. 2011 Dec;44(8):624-30. doi: 10.3109/08916934.2011.592515. Epub 2011 Aug 30.
8
Association of PTPN22 1858T/T genotype with type 1 diabetes, Graves' disease but not with rheumatoid arthritis in Russian population.俄罗斯人群中PTPN22 1858T/T基因型与1型糖尿病、格雷夫斯病相关,但与类风湿关节炎无关。
Aging (Albany NY). 2011 Apr;3(4):368-73. doi: 10.18632/aging.100305.
9
Next-generation association studies for complex traits.下一代复杂性状的关联研究。
Nat Genet. 2011 Mar 29;43(4):287-8. doi: 10.1038/ng0411-287.
10
The PTPN22 C1858T gene variant is associated with proinsulin in new-onset type 1 diabetes.PTPN22 C1858T 基因变异与新发 1 型糖尿病的胰岛素原有关。
BMC Med Genet. 2011 Mar 23;12:41. doi: 10.1186/1471-2350-12-41.