• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

两阶段全基因组关联研究中通过控制假发现率联合确定主要研究和复制研究的显著性水平。

Jointly determining significance levels of primary and replication studies by controlling the false discovery rate in two-stage genome-wide association studies.

机构信息

Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China.

出版信息

Stat Methods Med Res. 2018 Sep;27(9):2795-2808. doi: 10.1177/0962280216687168. Epub 2017 Jan 8.

DOI:10.1177/0962280216687168
PMID:28067114
Abstract

In genome-wide association studies, we normally discover associations between genetic variants and diseases/traits in primary studies, and validate the findings in replication studies. We consider the associations identified in both primary and replication studies as true findings. An important question under this two-stage setting is how to determine significance levels in both studies. In traditional methods, significance levels of the primary and replication studies are determined separately. We argue that the separate determination strategy reduces the power in the overall two-stage study. Therefore, we propose a novel method to determine significance levels jointly. Our method is a reanalysis method that needs summary statistics from both studies. We find the most powerful significance levels when controlling the false discovery rate in the two-stage study. To enjoy the power improvement from the joint determination method, we need to select single nucleotide polymorphisms for replication at a less stringent significance level. This is a common practice in studies designed for discovery purpose. We suggest this practice is also suitable in studies with validation purpose in order to identify more true findings. Simulation experiments show that our method can provide more power than traditional methods and that the false discovery rate is well-controlled. Empirical experiments on datasets of five diseases/traits demonstrate that our method can help identify more associations. The R-package is available at: http://bioinformatics.ust.hk/RFdr.html .

摘要

在全基因组关联研究中,我们通常在初步研究中发现遗传变异与疾病/特征之间的关联,并在复制研究中验证这些发现。我们将在初步和复制研究中发现的关联视为真实发现。在这种两阶段设置下,一个重要的问题是如何在两项研究中确定显著性水平。在传统方法中,初步和复制研究的显著性水平是分别确定的。我们认为,单独确定策略降低了整体两阶段研究的功效。因此,我们提出了一种联合确定显著性水平的新方法。我们的方法是一种需要从两项研究中汇总统计数据的重新分析方法。我们找到了在控制两阶段研究中的错误发现率时最强大的显著性水平。为了从联合确定方法中获得功效的提高,我们需要选择在较低显著性水平下用于复制的单核苷酸多态性。这是为发现目的而设计的研究中的常见做法。我们建议,这种做法也适用于具有验证目的的研究中,以便识别更多的真实发现。模拟实验表明,我们的方法比传统方法具有更高的功效,并且错误发现率得到了很好的控制。五个疾病/特征数据集的实证实验表明,我们的方法可以帮助识别更多的关联。R 包可在:http://bioinformatics.ust.hk/RFdr.html 获得。

相似文献

1
Jointly determining significance levels of primary and replication studies by controlling the false discovery rate in two-stage genome-wide association studies.两阶段全基因组关联研究中通过控制假发现率联合确定主要研究和复制研究的显著性水平。
Stat Methods Med Res. 2018 Sep;27(9):2795-2808. doi: 10.1177/0962280216687168. Epub 2017 Jan 8.
2
Controlling the joint local false discovery rate is more powerful than meta-analysis methods in joint analysis of summary statistics from multiple genome-wide association studies.在对多个全基因组关联研究的汇总统计数据进行联合分析时,控制联合局部错误发现率比元分析方法更有效。
Bioinformatics. 2017 Feb 15;33(4):500-507. doi: 10.1093/bioinformatics/btw690.
3
What is the probability of replicating a statistically significant association in genome-wide association studies?全基因组关联研究中复制具有统计学意义关联的可能性有多大?
Brief Bioinform. 2017 Nov 1;18(6):928-939. doi: 10.1093/bib/bbw091.
4
Power estimation and sample size determination for replication studies of genome-wide association studies.全基因组关联研究重复研究的功效估计与样本量确定
BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):3. doi: 10.1186/s12864-015-2296-4.
5
What should the genome-wide significance threshold be? Empirical replication of borderline genetic associations.全基因组显著阈值应为多少?边缘遗传关联的实证复制。
Int J Epidemiol. 2012 Feb;41(1):273-86. doi: 10.1093/ije/dyr178. Epub 2011 Dec 5.
6
Meta-analysis in genome-wide association datasets: strategies and application in Parkinson disease.全基因组关联数据集的荟萃分析:在帕金森病中的策略与应用。
PLoS One. 2007 Feb 7;2(2):e196. doi: 10.1371/journal.pone.0000196.
7
A pleiotropy-informed Bayesian false discovery rate adapted to a shared control design finds new disease associations from GWAS summary statistics.一种适应共享对照设计的多效性信息贝叶斯错误发现率可从全基因组关联研究汇总统计数据中发现新的疾病关联。
PLoS Genet. 2015 Feb 6;11(2):e1004926. doi: 10.1371/journal.pgen.1004926. eCollection 2015 Feb.
8
Exploring the Genetic Patterns of Complex Diseases via the Integrative Genome-Wide Approach.通过整合全基因组方法探索复杂疾病的遗传模式。
IEEE/ACM Trans Comput Biol Bioinform. 2016 May-Jun;13(3):557-64. doi: 10.1109/TCBB.2015.2459692.
9
Statistical power and significance testing in large-scale genetic studies.大规模遗传研究中的统计功效和显著性检验。
Nat Rev Genet. 2014 May;15(5):335-46. doi: 10.1038/nrg3706.
10
Extracting replicable associations across multiple studies: Empirical Bayes algorithms for controlling the false discovery rate.提取多项研究中的可重复关联:用于控制错误发现率的经验贝叶斯算法
PLoS Comput Biol. 2017 Aug 18;13(8):e1005700. doi: 10.1371/journal.pcbi.1005700. eCollection 2017 Aug.

引用本文的文献

1
The goldmine of GWAS summary statistics: a systematic review of methods and tools.全基因组关联研究汇总统计数据的宝库:方法与工具的系统综述
BioData Min. 2024 Sep 5;17(1):31. doi: 10.1186/s13040-024-00385-x.
2
Genetic polymorphisms in MTR are associated with non-syndromic congenital heart disease from a family-based case-control study in the Chinese population.基于家系的病例对照研究显示,MTR 中的遗传多态性与中国人群的非综合征性先天性心脏病有关。
Sci Rep. 2019 Mar 25;9(1):5065. doi: 10.1038/s41598-019-41641-z.