• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在病例对照全基因组关联研究中检测疾病相关单核苷酸多态性的概率。

Probability of detecting disease-associated single nucleotide polymorphisms in case-control genome-wide association studies.

作者信息

Gail Mitchell H, Pfeiffer Ruth M, Wheeler William, Pee David

机构信息

Division of Cancer Epidemiology and Genetics, National Cancer Institute, 6120 Executive Boulevard, EPS 8032, Bethesda, MD 20892-7244, USA.

出版信息

Biostatistics. 2008 Apr;9(2):201-15. doi: 10.1093/biostatistics/kxm032. Epub 2007 Sep 14.

DOI:10.1093/biostatistics/kxm032
PMID:17873152
Abstract

Some case-control genome-wide association studies (CCGWASs) select promising single nucleotide polymorphisms (SNPs) by ranking corresponding p-values, rather than by applying the same p-value threshold to each SNP. For such a study, we define the detection probability (DP) for a specific disease-associated SNP as the probability that the SNP will be "T-selected," namely have one of the top T largest chi-square values (or smallest p-values) for trend tests of association. The corresponding proportion positive (PP) is the fraction of selected SNPs that are true disease-associated SNPs. We study DP and PP analytically and via simulations, both for fixed and for random effects models of genetic risk, that allow for heterogeneity in genetic risk. DP increases with genetic effect size and case-control sample size and decreases with the number of nondisease-associated SNPs, mainly through the ratio of T to N, the total number of SNPs. We show that DP increases very slowly with T, and the increment in DP per unit increase in T declines rapidly with T. DP is also diminished if the number of true disease SNPs exceeds T. For a genetic odds ratio per minor disease allele of 1.2 or less, even a CCGWAS with 1000 cases and 1000 controls requires T to be impractically large to achieve an acceptable DP, leading to PP values so low as to make the study futile and misleading. We further calculate the sample size of the initial CCGWAS that is required to minimize the total cost of a research program that also includes follow-up studies to examine the T-selected SNPs. A large initial CCGWAS is desirable if genetic effects are small or if the cost of a follow-up study is large.

摘要

一些病例对照全基因组关联研究(CCGWAS)通过对相应的p值进行排序来选择有前景的单核苷酸多态性(SNP),而不是对每个SNP应用相同的p值阈值。对于这样一项研究,我们将特定疾病相关SNP的检测概率(DP)定义为该SNP被“T选择”的概率,即对于关联趋势检验具有T个最大卡方值(或最小p值)之一。相应的阳性比例(PP)是所选SNP中真正与疾病相关的SNP的比例。我们通过分析和模拟研究了DP和PP,包括固定效应模型和随机效应模型,这些模型考虑了遗传风险的异质性。DP随着遗传效应大小和病例对照样本量的增加而增加,随着非疾病相关SNP数量的增加而减少,主要是通过T与SNP总数N的比例。我们表明,DP随着T的增加非常缓慢,并且T每增加一个单位,DP的增量会随着T的增加而迅速下降。如果真正的疾病SNP数量超过T,DP也会降低。对于每个次要疾病等位基因的遗传优势比为1.2或更低的情况,即使是一项有1000例病例和1000例对照的CCGWAS,也需要T大到不切实际的程度才能达到可接受的DP,导致PP值低到使该研究徒劳且具有误导性。我们进一步计算了初始CCGWAS的样本量,该样本量是使一个研究项目的总成本最小化所必需的,该研究项目还包括对T选择的SNP进行后续研究。如果遗传效应较小或后续研究成本较高,那么较大的初始CCGWAS是可取的。

相似文献

1
Probability of detecting disease-associated single nucleotide polymorphisms in case-control genome-wide association studies.在病例对照全基因组关联研究中检测疾病相关单核苷酸多态性的概率。
Biostatistics. 2008 Apr;9(2):201-15. doi: 10.1093/biostatistics/kxm032. Epub 2007 Sep 14.
2
[How about the uncertainty in the haplotypes in the population-based KORA studies?].基于人群的KORA研究中,单倍型的不确定性情况如何?
Gesundheitswesen. 2005 Aug;67 Suppl 1:S132-6. doi: 10.1055/s-2005-858253.
3
Genetic polymorphisms of the HCR gene and a genomic segment in close proximity to HLA-C are associated with patients with psoriasis in Taiwan.HCR基因的遗传多态性以及与HLA - C紧密相邻的一个基因组片段与台湾地区的银屑病患者相关。
Br J Dermatol. 2004 Jun;150(6):1104-11. doi: 10.1111/j.1365-2133.2004.05972.x.
4
The expected power of genome-wide linkage disequilibrium testing using single nucleotide polymorphism markers for detecting a low-frequency disease variant.使用单核苷酸多态性标记进行全基因组连锁不平衡检测以发现低频疾病变异的预期效能。
Ann Hum Genet. 2002 Jul;66(Pt 4):297-306. doi: 10.1017/S0003480002001197.
5
The power of genome-wide association studies of complex disease genes: statistical limitations of indirect approaches using SNP markers.复杂疾病基因的全基因组关联研究的力量:使用单核苷酸多态性(SNP)标记的间接方法的统计局限性。
J Hum Genet. 2001;46(8):478-82. doi: 10.1007/s100380170048.
6
Probability that a two-stage genome-wide association study will detect a disease-associated snp and implications for multistage designs.两阶段全基因组关联研究检测疾病相关单核苷酸多态性的概率及其对多阶段设计的意义。
Ann Hum Genet. 2008 Nov;72(Pt 6):812-20. doi: 10.1111/j.1469-1809.2008.00467.x. Epub 2008 Jul 24.
7
Detailed analysis of the relative power of direct and indirect association studies and the implications for their interpretation.直接关联研究与间接关联研究相对效力的详细分析及其解读的意义。
Hum Hered. 2007;64(1):63-73. doi: 10.1159/000101424. Epub 2007 Apr 27.
8
Poor replication of candidate genes for major depressive disorder using genome-wide association data.利用全基因组关联数据对重度抑郁症候选基因的复制效果不佳。
Mol Psychiatry. 2011 May;16(5):516-32. doi: 10.1038/mp.2010.38. Epub 2010 Mar 30.
9
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
10
Analysis of single-locus tests to detect gene/disease associations.用于检测基因/疾病关联的单基因座测试分析。
Genet Epidemiol. 2005 Apr;28(3):207-19. doi: 10.1002/gepi.20050.

引用本文的文献

1
Robust Tests in Genome-Wide Scans under Incomplete Linkage Disequilibrium.不完全连锁不平衡下全基因组扫描中的稳健检验
Stat Sci. 2009 Nov;24(4):503-516. doi: 10.1214/09-sts314. Epub 2010 Apr 20.
2
Genome-wide association analysis of fleece traits in Northwest Xizang white cashmere goat.西藏西北部白绒山羊羊毛性状的全基因组关联分析
Front Vet Sci. 2024 May 30;11:1409084. doi: 10.3389/fvets.2024.1409084. eCollection 2024.
3
Comparison of approaches for incorporating new information into existing risk prediction models.将新信息纳入现有风险预测模型的方法比较。
Stat Med. 2017 Mar 30;36(7):1134-1156. doi: 10.1002/sim.7190. Epub 2016 Dec 11.
4
Regionally Smoothed Meta-Analysis Methods for GWAS Datasets.全基因组关联研究数据集的区域平滑荟萃分析方法。
Genet Epidemiol. 2016 Feb;40(2):154-60. doi: 10.1002/gepi.21949. Epub 2015 Dec 28.
5
The ranking probability approach and its usage in design and analysis of large-scale studies.排名概率法及其在大规模研究设计和分析中的应用。
PLoS One. 2013 Dec 20;8(12):e83079. doi: 10.1371/journal.pone.0083079. eCollection 2013.
6
ON MODEL SELECTION STRATEGIES TO IDENTIFY GENES UNDERLYING BINARY TRAITS USING GENOME-WIDE ASSOCIATION DATA.关于使用全基因组关联数据识别二元性状潜在基因的模型选择策略
Stat Sin. 2012;22:1041-1074.
7
In Vitro Assessment of the Inflammatory Breast Cancer Cell Line SUM 149: Discovery of 2 Single Nucleotide Polymorphisms in the RNase L Gene.体外评估炎症性乳腺癌细胞系 SUM 149:RNase L 基因中 2 个单核苷酸多态性的发现。
J Cancer. 2013;4(2):104-16. doi: 10.7150/jca.5002. Epub 2013 Jan 10.
8
Sample size and power analysis for sparse signal recovery in genome-wide association studies.全基因组关联研究中稀疏信号恢复的样本量与效能分析
Biometrika. 2011 Jun;98(2):273-290. doi: 10.1093/biomet/asr003.
9
Using Ascertainment for Targeted Resequencing to Increase Power to Identify Causal Variants.利用确定性进行靶向重测序以提高识别因果变异的能力。
Stat Interface. 2011;4(3):285-294. doi: 10.4310/sii.2011.v4.n3.a3.
10
Novel rank-based approaches for discovery and replication in genome-wide association studies.基于新的等级方法的全基因组关联研究中的发现和复制。
Genetics. 2011 Sep;189(1):329-40. doi: 10.1534/genetics.111.130542. Epub 2011 Jul 29.