• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

还有多少遗传变异有待发现?

How many genetic variants remain to be discovered?

机构信息

Department of Medical Epidemiology, Karolinska Institutet, Stockholm, Sweden.

出版信息

PLoS One. 2009 Dec 2;4(12):e7969. doi: 10.1371/journal.pone.0007969.

DOI:10.1371/journal.pone.0007969
PMID:19956539
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2780697/
Abstract

A great majority of genetic markers discovered in recent genome-wide association studies have small effect sizes, and they explain only a small fraction of the genetic contribution to the diseases. How many more variants can we expect to discover and what study sizes are needed? We derive the connection between the cumulative risk of the SNP variants to the latent genetic risk model and heritability of the disease. We determine the sample size required for case-control studies in order to achieve a certain expected number of discoveries in a collection of most significant SNPs. Assuming similar allele frequencies and effect sizes of the currently validated SNPs, complex phenotypes such as type-2 diabetes would need approximately 800 variants to explain its 40% heritability. Much smaller numbers of variants are needed if we assume rare-variants but higher penetrance models. We estimate that up to 50,000 cases and an equal number of controls are needed to discover 800 common low-penetrant variants among the top 5000 SNPs. Under common and rare low-penetrance models, the very large studies required to discover the numerous variants are probably at the limit of practical feasibility. Under rare-variant with medium- to high-penetrance models (odds-ratios between 1.6 and 4.0), studies comparable in size to many existing studies are adequate provided the genotyping technology can interrogate more and rarer variants.

摘要

绝大多数在最近的全基因组关联研究中发现的遗传标记具有较小的效应大小,并且仅解释了疾病遗传贡献的一小部分。我们可以期望发现多少更多的变体,以及需要多大的研究规模?我们推导出 SNP 变体的累积风险与潜在遗传风险模型和疾病遗传性之间的联系。我们确定了病例对照研究所需的样本量,以便在一组最重要的 SNP 中达到一定数量的预期发现。假设目前已验证的 SNP 的等位基因频率和效应大小相似,那么像 2 型糖尿病这样的复杂表型需要大约 800 个变体才能解释其 40%的遗传性。如果我们假设稀有变体但具有更高外显率的模型,则需要的变体数量要少得多。我们估计,需要多达 50,000 例病例和相同数量的对照,才能在排名前 5,000 的 SNP 中发现 800 个常见的低外显率变体。在常见和稀有低外显率模型下,发现大量常见低外显率变体所需的非常大规模的研究可能已经达到了实际可行性的极限。在罕见变体且具有中到高外显率的模型(比值比在 1.6 到 4.0 之间)下,如果基因分型技术能够检测到更多和更罕见的变体,则与许多现有研究规模相当的研究就足够了。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/4e1b5901ee56/pone.0007969.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/ab6ee655b729/pone.0007969.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/3e3660d2fc3c/pone.0007969.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/2e6b6316b51d/pone.0007969.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/4e1b5901ee56/pone.0007969.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/ab6ee655b729/pone.0007969.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/3e3660d2fc3c/pone.0007969.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/2e6b6316b51d/pone.0007969.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a3/2780697/4e1b5901ee56/pone.0007969.g004.jpg

相似文献

1
How many genetic variants remain to be discovered?还有多少遗传变异有待发现?
PLoS One. 2009 Dec 2;4(12):e7969. doi: 10.1371/journal.pone.0007969.
2
The distribution of common-variant effect sizes.常见变异效应大小的分布。
Nat Genet. 2021 Aug;53(8):1243-1249. doi: 10.1038/s41588-021-00901-3. Epub 2021 Jul 29.
3
Evaluating the breast cancer predisposition role of rare variants in genes associated with low-penetrance breast cancer risk SNPs.评估与低外显率乳腺癌风险 SNP 相关基因中的罕见变异在乳腺癌易感性中的作用。
Breast Cancer Res. 2018 Jan 9;20(1):3. doi: 10.1186/s13058-017-0929-z.
4
Genetic variants primarily associated with type 2 diabetes are related to coronary artery disease risk.主要与2型糖尿病相关的基因变异与冠状动脉疾病风险有关。
Atherosclerosis. 2015 Aug;241(2):419-26. doi: 10.1016/j.atherosclerosis.2015.05.033. Epub 2015 Jun 3.
5
The power of genome-wide association studies of complex disease genes: statistical limitations of indirect approaches using SNP markers.复杂疾病基因的全基因组关联研究的力量:使用单核苷酸多态性(SNP)标记的间接方法的统计局限性。
J Hum Genet. 2001;46(8):478-82. doi: 10.1007/s100380170048.
6
The expected power of genome-wide linkage disequilibrium testing using single nucleotide polymorphism markers for detecting a low-frequency disease variant.使用单核苷酸多态性标记进行全基因组连锁不平衡检测以发现低频疾病变异的预期效能。
Ann Hum Genet. 2002 Jul;66(Pt 4):297-306. doi: 10.1017/S0003480002001197.
7
Interpretation of genetic association studies: markers with replicated highly significant odds ratios may be poor classifiers.基因关联研究的解读:具有重复出现的高度显著优势比的标记物可能是较差的分类指标。
PLoS Genet. 2009 Feb;5(2):e1000337. doi: 10.1371/journal.pgen.1000337. Epub 2009 Feb 6.
8
Underestimated effect sizes in GWAS: fundamental limitations of single SNP analysis for dichotomous phenotypes.GWAS 中被低估的效应大小:二分类表型中单 SNP 分析的基本局限性。
PLoS One. 2011;6(11):e27964. doi: 10.1371/journal.pone.0027964. Epub 2011 Nov 28.
9
FSHB -211 G>T is a major genetic modulator of reproductive physiology and health in childbearing age women.FSHB-211G>T 是生育期女性生殖生理和健康的主要遗传调节剂。
Hum Reprod. 2018 May 1;33(5):954-966. doi: 10.1093/humrep/dey057.
10
IRS1 G972R polymorphism and type 2 diabetes: a paradigm for the difficult ascertainment of the contribution to disease susceptibility of 'low-frequency-low-risk' variants.IRS1 G972R 多态性与 2 型糖尿病:“低频率-低风险”变异对疾病易感性贡献的艰难确定模式。
Diabetologia. 2009 Sep;52(9):1852-7. doi: 10.1007/s00125-009-1426-4. Epub 2009 Jun 26.

引用本文的文献

1
DINGO: increasing the power of locus discovery in maternal and fetal genome-wide association studies of perinatal traits.DINGO:提高母胎全基因组关联研究中围产儿特征的定位发现能力。
Nat Commun. 2024 Oct 26;15(1):9255. doi: 10.1038/s41467-024-53495-9.
2
Cross-ancestry analysis of brain QTLs enhances interpretation of schizophrenia genome-wide association studies.跨血统脑 QTLs 分析增强了精神分裂症全基因组关联研究的解释。
Am J Hum Genet. 2024 Nov 7;111(11):2444-2457. doi: 10.1016/j.ajhg.2024.09.001. Epub 2024 Oct 2.
3
Contribution of rare variants to heritability of a disease is much greater than conventionally estimated: modification of allele distribution model.

本文引用的文献

1
Human genetic variation and its contribution to complex traits.人类遗传变异及其对复杂性状的贡献。
Nat Rev Genet. 2009 Apr;10(4):241-51. doi: 10.1038/nrg2554.
2
Variants in TF and HFE explain approximately 40% of genetic variation in serum-transferrin levels.转铁蛋白(TF)和遗传性血色素沉着症蛋白(HFE)中的变异解释了血清转铁蛋白水平约40%的遗传变异。
Am J Hum Genet. 2009 Jan;84(1):60-5. doi: 10.1016/j.ajhg.2008.11.011. Epub 2008 Dec 11.
3
Prediction of individual genetic risk of complex disease.复杂疾病个体遗传风险的预测。
罕见变异对疾病遗传力的贡献比传统估计要大得多:等位基因分布模型的修正。
J Hum Genet. 2024 Dec;69(12):663-668. doi: 10.1038/s10038-024-01281-2. Epub 2024 Aug 20.
4
Direct and INdirect effects analysis of Genetic lOci (DINGO): A software package to increase the power of locus discovery in GWAS meta-analyses of perinatal phenotypes and traits influenced by indirect genetic effects.遗传位点的直接和间接效应分析(DINGO):一种用于提高全基因组关联研究(GWAS)荟萃分析中基因座发现能力的软件包,该分析针对受间接遗传效应影响的围产期表型和性状。
medRxiv. 2023 Aug 28:2023.08.22.23294446. doi: 10.1101/2023.08.22.23294446.
5
Genome-wide analysis provides genetic evidence that ACE2 influences COVID-19 risk and yields risk scores associated with severe disease.全基因组分析提供了遗传证据,表明 ACE2 影响 COVID-19 风险,并产生与严重疾病相关的风险评分。
Nat Genet. 2022 Apr;54(4):382-392. doi: 10.1038/s41588-021-01006-7. Epub 2022 Mar 3.
6
On the Transformation of Genetic Effect Size from Logit to Liability Scale.遗传效应大小从对数几率到 Liability 尺度的转换。
Behav Genet. 2021 May;51(3):215-222. doi: 10.1007/s10519-021-10042-2. Epub 2021 Feb 25.
7
Assessing the performance of genome-wide association studies for predicting disease risk.评估全基因组关联研究预测疾病风险的性能。
PLoS One. 2019 Dec 5;14(12):e0220215. doi: 10.1371/journal.pone.0220215. eCollection 2019.
8
Comparative genetic architectures of schizophrenia in East Asian and European populations.东亚和欧洲人群精神分裂症的比较遗传结构。
Nat Genet. 2019 Dec;51(12):1670-1678. doi: 10.1038/s41588-019-0512-x. Epub 2019 Nov 18.
9
Future Preventive Gene Therapy of Polygenic Diseases from a Population Genetics Perspective.从群体遗传学角度看多基因疾病的未来预防性基因治疗。
Int J Mol Sci. 2019 Oct 10;20(20):5013. doi: 10.3390/ijms20205013.
10
Evaluating the Potential of Younger Cases and Older Controls Cohorts to Improve Discovery Power in Genome-Wide Association Studies of Late-Onset Diseases.评估年轻病例组和老年对照组队列在晚发性疾病全基因组关联研究中提高发现能力的潜力。
J Pers Med. 2019 Jul 22;9(3):38. doi: 10.3390/jpm9030038.
Curr Opin Genet Dev. 2008 Jun;18(3):257-63. doi: 10.1016/j.gde.2008.07.006. Epub 2008 Aug 28.
4
Estimating coverage and power for genetic association studies using near-complete variation data.利用近乎完整的变异数据估计基因关联研究的覆盖率和效能。
Nat Genet. 2008 Jul;40(7):841-3. doi: 10.1038/ng.180. Epub 2008 Jun 22.
5
What can genome-wide association studies tell us about the genetics of common disease?全基因组关联研究能让我们了解常见疾病的哪些遗传信息?
PLoS Genet. 2008 Feb;4(2):e33. doi: 10.1371/journal.pgen.0040033.
6
Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes.全基因组关联数据的荟萃分析及大规模重复研究确定了2型糖尿病的其他易感基因座。
Nat Genet. 2008 May;40(5):638-45. doi: 10.1038/ng.120. Epub 2008 Mar 30.
7
CHEK2*1100delC genotyping for clinical assessment of breast cancer risk: meta-analyses of 26,000 patient cases and 27,000 controls.用于乳腺癌风险临床评估的CHEK2*1100delC基因分型:对26000例患者病例和27000例对照的荟萃分析
J Clin Oncol. 2008 Feb 1;26(4):542-8. doi: 10.1200/JCO.2007.12.5922. Epub 2008 Jan 2.
8
Common sequence variants in the LOXL1 gene confer susceptibility to exfoliation glaucoma.赖氨酰氧化酶样蛋白1(LOXL1)基因中的常见序列变异会增加剥脱性青光眼的易感性。
Science. 2007 Sep 7;317(5843):1397-400. doi: 10.1126/science.1146554. Epub 2007 Aug 9.
9
Founder mutations in BRCA1 and BRCA2 genes.BRCA1和BRCA2基因的奠基者突变
Ann Oncol. 2007 Jun;18 Suppl 6:vi93-8. doi: 10.1093/annonc/mdm234.
10
Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.对14000例七种常见疾病患者及3000例共享对照进行全基因组关联研究。
Nature. 2007 Jun 7;447(7145):661-78. doi: 10.1038/nature05911.