• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

群体遗传学揭示了分析罕见变异时所面临的挑战。

Population genetics identifies challenges in analyzing rare variants.

作者信息

Johnston Henry Richard, Hu Yijuan, Cutler David J

机构信息

Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia, United States of America.

出版信息

Genet Epidemiol. 2015 Mar;39(3):145-8. doi: 10.1002/gepi.21881. Epub 2015 Jan 12.

DOI:10.1002/gepi.21881
PMID:25640419
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4366269/
Abstract

Geneticists have, for years, understood the nature of genome-wide association studies using common genomic variants. Recently, however, focus has shifted to the analysis of rare variants. This presents potential problems for researchers, as rare variants do not always behave in the same way common variants do, sometimes rendering decades of solid intuition moot. In this paper, we present examples of the differences between common and rare variants. We show why one must be significantly more careful about the origin of rare variants, and how failing to do so can lead to highly inflated type I error. We then explain how to best avoid such concerns with careful understanding and study design. Additionally, we demonstrate that a seemingly low error rate in next-generation sequencing can dramatically impact the false-positive rate for rare variants. This is due to the fact that rare variants are, by definition, seen infrequently, making it hard to distinguish between errors and real variants. Compounding this problem is the fact that the proportion of errors is likely to get worse, not better, with increasing sample size. One cannot simply scale their way up in order to solve this problem. Understanding these potential pitfalls is a key step in successfully identifying true associations between rare variants and diseases.

摘要

多年来,遗传学家已经了解了使用常见基因组变异进行全基因组关联研究的本质。然而,最近研究重点已转向对罕见变异的分析。这给研究人员带来了潜在问题,因为罕见变异的行为方式并不总是与常见变异相同,有时会使数十年的可靠直觉变得毫无意义。在本文中,我们展示了常见变异与罕见变异之间差异的实例。我们说明了为何必须更加谨慎地对待罕见变异的来源,以及不这样做如何会导致I型错误率大幅虚增。然后我们解释了如何通过仔细理解和研究设计来最好地避免此类问题。此外,我们证明了下一代测序中看似较低的错误率会极大地影响罕见变异的假阳性率。这是因为根据定义,罕见变异出现频率很低,难以区分错误和真实变异。使这个问题更加复杂的是,随着样本量增加,错误比例可能会变得更糟,而不是更好。人们不能简单地通过扩大规模来解决这个问题。了解这些潜在陷阱是成功识别罕见变异与疾病之间真正关联的关键一步。

相似文献

1
Population genetics identifies challenges in analyzing rare variants.群体遗传学揭示了分析罕见变异时所面临的挑战。
Genet Epidemiol. 2015 Mar;39(3):145-8. doi: 10.1002/gepi.21881. Epub 2015 Jan 12.
2
Rare variant association test with multiple phenotypes.针对多种表型的罕见变异关联测试。
Genet Epidemiol. 2017 Apr;41(3):198-209. doi: 10.1002/gepi.22021. Epub 2016 Dec 31.
3
How important are rare variants in common disease?罕见变异在常见疾病中有多重要?
Brief Funct Genomics. 2014 Sep;13(5):353-61. doi: 10.1093/bfgp/elu025. Epub 2014 Jul 8.
4
DoEstRare: A statistical test to identify local enrichments in rare genomic variants associated with disease.DoEstRare:一种用于识别与疾病相关的罕见基因组变异中局部富集情况的统计检验。
PLoS One. 2017 Jul 24;12(7):e0179364. doi: 10.1371/journal.pone.0179364. eCollection 2017.
5
Association studies for next-generation sequencing.下一代测序的关联研究。
Genome Res. 2011 Jul;21(7):1099-108. doi: 10.1101/gr.115998.110. Epub 2011 Apr 26.
6
Rare variant association testing in the non-coding genome.非编码基因组中的罕见变异关联测试。
Hum Genet. 2020 Nov;139(11):1345-1362. doi: 10.1007/s00439-020-02190-y. Epub 2020 Jun 4.
7
Large-scale detection of rare variants via pooled multiplexed next-generation sequencing: towards next-generation Ecotilling.通过汇集式多重新一代测序进行大规模稀有变异体检测:迈向新一代生态基因分型。
Plant J. 2011 Aug;67(4):736-45. doi: 10.1111/j.1365-313X.2011.04627.x. Epub 2011 Jul 11.
8
Rare-variant genome-wide association studies: a new frontier in genetic analysis of complex traits.罕见变异全基因组关联研究:复杂性状遗传分析的新领域。
Pharmacogenomics. 2013 Mar;14(4):413-24. doi: 10.2217/pgs.13.36.
9
Family-based exome-sequencing approach identifies rare susceptibility variants for lithium-responsive bipolar disorder.基于家系的外显子组测序方法鉴定出锂反应性双相情感障碍的罕见易感变异。
Genome. 2013 Oct;56(10):634-40. doi: 10.1139/gen-2013-0081. Epub 2013 Sep 17.
10
GWASeq: targeted re-sequencing follow up to GWAS.GWASeq:全基因组关联研究的靶向重测序后续研究。
BMC Genomics. 2016 Mar 3;17:176. doi: 10.1186/s12864-016-2459-y.

引用本文的文献

1
Multi-ancestry GWAS reveals loci linked to human variation in LINE-1- and Alu-insertion numbers.多祖先全基因组关联研究揭示与人类LINE-1和Alu插入数量变异相关的基因座。
Transl Med Aging. 2025;9:25-40. doi: 10.1016/j.tma.2025.02.001. Epub 2025 Feb 13.
2
Genetic characterization and phylogenetic analysis of common house crows (Corvus splendens).家鸦(Corvus splendens)的遗传特征与系统发育分析。
Sci Rep. 2025 Feb 10;15(1):4871. doi: 10.1038/s41598-025-85207-8.
3
Low nucleotide diversity of the Plasmodium falciparum AP2-EXP2 gene among clinical samples from Ghana.加纳临床样本中恶性疟原虫 AP2-EXP2 基因的核苷酸多样性较低。
Parasit Vectors. 2024 Nov 5;17(1):453. doi: 10.1186/s13071-024-06545-6.
4
Multi-ancestry GWAS reveals loci linked to human variation in LINE-1- and Alu-insertion numbers.多谱系全基因组关联研究揭示与人类LINE-1和Alu插入数量变异相关的基因座。
bioRxiv. 2025 Jan 27:2024.09.10.612283. doi: 10.1101/2024.09.10.612283.
5
Whole genome sequencing reveals population diversity and variation in HIV-1 specific host genes.全基因组测序揭示了HIV-1特异性宿主基因的群体多样性和变异。
Front Genet. 2023 Dec 20;14:1290624. doi: 10.3389/fgene.2023.1290624. eCollection 2023.
6
P2X2 receptor subunit interfaces are missense variant hotspots, where mutations tend to increase apparent ATP affinity.P2X2 受体亚基界面是错义变异热点,突变往往会增加明显的 ATP 亲和力。
Br J Pharmacol. 2022 Jul;179(14):3859-3874. doi: 10.1111/bph.15830. Epub 2022 Mar 29.
7
CRL4-Cereblon complex in Thalidomide Embryopathy: a translational investigation.CRL4-Cereblon 复合物在沙利度胺胚胎病中的作用:一项转化研究。
Sci Rep. 2020 Jan 21;10(1):851. doi: 10.1038/s41598-020-57512-x.
8
Allele balance bias identifies systematic genotyping errors and false disease associations.等位基因平衡偏倚可识别系统的基因分型错误和虚假的疾病关联。
Hum Mutat. 2019 Jan;40(1):115-126. doi: 10.1002/humu.23674. Epub 2018 Nov 23.
9
expression in mesencephalic neurons and characterization of a rare polymorphism associated with decreased risk of Parkinson's disease.中脑神经元中的表达以及与帕金森病风险降低相关的一种罕见多态性的特征
NPJ Parkinsons Dis. 2018 Aug 15;4:24. doi: 10.1038/s41531-018-0061-5. eCollection 2018.
10
Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome.从非洲侨民基因组中鉴定出针对非洲特有遗传变异的标记 SNP。
Sci Rep. 2017 Apr 21;7:46398. doi: 10.1038/srep46398.

本文引用的文献

1
Low concordance of multiple variant-calling pipelines: practical implications for exome and genome sequencing.多种变异calling 管道一致性低:外显子组和基因组测序的实际影响。
Genome Med. 2013 Mar 27;5(3):28. doi: 10.1186/gm432. eCollection 2013.
2
Recent explosive human population growth has resulted in an excess of rare genetic variants.最近人类人口的爆炸式增长导致了罕见遗传变异体的过剩。
Science. 2012 May 11;336(6082):740-3. doi: 10.1126/science.1217283.
3
Five years of GWAS discovery.GWAS 发现的五年。
Am J Hum Genet. 2012 Jan 13;90(1):7-24. doi: 10.1016/j.ajhg.2011.11.029.
4
Rare-variant association testing for sequencing data with the sequence kernel association test.基于序列核关联检验的测序数据罕见变异关联分析
Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.
5
A framework for variation discovery and genotyping using next-generation DNA sequencing data.利用下一代 DNA 测序数据进行变异发现和基因分型的框架。
Nat Genet. 2011 May;43(5):491-8. doi: 10.1038/ng.806. Epub 2011 Apr 10.
6
Microdeletions of 3q29 confer high risk for schizophrenia.3q29 微缺失使精神分裂症风险增高。
Am J Hum Genet. 2010 Aug 13;87(2):229-36. doi: 10.1016/j.ajhg.2010.07.013.
7
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
8
On the number of segregating sites in genetical models without recombination.关于无重组遗传模型中的分离位点数。
Theor Popul Biol. 1975 Apr;7(2):256-76. doi: 10.1016/0040-5809(75)90020-9.