• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

下一代测序对关联研究的影响。

Implication of next-generation sequencing on association studies.

机构信息

MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China.

出版信息

BMC Genomics. 2011 Jun 17;12:322. doi: 10.1186/1471-2164-12-322.

DOI:10.1186/1471-2164-12-322
PMID:21682891
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3148210/
Abstract

BACKGROUND

Next-generation sequencing technologies can effectively detect the entire spectrum of genomic variation and provide a powerful tool for systematic exploration of the universe of common, low frequency and rare variants in the entire genome. However, the current paradigm for genome-wide association studies (GWAS) is to catalogue and genotype common variants (5% < MAF). The methods and study design for testing the association of low frequency (0.5% < MAF ≤ 5%) and rare variation (MAF ≤ 0.5%) have not been thoroughly investigated. The 1000 Genomes Project represents one such endeavour to characterize the human genetic variation pattern at the MAF = 1% level as a foundation for association studies. In this report, we explore different strategies and study designs for the near future GWAS in the post-era, based on both low coverage pilot data and exon pilot data in 1000 Genomes Project.

RESULTS

We investigated the linkage disequilibrium (LD) pattern among common and low frequency SNPs and its implication for association studies. We found that the LD between low frequency alleles and low frequency alleles, and low frequency alleles and common alleles are much weaker than the LD between common and common alleles. We examined various tagging designs with and without statistical imputation approaches and compare their power against de novo resequencing in mapping causal variants under various disease models. We used the low coverage pilot data which contain ~14 M SNPs as a hypothetical genotype-array platform (Pilot 14 M) to interrogate its impact on the selection of tag SNPs, mapping coverage and power of association tests. We found that even after imputation we still observed 45.4% of low frequency SNPs which were untaggable and only 67.7% of the low frequency variation was covered by the Pilot 14 M array.

CONCLUSIONS

This suggested GWAS based on SNP arrays would be ill-suited for association studies of low frequency variation.

摘要

背景

下一代测序技术可以有效地检测基因组变异的全貌,为系统探索全基因组常见、低频和稀有变异提供了强大的工具。然而,目前全基因组关联研究(GWAS)的范例是对常见变异(MAF<5%)进行编目和基因分型。低频(0.5%<MAF≤5%)和稀有变异(MAF≤0.5%)关联检验的方法和研究设计尚未得到彻底研究。1000 基因组计划(The 1000 Genomes Project)就是这样一个旨在以 MAF=1%的水平描述人类遗传变异模式的项目,作为关联研究的基础。在本报告中,我们基于 1000 基因组计划中的低覆盖率试点数据和外显子试点数据,探索了后 GWAS 时代的不同策略和研究设计。

结果

我们研究了常见和低频 SNPs 之间的连锁不平衡(LD)模式及其对关联研究的影响。我们发现,低频等位基因与低频等位基因之间,以及低频等位基因与常见等位基因之间的 LD 比常见等位基因与常见等位基因之间的 LD 弱得多。我们检查了各种带有和不带有统计推断方法的标记设计,并比较了它们在各种疾病模型下对因果变异进行映射的能力与从头重测序。我们使用包含约 1400 万个 SNPs 的低覆盖率试点数据作为假设的基因型阵列平台(Pilot 14 M),研究其对标记 SNP 选择、映射覆盖和关联检验功效的影响。我们发现,即使在推断之后,我们仍然观察到 45.4%的低频 SNP 是无法标记的,而且 Pilot 14 M 阵列只覆盖了 67.7%的低频变异。

结论

这表明基于 SNP 阵列的 GWAS 不适合低频变异的关联研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/dadc50eecaca/1471-2164-12-322-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/39fa99b8e71e/1471-2164-12-322-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/66ee1e169b52/1471-2164-12-322-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/92a8a8c0aeeb/1471-2164-12-322-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/c9b1ef8cafdd/1471-2164-12-322-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/983241478bef/1471-2164-12-322-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/dadc50eecaca/1471-2164-12-322-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/39fa99b8e71e/1471-2164-12-322-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/66ee1e169b52/1471-2164-12-322-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/92a8a8c0aeeb/1471-2164-12-322-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/c9b1ef8cafdd/1471-2164-12-322-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/983241478bef/1471-2164-12-322-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b454/3148210/dadc50eecaca/1471-2164-12-322-6.jpg

相似文献

1
Implication of next-generation sequencing on association studies.下一代测序对关联研究的影响。
BMC Genomics. 2011 Jun 17;12:322. doi: 10.1186/1471-2164-12-322.
2
A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.一种基于基因组信息含量的新型统计方法,用于针对下一代测序数据的全基因组关联分析。
J Comput Biol. 2012 Jun;19(6):731-44. doi: 10.1089/cmb.2012.0035. Epub 2012 May 31.
3
Quantifying the mapping precision of genome-wide association studies using whole-genome sequencing data.利用全基因组测序数据量化全基因组关联研究的定位精度
Genome Biol. 2017 May 16;18(1):86. doi: 10.1186/s13059-017-1216-0.
4
Design considerations for genetic linkage and association studies.基因连锁与关联研究的设计考量
Methods Mol Biol. 2012;850:237-62. doi: 10.1007/978-1-61779-555-8_13.
5
Two-phase designs to follow-up genome-wide association signals with DNA resequencing studies.采用两阶段设计对全基因组关联信号进行 DNA 重测序研究的随访。
Genet Epidemiol. 2013 Apr;37(3):229-38. doi: 10.1002/gepi.21708. Epub 2013 Jan 24.
6
On optimal pooling designs to identify rare variants through massive resequencing.通过大规模重测序鉴定罕见变异的最优合并设计。
Genet Epidemiol. 2011 Apr;35(3):139-47. doi: 10.1002/gepi.20561. Epub 2011 Jan 19.
7
Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women's Health Initiative.使用来自妇女健康倡议的约 4000 个非洲裔美国人的研究特定参考面板对 Metabochip SNPs 进行基因型推断。
Genet Epidemiol. 2012 Feb;36(2):107-17. doi: 10.1002/gepi.21603.
8
High imputation accuracy from informative low-to-medium density single nucleotide polymorphism genotypes is achievable in sheep1.在绵羊中,信息量较低且中等密度的单核苷酸多态性基因型也能实现高的插补准确性。1
J Anim Sci. 2019 Apr 3;97(4):1550-1567. doi: 10.1093/jas/skz043.
9
Imputation-based genomic coverage assessments of current human genotyping arrays.基于插补法的当前人类基因分型阵列的基因组覆盖度评估
G3 (Bethesda). 2013 Oct 3;3(10):1795-807. doi: 10.1534/g3.113.007161.
10
Imputation-based assessment of next generation rare exome variant arrays.基于插补法的新一代罕见外显子变异阵列评估
Pac Symp Biocomput. 2014:241-52.

引用本文的文献

1
Rare Coding Variants Associated with Breast Cancer.与乳腺癌相关的罕见编码变异。
Adv Exp Med Biol. 2021;1187:435-453. doi: 10.1007/978-981-32-9620-6_23.
2
Optimization of Genotype by Sequencing data for phylogenetic purposes.为系统发育目的对测序数据进行基因型优化。
MethodsX. 2020 Apr 20;7:100892. doi: 10.1016/j.mex.2020.100892. eCollection 2020.
3
Cytotoxicity and Transcriptomic Analysis of Silver Nanoparticles in Mouse Embryonic Fibroblast Cells.银纳米颗粒对小鼠胚胎成纤维细胞的细胞毒性及转录组学分析。

本文引用的文献

1
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
2
Ten years of genetics and genomics: what have we achieved and where are we heading?十年遗传学和基因组学:我们取得了哪些成就,未来的方向在哪里?
Nat Rev Genet. 2010 Oct;11(10):723-33. doi: 10.1038/nrg2878. Epub 2010 Sep 7.
3
Integrating common and rare genetic variation in diverse human populations.整合不同人类群体中的常见和罕见遗传变异。
Int J Mol Sci. 2018 Nov 16;19(11):3618. doi: 10.3390/ijms19113618.
4
Hierarchical Assembly of RNA Three-Dimensional Structures Based on Loop Templates.基于环模板的 RNA 三维结构的分级组装。
J Phys Chem B. 2018 May 31;122(21):5327-5335. doi: 10.1021/acs.jpcb.7b10102. Epub 2018 Jan 8.
5
Rare variants in optic disc area gene enriched in primary open-angle glaucoma.视盘区域基因中的罕见变异在原发性开角型青光眼中富集。
Mol Genet Genomic Med. 2016 Oct 3;4(6):624-633. doi: 10.1002/mgg3.248. eCollection 2016 Nov.
6
Transcriptomic analysis of human breast cancer cells reveals differentially expressed genes and related cellular functions and pathways in response to gold nanorods.对人类乳腺癌细胞的转录组分析揭示了响应金纳米棒时差异表达的基因以及相关的细胞功能和信号通路。
Biophys Rep. 2015;1:106-114. doi: 10.1007/s41048-015-0005-0. Epub 2015 Aug 21.
7
Advances in the Pharmacogenomics of Adverse Drug Reactions.药物不良反应的药物基因组学进展
Drug Saf. 2016 Jan;39(1):15-27. doi: 10.1007/s40264-015-0367-8.
8
Harvesting the promising fruits of genomics: applying genome sequencing technologies to crop breeding.收获基因组学的丰硕成果:将基因组测序技术应用于作物育种。
PLoS Biol. 2014 Jun 10;12(6):e1001883. doi: 10.1371/journal.pbio.1001883. eCollection 2014 Jun.
9
Feature selection methods for identifying genetic determinants of host species in RNA viruses.用于鉴定 RNA 病毒宿主物种遗传决定因素的特征选择方法。
PLoS Comput Biol. 2013;9(10):e1003254. doi: 10.1371/journal.pcbi.1003254. Epub 2013 Oct 10.
10
A systematic review of cancer GWAS and candidate gene meta-analyses reveals limited overlap but similar effect sizes.一项癌症 GWAS 和候选基因荟萃分析的系统回顾显示,两者的重叠有限,但效应大小相似。
Eur J Hum Genet. 2014 Mar;22(3):402-8. doi: 10.1038/ejhg.2013.161. Epub 2013 Jul 24.
Nature. 2010 Sep 2;467(7311):52-8. doi: 10.1038/nature09298.
4
Uncovering the roles of rare variants in common disease through whole-genome sequencing.通过全基因组测序揭示常见疾病中罕见变异的作用。
Nat Rev Genet. 2010 Jun;11(6):415-25. doi: 10.1038/nrg2779.
5
Exome sequencing of a multigenerational human pedigree.外显子组测序一个多代人家系。
PLoS One. 2009 Dec 14;4(12):e8232. doi: 10.1371/journal.pone.0008232.
6
Genotype imputation.基因型推算
Annu Rev Genomics Hum Genet. 2009;10:387-406. doi: 10.1146/annurev.genom.9.081307.164242.
7
Next-generation DNA sequencing.下一代DNA测序
Nat Biotechnol. 2008 Oct;26(10):1135-45. doi: 10.1038/nbt1486.
8
Estimating coverage and power for genetic association studies using near-complete variation data.利用近乎完整的变异数据估计基因关联研究的覆盖率和效能。
Nat Genet. 2008 Jul;40(7):841-3. doi: 10.1038/ng.180. Epub 2008 Jun 22.
9
Linkage disequilibrium--understanding the evolutionary past and mapping the medical future.连锁不平衡——了解进化历程与描绘医学未来
Nat Rev Genet. 2008 Jun;9(6):477-85. doi: 10.1038/nrg2361.
10
Genome-wide association studies for complex traits: consensus, uncertainty and challenges.复杂性状的全基因组关联研究:共识、不确定性与挑战。
Nat Rev Genet. 2008 May;9(5):356-69. doi: 10.1038/nrg2344.