• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

下一代测序的关联研究。

Association studies for next-generation sequencing.

机构信息

Human Genetics Center, University of Texas School of Public Health, Houston, TX 77030, USA.

出版信息

Genome Res. 2011 Jul;21(7):1099-108. doi: 10.1101/gr.115998.110. Epub 2011 Apr 26.

DOI:10.1101/gr.115998.110
PMID:21521787
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3129252/
Abstract

Genome-wide association studies (GWAS) have become the primary approach for identifying genes with common variants influencing complex diseases. Despite considerable progress, the common variations identified by GWAS account for only a small fraction of disease heritability and are unlikely to explain the majority of phenotypic variations of common diseases. A potential source of the missing heritability is the contribution of rare variants. Next-generation sequencing technologies will detect millions of novel rare variants, but these technologies have three defining features: identification of a large number of rare variants, a high proportion of sequence errors, and a large proportion of missing data. These features raise challenges for testing the association of rare variants with phenotypes of interest. In this study, we use a genome continuum model and functional principal components as a general principle for developing novel and powerful association analysis methods designed for resequencing data. We use simulations to calculate the type I error rates and the power of nine alternative statistics: two functional principal component analysis (FPCA)-based statistics, the multivariate principal component analysis (MPCA)-based statistic, the weighted sum (WSS), the variable-threshold (VT) method, the generalized T(2), the collapsing method, the CMC method, and individual tests. We also examined the impact of sequence errors on their type I error rates. Finally, we apply the nine statistics to the published resequencing data set from ANGPTL4 in the Dallas Heart Study. We report that FPCA-based statistics have a higher power to detect association of rare variants and a stronger ability to filter sequence errors than the other seven methods.

摘要

全基因组关联研究(GWAS)已成为鉴定常见变异影响复杂疾病的基因的主要方法。尽管取得了相当大的进展,但 GWAS 鉴定的常见变异仅占疾病遗传率的一小部分,不太可能解释常见疾病表型变异的大多数。遗传缺失的一个潜在来源是稀有变异的贡献。下一代测序技术将检测到数百万种新的稀有变异,但这些技术具有三个定义特征:大量稀有变异的识别、高比例的序列错误和大量缺失数据。这些特征为测试稀有变异与感兴趣的表型之间的关联带来了挑战。在这项研究中,我们使用基因组连续体模型和功能主成分作为开发针对重测序数据的新型强大关联分析方法的一般原则。我们使用模拟来计算 9 种替代统计量的Ⅰ型错误率和功效:两种基于功能主成分分析(FPCA)的统计量、基于多元主成分分析(MPCA)的统计量、加权和(WSS)、变量阈值(VT)方法、广义 T(2)、合并方法、CMC 方法和个体检验。我们还研究了序列错误对其Ⅰ型错误率的影响。最后,我们将这 9 种统计方法应用于达拉斯心脏研究中发表的 ANGPTL4 重测序数据集。我们报告称,基于 FPCA 的统计量在检测稀有变异的关联方面具有更高的功效,并且比其他七种方法具有更强的过滤序列错误的能力。

相似文献

1
Association studies for next-generation sequencing.下一代测序的关联研究。
Genome Res. 2011 Jul;21(7):1099-108. doi: 10.1101/gr.115998.110. Epub 2011 Apr 26.
2
A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.一种基于基因组信息含量的新型统计方法,用于针对下一代测序数据的全基因组关联分析。
J Comput Biol. 2012 Jun;19(6):731-44. doi: 10.1089/cmb.2012.0035. Epub 2012 May 31.
3
Family-based association studies for next-generation sequencing.基于家系的下一代测序关联研究。
Am J Hum Genet. 2012 Jun 8;90(6):1028-45. doi: 10.1016/j.ajhg.2012.04.022.
4
Smoothed functional principal component analysis for testing association of the entire allelic spectrum of genetic variation.平滑功能主成分分析检验全等位基因谱遗传变异的关联。
Eur J Hum Genet. 2013 Feb;21(2):217-24. doi: 10.1038/ejhg.2012.141. Epub 2012 Jul 11.
5
A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions.一种用于分析下一代测序数据的新自适应方法,用于检测由于基因主效应和相互作用而导致的复杂性状关联的罕见变异体。
PLoS Genet. 2010 Oct 14;6(10):e1001156. doi: 10.1371/journal.pgen.1001156.
6
Weighted pedigree-based statistics for testing the association of rare variants.基于加权家系的统计方法用于检验罕见变异的关联。
BMC Genomics. 2012 Nov 24;13:667. doi: 10.1186/1471-2164-13-667.
7
A fast and noise-resilient approach to detect rare-variant associations with deep sequencing data for complex disorders.一种快速且抗噪的方法,用于检测复杂疾病的深度测序数据中的罕见变异关联。
Genet Epidemiol. 2012 Nov;36(7):675-85. doi: 10.1002/gepi.21662. Epub 2012 Aug 3.
8
A permutation method for detecting trend correlations in rare variant association studies.一种用于检测罕见变异关联研究中趋势相关性的排列方法。
Genet Res (Camb). 2019 Dec 13;101:e13. doi: 10.1017/S0016672319000120.
9
A weighted U-statistic for genetic association analyses of sequencing data.一种用于测序数据遗传关联分析的加权U统计量。
Genet Epidemiol. 2014 Dec;38(8):699-708. doi: 10.1002/gepi.21864. Epub 2014 Oct 20.
10
An evolutionary framework for association testing in resequencing studies.重测序研究中关联测试的进化框架。
PLoS Genet. 2010 Nov 11;6(11):e1001202. doi: 10.1371/journal.pgen.1001202.

引用本文的文献

1
An overview of recent technological developments in bovine genomics.牛基因组学近期技术发展概述。
Vet Anim Sci. 2024 Jul 23;25:100382. doi: 10.1016/j.vas.2024.100382. eCollection 2024 Sep.
2
Next-Generation Sequencing Data-Based Association Testing of a Group of Genetic Markers for Complex Responses Using a Generalized Linear Model Framework.使用广义线性模型框架基于下一代测序数据对一组遗传标记进行复杂反应的关联测试。
Mathematics (Basel). 2023 Jun 1;11(11). doi: 10.3390/math11112560. Epub 2023 Jun 2.
3
A tree-based gene-environment interaction analysis with rare features.一种基于树的具有罕见特征的基因-环境相互作用分析。
Stat Anal Data Min. 2022 Oct;15(5):648-674. doi: 10.1002/sam.11578. Epub 2022 Mar 1.
4
VIVID: A Web Application for Variant Interpretation and Visualization in Multi-dimensional Analyses.VIVID:一个用于多维分析中变体解释和可视化的 Web 应用程序。
Mol Biol Evol. 2022 Sep 1;39(9). doi: 10.1093/molbev/msac196.
5
Protein Sequencing, One Molecule at a Time.逐个分子进行蛋白质测序。
Annu Rev Biophys. 2022 May 9;51:181-200. doi: 10.1146/annurev-biophys-102121-103615. Epub 2022 Jan 5.
6
A Multi-Marker Test for Analyzing Paired Genetic Data in Transplantation.一种用于分析移植中配对遗传数据的多标记测试。
Front Genet. 2021 Oct 13;12:745773. doi: 10.3389/fgene.2021.745773. eCollection 2021.
7
Gene-Based Association Testing of Dichotomous Traits With Generalized Functional Linear Mixed Models Using Extended Pedigrees: Applications to Age-Related Macular Degeneration.使用扩展家系的广义功能线性混合模型对二分性状进行基于基因的关联测试:年龄相关性黄斑变性的应用。
J Am Stat Assoc. 2021;116(534):531-545. doi: 10.1080/01621459.2020.1799809. Epub 2020 Jul 28.
8
Integrative functional linear model for genome-wide association studies with multiple traits.基于多种性状的全基因组关联研究的综合功能线性模型。
Biostatistics. 2022 Apr 13;23(2):574-590. doi: 10.1093/biostatistics/kxaa043.
9
Genomic, proteomic, and systems biology approaches in biomarker discovery for multiple sclerosis.基因组学、蛋白质组学和系统生物学方法在多发性硬化症生物标志物发现中的应用。
Cell Immunol. 2020 Dec;358:104219. doi: 10.1016/j.cellimm.2020.104219. Epub 2020 Sep 20.
10
Adaptive Fisher method detects dense and sparse signals in association analysis of SNV sets.自适应 Fisher 方法可用于检测 SNV 集关联分析中的密集和稀疏信号。
BMC Med Genomics. 2020 Apr 3;13(Suppl 5):46. doi: 10.1186/s12920-020-0684-3.

本文引用的文献

1
To identify associations with rare variants, just WHaIT: Weighted haplotype and imputation-based tests.为了鉴定罕见变异的关联,只需 WHaIT:加权单体型和基于推断的检验。
Am J Hum Genet. 2010 Nov 12;87(5):728-35. doi: 10.1016/j.ajhg.2010.10.014. Epub 2010 Nov 4.
2
Statistical analysis strategies for association studies involving rare variants.关联研究中罕见变异体的统计分析策略。
Nat Rev Genet. 2010 Nov;11(11):773-85. doi: 10.1038/nrg2867. Epub 2010 Oct 13.
3
Pooled association tests for rare variants in exon-resequencing studies.外显子重测序研究中罕见变异的合并关联分析。
Am J Hum Genet. 2010 Jun 11;86(6):832-8. doi: 10.1016/j.ajhg.2010.04.005. Epub 2010 May 13.
4
Accurate detection and genotyping of SNPs utilizing population sequencing data.利用群体测序数据进行 SNP 的精确检测和基因分型。
Genome Res. 2010 Apr;20(4):537-45. doi: 10.1101/gr.100040.109. Epub 2010 Feb 11.
5
Rare variants create synthetic genome-wide associations.罕见变异导致全基因组关联合成。
PLoS Biol. 2010 Jan 26;8(1):e1000294. doi: 10.1371/journal.pbio.1000294.
6
Population genetic inference from genomic sequence variation.从基因组序列变异推断种群遗传
Genome Res. 2010 Mar;20(3):291-300. doi: 10.1101/gr.079509.108. Epub 2010 Jan 12.
7
Detecting rare variants for complex traits using family and unrelated data.利用家系和无关数据检测复杂性状的罕见变异。
Genet Epidemiol. 2010 Feb;34(2):171-87. doi: 10.1002/gepi.20449.
8
Finding the missing heritability of complex diseases.寻找复杂疾病中缺失的遗传力。
Nature. 2009 Oct 8;461(7265):747-53. doi: 10.1038/nature08494.
9
Common vs. rare allele hypotheses for complex diseases.复杂疾病的常见等位基因与罕见等位基因假说
Curr Opin Genet Dev. 2009 Jun;19(3):212-9. doi: 10.1016/j.gde.2009.04.010. Epub 2009 May 28.
10
Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.全基因组关联位点对人类疾病和性状的潜在病因学及功能影响。
Proc Natl Acad Sci U S A. 2009 Jun 9;106(23):9362-7. doi: 10.1073/pnas.0903103106. Epub 2009 May 27.